Semantic caching service for multilingual LLM queries, built on Redis vector search and EmbeddingGemma embeddings (served via Ollama). Supports Matryoshka embedding dimensions (768/512/256/128), so deployments can trade retrieval quality against storage.
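
As a rough illustration of the Matryoshka trade-off, the sketch below truncates a full 768-dimension embedding to one of the smaller sizes and rescales it to unit length so cosine similarity stays meaningful. The names `truncate_embedding`, `cosine_similarity`, and `storage_bytes` are hypothetical helpers and not part of this service's API, and the 4-bytes-per-value figure assumes vectors are stored as float32, which may not match the actual index configuration.

```python
import math

# Dimensions listed in the description above.
SUPPORTED_DIMS = (768, 512, 256, 128)


def truncate_embedding(vec, dim):
    """Keep the first `dim` components of `vec` and rescale to unit length.

    Matryoshka-style embeddings are trained so that leading components
    carry most of the information, which is why a prefix can be used alone.
    """
    if dim not in SUPPORTED_DIMS:
        raise ValueError(f"dim must be one of {SUPPORTED_DIMS}, got {dim}")
    if len(vec) < dim:
        raise ValueError(f"vector has {len(vec)} components, need at least {dim}")
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in head]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def storage_bytes(dim, bytes_per_value=4):
    """Raw vector size per cached entry (assumes float32 by default)."""
    return dim * bytes_per_value


if __name__ == "__main__":
    # Print the per-entry vector size for each supported dimension.
    for d in SUPPORTED_DIMS:
        print(d, storage_bytes(d))
```

Under the float32 assumption, moving from 768 to 128 dimensions reduces the raw vector size per cached entry by a factor of six, at the cost of some retrieval quality. Query and cached vectors must be truncated to the same dimension before they are compared.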