Loading…
Loading…
Written by Max Zeshut
Founder at Agentmelt
A specialized AI model that converts text, images, or other data into dense numerical vectors (embeddings) that capture semantic meaning. Embedding models are distinct from LLMs—they don't generate text but instead create representations used for search, similarity matching, and clustering. Common embedding models include OpenAI's text-embedding-3, Cohere Embed, and open-source options like BGE and E5. Choosing the right embedding model affects RAG quality, search accuracy, and agent performance.
A legal agent uses an embedding model to convert 10,000 contract clauses into vectors. When a lawyer asks 'Find all clauses related to intellectual property assignment,' the embedding model converts the query into a vector and finds the 15 most semantically similar clauses—even those that use different terminology like 'IP transfer' or 'work product ownership.'