What embedding models are in the catalog?
24 embedding models listed with pricing and endpoint data.
o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads.
24 embedding models in the unified inference model catalog.
Browse embedding models with gateway pricing and eval-gated routing notes.
Every page in this index follows the same structure as the home site — answer-first, passage blocks, operational steps, and expanded FAQs. o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads.
24 embedding models listed with pricing and endpoint data.
o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads.
alibaba · alibaba/qwen3-embedding-0.6b
alibaba · alibaba/qwen3-embedding-4b
alibaba · alibaba/qwen3-embedding-8b
amazon · amazon/titan-embed-text-v2
cohere · cohere/embed-v4.0
google · google/gemini-embedding-001
google · google/gemini-embedding-2
google · google/text-embedding-005
google · google/text-multilingual-embedding-002
mistral · mistral/codestral-embed
mistral · mistral/mistral-embed
openai · openai/text-embedding-3-large
openai · openai/text-embedding-3-small
openai · openai/text-embedding-ada-002
voyage · voyage/voyage-3-large
voyage · voyage/voyage-3.5
voyage · voyage/voyage-3.5-lite
voyage · voyage/voyage-4
voyage · voyage/voyage-4-large
voyage · voyage/voyage-4-lite
voyage · voyage/voyage-code-2
voyage · voyage/voyage-code-3
voyage · voyage/voyage-finance-2
voyage · voyage/voyage-law-2
24 in the gateway catalog snapshot.
o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads. Route each workload to the cheapest compliant model tier after shadow-mode proof.
Unified inference gateway catalog snapshot — confirm live rates with your provider before enforce mode.
Paste a week of traffic. Get the number that books the audit.
See what you're overpaying →