o10

Last updated 2026-06-14

Embedding models

24 embedding models in the unified inference model catalog.

Browse embedding models with gateway pricing and eval-gated routing notes.

Dashboards observe.
o10 enforces.

Every page in this index follows the same structure as the home site: answer-first summaries, passage blocks, operational steps, and expanded FAQs. o10 State of Inference Spend 2026 found a compliant price spread of up to 638× across venues for identical workloads.

Start here: Quick overview

How to use this index

What embedding models are in the catalog?

24 embedding models listed with pricing and endpoint data.


FAQ: Frequently asked questions

Common questions

How many embedding models?

24 in the gateway catalog snapshot.

How does o10 route embedding workloads?

Route each workload to the cheapest compliant model tier after shadow-mode proof. The reason: o10 State of Inference Spend 2026 found a compliant price spread of up to 638× across venues for identical workloads.
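As a rough illustration of the routing rule above, the sketch below picks the lowest-priced offer that is both compliant and has passed a shadow-mode check, and computes the price spread across compliant offers. It is not o10's implementation: the `Offer` fields, the function names, and the idea that compliance and shadow-mode results arrive as simple booleans are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Offer:
    """One hypothetical catalog entry; field names are illustrative only."""
    model: str
    venue: str
    usd_per_million_tokens: float
    compliant: bool       # assumed flag: meets the workload's policy constraints
    shadow_passed: bool   # assumed flag: met quality targets in shadow mode


def cheapest_compliant(offers: list[Offer]) -> Optional[Offer]:
    """Return the lowest-priced offer that is compliant and shadow-proven."""
    eligible = [o for o in offers if o.compliant and o.shadow_passed]
    # default=None avoids a ValueError when nothing is eligible.
    return min(eligible, key=lambda o: o.usd_per_million_tokens, default=None)


def compliant_price_spread(offers: list[Offer]) -> Optional[float]:
    """Return max/min price across compliant offers, or None if undefined."""
    prices = [o.usd_per_million_tokens for o in offers if o.compliant]
    if not prices or min(prices) <= 0:
        return None
    return max(prices) / min(prices)


if __name__ == "__main__":
    # Made-up prices and names, chosen only to exercise the functions.
    sample = [
        Offer("embed-a", "venue-1", 0.5, compliant=True, shadow_passed=True),
        Offer("embed-b", "venue-2", 0.25, compliant=True, shadow_passed=False),
        Offer("embed-c", "venue-3", 0.125, compliant=False, shadow_passed=True),
        Offer("embed-d", "venue-4", 2.0, compliant=True, shadow_passed=True),
    ]
    choice = cheapest_compliant(sample)
    print(choice.model if choice else "no eligible offer")
    print(compliant_price_spread(sample))
```

In this sample, `embed-b` is cheaper than `embed-a` but is skipped because it has not passed shadow mode, and `embed-c` is excluded as non-compliant. Keeping eligibility as a filter ahead of the price comparison means a cheaper but unproven offer can never win on price alone.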

Where is catalog pricing sourced?

Pricing comes from the unified inference gateway catalog snapshot. Confirm live rates with your provider before switching to enforce mode.

Set the envelope. o10 holds it.

See what you're overpaying.

Paste a week of traffic. Get the number that books the audit.
