o10Last updated 2026-06-09

Inference pricing matrix

Provider × model pricing with venue comparison — June 2026 benchmarks.

Pricing pages show $/1M token rates across venues so teams can compare gateway, committed, and open-weight economics — o10 routes to the cheapest compliant option per use case.

Spread observed
638×
Routing modes
shadow → enforce
Framework
KYI
Dashboards observe.
o10 enforces.

Every page in this index follows the same structure as the home site — answer-first, passage blocks, operational steps, and expanded FAQs.

Start hereQuick overview

How to use this index

What is the o10 pricing matrix?

Per-provider, per-model pages with gateway, committed, and open-weight $/1M comparisons — structured for table snippets and LLM extraction.

Venue survey June 2026 — State of Inference Spend report.

FAQFrequently asked questions

Common questions

Are these official provider prices?

Figures are approximate June 2026 survey benchmarks for comparison — always verify against current provider list prices before procurement decisions.

Why venue columns?

The same model class costs different $/1M on gateway, committed capacity, and self-hosted open-weight — routing economics depend on venue mix.

o10Set the envelope. o10 holds it.

See what you're overpaying.

Paste a week of traffic. Get the number that books the audit.

See what you're overpaying