o10Last updated 2026-06-14b

Text models

194 text models in the unified inference model catalog.

Browse text models with gateway pricing and eval-gated routing notes.

Dashboards observe.
o10 enforces.

Every page in this index follows the same structure as the home site — answer-first, passage blocks, operational steps, and expanded FAQs. o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads.

Start hereQuick overview

How to use this index

What text models are in the catalog?

194 text models listed with pricing and endpoint data.

o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads.

Text194 models

Qwen3-14B

alibaba · alibaba/qwen-3-14b

Qwen3 235B A22B

alibaba · alibaba/qwen-3-235b

Qwen3-30B-A3B

alibaba · alibaba/qwen-3-30b

Qwen 3 32B

alibaba · alibaba/qwen-3-32b

Qwen 3.6 Max Preview

alibaba · alibaba/qwen-3.6-max-preview

Qwen3 VL 235B A22B Thinking

alibaba · alibaba/qwen3-235b-a22b-thinking

Qwen3 Coder 480B A35B Instruct

alibaba · alibaba/qwen3-coder

Qwen 3 Coder 30B A3B Instruct

alibaba · alibaba/qwen3-coder-30b-a3b

Qwen3 Coder Next

alibaba · alibaba/qwen3-coder-next

Qwen3 Coder Plus

alibaba · alibaba/qwen3-coder-plus

Qwen3 Max

alibaba · alibaba/qwen3-max

Qwen3 Max Preview

alibaba · alibaba/qwen3-max-preview

Qwen 3 Max Thinking

alibaba · alibaba/qwen3-max-thinking

Qwen3 Next 80B A3B Instruct

alibaba · alibaba/qwen3-next-80b-a3b-instruct

Qwen3 Next 80B A3B Thinking

alibaba · alibaba/qwen3-next-80b-a3b-thinking

Qwen3 VL 235B A22B Instruct

alibaba · alibaba/qwen3-vl-235b-a22b-instruct

Qwen3 VL 235B A22B Instruct

alibaba · alibaba/qwen3-vl-instruct

Qwen3 VL 235B A22B Thinking

alibaba · alibaba/qwen3-vl-thinking

Qwen 3.5 Flash

alibaba · alibaba/qwen3.5-flash

Qwen 3.5 Plus

alibaba · alibaba/qwen3.5-plus

Qwen 3.6 27B

alibaba · alibaba/qwen3.6-27b

Qwen 3.6 Plus

alibaba · alibaba/qwen3.6-plus

Qwen 3.7 Max

alibaba · alibaba/qwen3.7-max

Qwen 3.7 Plus

alibaba · alibaba/qwen3.7-plus

Nova 2 Lite

amazon · amazon/nova-2-lite

Nova Lite

amazon · amazon/nova-lite

Nova Micro

amazon · amazon/nova-micro

Nova Pro

amazon · amazon/nova-pro

Claude 3 Haiku

anthropic · anthropic/claude-3-haiku

Claude 3.5 Haiku

anthropic · anthropic/claude-3.5-haiku

Claude Haiku 4.5

anthropic · anthropic/claude-haiku-4.5

Claude Opus 4

anthropic · anthropic/claude-opus-4

Claude Opus 4.1

anthropic · anthropic/claude-opus-4.1

Claude Opus 4.5

anthropic · anthropic/claude-opus-4.5

Claude Opus 4.6

anthropic · anthropic/claude-opus-4.6

Claude Opus 4.7

anthropic · anthropic/claude-opus-4.7

Claude Opus 4.8

anthropic · anthropic/claude-opus-4.8

Claude Sonnet 4

anthropic · anthropic/claude-sonnet-4

Claude Sonnet 4.5

anthropic · anthropic/claude-sonnet-4.5

Claude Sonnet 4.6

anthropic · anthropic/claude-sonnet-4.6

Trinity Large Preview

arcee-ai · arcee-ai/trinity-large-preview

Trinity Large Thinking

arcee-ai · arcee-ai/trinity-large-thinking

Trinity Mini

arcee-ai · arcee-ai/trinity-mini

Seed 1.6

bytedance · bytedance/seed-1.6

Bytedance Seed 1.8

bytedance · bytedance/seed-1.8

Command A

cohere · cohere/command-a

DeepSeek-R1

deepseek · deepseek/deepseek-r1

DeepSeek V3 0324

deepseek · deepseek/deepseek-v3

DeepSeek V3.1

deepseek · deepseek/deepseek-v3.1

DeepSeek V3.1 Terminus

deepseek · deepseek/deepseek-v3.1-terminus

DeepSeek V3.2

deepseek · deepseek/deepseek-v3.2

DeepSeek V3.2 Thinking

deepseek · deepseek/deepseek-v3.2-thinking

DeepSeek V4 Flash

deepseek · deepseek/deepseek-v4-flash

DeepSeek V4 Pro

deepseek · deepseek/deepseek-v4-pro

Gemini 2.5 Flash

google · google/gemini-2.5-flash

Nano Banana (Gemini 2.5 Flash Image)

google · google/gemini-2.5-flash-image

Gemini 2.5 Flash Lite

google · google/gemini-2.5-flash-lite

Gemini 2.5 Pro

google · google/gemini-2.5-pro

Gemini 3 Flash

google · google/gemini-3-flash

Nano Banana Pro (Gemini 3 Pro Image)

google · google/gemini-3-pro-image

Gemini 3 Pro Preview

google · google/gemini-3-pro-preview

Gemini 3.1 Flash Image (Nano Banana 2)

google · google/gemini-3.1-flash-image

Gemini 3.1 Flash Image Preview (Nano Banana 2)

google · google/gemini-3.1-flash-image-preview

Gemini 3.1 Flash Lite

google · google/gemini-3.1-flash-lite

Gemini 3.1 Flash Lite Preview

google · google/gemini-3.1-flash-lite-preview

Gemini 3.1 Pro Preview

google · google/gemini-3.1-pro-preview

Gemini 3.5 Flash

google · google/gemini-3.5-flash

Gemma 4 26B A4B IT

google · google/gemma-4-26b-a4b-it

Gemma 4 31B IT

google · google/gemma-4-31b-it

Mercury 2

inception · inception/mercury-2

Mercury Coder Small Beta

inception · inception/mercury-coder-small

Interfaze Beta

interfaze · interfaze/interfaze-beta

KAT-Coder-Pro V1

kwaipilot · kwaipilot/kat-coder-pro-v1

Kat Coder Pro V2

kwaipilot · kwaipilot/kat-coder-pro-v2

LongCat Flash Chat

meituan · meituan/longcat-flash-chat

LongCat Flash Thinking 2601

meituan · meituan/longcat-flash-thinking-2601

Llama 3.1 70B Instruct

meta · meta/llama-3.1-70b

Llama 3.1 8B Instruct

meta · meta/llama-3.1-8b

Llama 3.2 11B Vision Instruct

meta · meta/llama-3.2-11b

Llama 3.2 1B Instruct

meta · meta/llama-3.2-1b

Llama 3.2 3B Instruct

meta · meta/llama-3.2-3b

Llama 3.2 90B Vision Instruct

meta · meta/llama-3.2-90b

Llama 3.3 70B Instruct

meta · meta/llama-3.3-70b

Llama 4 Maverick 17B Instruct

meta · meta/llama-4-maverick

Llama 4 Scout 17B Instruct

meta · meta/llama-4-scout

MiniMax M2

minimax · minimax/minimax-m2

MiniMax M2.1

minimax · minimax/minimax-m2.1

MiniMax M2.1 Lightning

minimax · minimax/minimax-m2.1-lightning

MiniMax M2.5

minimax · minimax/minimax-m2.5

MiniMax M2.5 High Speed

minimax · minimax/minimax-m2.5-highspeed

MiniMax M2.7

minimax · minimax/minimax-m2.7

MiniMax M2.7 High Speed

minimax · minimax/minimax-m2.7-highspeed

MiniMax M3

minimax · minimax/minimax-m3

Mistral Codestral

mistral · mistral/codestral

Devstral 2

mistral · mistral/devstral-2

Devstral Small 1.1

mistral · mistral/devstral-small

Devstral Small 2

mistral · mistral/devstral-small-2

Magistral Medium 2509

mistral · mistral/magistral-medium

Magistral Small 2509

mistral · mistral/magistral-small

Ministral 14B

mistral · mistral/ministral-14b

Ministral 3B

mistral · mistral/ministral-3b

Ministral 8B

mistral · mistral/ministral-8b

Mistral Large 3

mistral · mistral/mistral-large-3

Mistral Medium 3.1

mistral · mistral/mistral-medium

Mistral Medium Latest

mistral · mistral/mistral-medium-3.5

Mistral Nemo 12B

mistral · mistral/mistral-nemo

Mistral Small

mistral · mistral/mistral-small

Pixtral 12B 2409

mistral · mistral/pixtral-12b

Pixtral Large

mistral · mistral/pixtral-large

Kimi K2 Instruct

moonshotai · moonshotai/kimi-k2

Kimi K2 Thinking

moonshotai · moonshotai/kimi-k2-thinking

Kimi K2.5

moonshotai · moonshotai/kimi-k2.5

Kimi K2.6

moonshotai · moonshotai/kimi-k2.6

Kimi K2.7 Code

moonshotai · moonshotai/kimi-k2.7-code

Morph V3 Fast

morph · morph/morph-v3-fast

Morph V3 Large

morph · morph/morph-v3-large

Nemotron 3 Nano 30B A3B

nvidia · nvidia/nemotron-3-nano-30b-a3b

NVIDIA Nemotron 3 Super 120B A12B

nvidia · nvidia/nemotron-3-super-120b-a12b

Nemotron 3 Ultra

nvidia · nvidia/nemotron-3-ultra-550b-a55b

Nvidia Nemotron Nano 12B V2 VL

nvidia · nvidia/nemotron-nano-12b-v2-vl

Nvidia Nemotron Nano 9B V2

nvidia · nvidia/nemotron-nano-9b-v2

GPT-3.5 Turbo

openai · openai/gpt-3.5-turbo

GPT-3.5 Turbo Instruct

openai · openai/gpt-3.5-turbo-instruct

GPT-4 Turbo

openai · openai/gpt-4-turbo

GPT-4.1

openai · openai/gpt-4.1

GPT-4.1 mini

openai · openai/gpt-4.1-mini

GPT-4.1 nano

openai · openai/gpt-4.1-nano

GPT-4o

openai · openai/gpt-4o

GPT-4o mini

openai · openai/gpt-4o-mini

GPT 4o Mini Search Preview

openai · openai/gpt-4o-mini-search-preview

GPT-5

openai · openai/gpt-5

GPT 5 Chat

openai · openai/gpt-5-chat

GPT-5-Codex

openai · openai/gpt-5-codex

GPT-5 mini

openai · openai/gpt-5-mini

GPT-5 nano

openai · openai/gpt-5-nano

GPT-5 pro

openai · openai/gpt-5-pro

GPT-5.1-Codex

openai · openai/gpt-5.1-codex

GPT 5.1 Codex Max

openai · openai/gpt-5.1-codex-max

GPT 5.1 Codex Mini

openai · openai/gpt-5.1-codex-mini

GPT-5.1 Instant

openai · openai/gpt-5.1-instant

GPT 5.1 Thinking

openai · openai/gpt-5.1-thinking

GPT 5.2

openai · openai/gpt-5.2

GPT 5.2 Chat

openai · openai/gpt-5.2-chat

GPT 5.2 Codex

openai · openai/gpt-5.2-codex

GPT 5.2

openai · openai/gpt-5.2-pro

GPT-5.3 Chat

openai · openai/gpt-5.3-chat

GPT 5.3 Codex

openai · openai/gpt-5.3-codex

GPT 5.4

openai · openai/gpt-5.4

GPT 5.4 Mini

openai · openai/gpt-5.4-mini

GPT 5.4 Nano

openai · openai/gpt-5.4-nano

GPT 5.4 Pro

openai · openai/gpt-5.4-pro

GPT 5.5

openai · openai/gpt-5.5

GPT 5.5 Pro

openai · openai/gpt-5.5-pro

GPT OSS 120B

openai · openai/gpt-oss-120b

GPT OSS 20B

openai · openai/gpt-oss-20b

GPT OSS Safeguard 20B

openai · openai/gpt-oss-safeguard-20b

o1

openai · openai/o1

o3

openai · openai/o3

o3-deep-research

openai · openai/o3-deep-research

o3-mini

openai · openai/o3-mini

o3 Pro

openai · openai/o3-pro

o4-mini

openai · openai/o4-mini

Sonar

perplexity · perplexity/sonar

Sonar Pro

perplexity · perplexity/sonar-pro

Sonar Reasoning Pro

perplexity · perplexity/sonar-reasoning-pro

StepFun 3.5 Flash

stepfun · stepfun/step-3.5-flash

Step 3.7 Flash

stepfun · stepfun/step-3.7-flash

Grok 4.1 Fast Non-Reasoning

xai · xai/grok-4.1-fast-non-reasoning

Grok 4.1 Fast Reasoning

xai · xai/grok-4.1-fast-reasoning

Grok 4.20 Multi-Agent

xai · xai/grok-4.20-multi-agent

Grok 4.20 Multi Agent Beta

xai · xai/grok-4.20-multi-agent-beta

Grok 4.20 Non-Reasoning

xai · xai/grok-4.20-non-reasoning

Grok 4.20 Beta Non-Reasoning

xai · xai/grok-4.20-non-reasoning-beta

Grok 4.20 Reasoning

xai · xai/grok-4.20-reasoning

Grok 4.20 Beta Reasoning

xai · xai/grok-4.20-reasoning-beta

Grok 4.3

xai · xai/grok-4.3

Grok Build 0.1

xai · xai/grok-build-0.1

MiMo V2 Flash

xiaomi · xiaomi/mimo-v2-flash

MiMo V2 Pro

xiaomi · xiaomi/mimo-v2-pro

MiMo M2.5

xiaomi · xiaomi/mimo-v2.5

MiMo V2.5 Pro

xiaomi · xiaomi/mimo-v2.5-pro

GLM-4.5

zai · zai/glm-4.5

GLM 4.5 Air

zai · zai/glm-4.5-air

GLM 4.5V

zai · zai/glm-4.5v

GLM 4.6

zai · zai/glm-4.6

GLM-4.6V

zai · zai/glm-4.6v

GLM-4.6V-Flash

zai · zai/glm-4.6v-flash

GLM 4.7

zai · zai/glm-4.7

GLM 4.7 Flash

zai · zai/glm-4.7-flash

GLM 4.7 FlashX

zai · zai/glm-4.7-flashx

GLM 5

zai · zai/glm-5

GLM 5 Turbo

zai · zai/glm-5-turbo

GLM 5.1

zai · zai/glm-5.1

GLM 5V Turbo

zai · zai/glm-5v-turbo

FAQFrequently asked questions

Common questions

How many text models?

194 in the gateway catalog snapshot.

How does o10 route text workloads?

o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads. Route each workload to the cheapest compliant model tier after shadow-mode proof.

Where is catalog pricing sourced?

Unified inference gateway catalog snapshot — confirm live rates with your provider before enforce mode.

o10Set the envelope. o10 holds it.

See what you're overpaying.

Paste a week of traffic. Get the number that books the audit.

See what you're overpaying