What text models are in the catalog?
194 text models listed with pricing and endpoint data.
o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads.
194 text models in the unified inference model catalog.
Browse text models with gateway pricing and eval-gated routing notes.
Every page in this index follows the same structure as the home site — answer-first, passage blocks, operational steps, and expanded FAQs. o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads.
194 text models listed with pricing and endpoint data.
o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads.
alibaba · alibaba/qwen-3-14b
alibaba · alibaba/qwen-3-235b
alibaba · alibaba/qwen-3-30b
alibaba · alibaba/qwen-3-32b
alibaba · alibaba/qwen-3.6-max-preview
alibaba · alibaba/qwen3-235b-a22b-thinking
alibaba · alibaba/qwen3-coder
alibaba · alibaba/qwen3-coder-30b-a3b
alibaba · alibaba/qwen3-coder-next
alibaba · alibaba/qwen3-coder-plus
alibaba · alibaba/qwen3-max
alibaba · alibaba/qwen3-max-preview
alibaba · alibaba/qwen3-max-thinking
alibaba · alibaba/qwen3-next-80b-a3b-instruct
alibaba · alibaba/qwen3-next-80b-a3b-thinking
alibaba · alibaba/qwen3-vl-235b-a22b-instruct
alibaba · alibaba/qwen3-vl-instruct
alibaba · alibaba/qwen3-vl-thinking
alibaba · alibaba/qwen3.5-flash
alibaba · alibaba/qwen3.5-plus
alibaba · alibaba/qwen3.6-27b
alibaba · alibaba/qwen3.6-plus
alibaba · alibaba/qwen3.7-max
alibaba · alibaba/qwen3.7-plus
amazon · amazon/nova-2-lite
amazon · amazon/nova-lite
amazon · amazon/nova-micro
amazon · amazon/nova-pro
anthropic · anthropic/claude-3-haiku
anthropic · anthropic/claude-3.5-haiku
anthropic · anthropic/claude-haiku-4.5
anthropic · anthropic/claude-opus-4
anthropic · anthropic/claude-opus-4.1
anthropic · anthropic/claude-opus-4.5
anthropic · anthropic/claude-opus-4.6
anthropic · anthropic/claude-opus-4.7
anthropic · anthropic/claude-opus-4.8
anthropic · anthropic/claude-sonnet-4
anthropic · anthropic/claude-sonnet-4.5
anthropic · anthropic/claude-sonnet-4.6
arcee-ai · arcee-ai/trinity-large-preview
arcee-ai · arcee-ai/trinity-large-thinking
arcee-ai · arcee-ai/trinity-mini
bytedance · bytedance/seed-1.6
bytedance · bytedance/seed-1.8
cohere · cohere/command-a
deepseek · deepseek/deepseek-r1
deepseek · deepseek/deepseek-v3
deepseek · deepseek/deepseek-v3.1
deepseek · deepseek/deepseek-v3.1-terminus
deepseek · deepseek/deepseek-v3.2
deepseek · deepseek/deepseek-v3.2-thinking
deepseek · deepseek/deepseek-v4-flash
deepseek · deepseek/deepseek-v4-pro
google · google/gemini-2.5-flash
google · google/gemini-2.5-flash-image
google · google/gemini-2.5-flash-lite
google · google/gemini-2.5-pro
google · google/gemini-3-flash
google · google/gemini-3-pro-image
google · google/gemini-3-pro-preview
google · google/gemini-3.1-flash-image
google · google/gemini-3.1-flash-image-preview
google · google/gemini-3.1-flash-lite
google · google/gemini-3.1-flash-lite-preview
google · google/gemini-3.1-pro-preview
google · google/gemini-3.5-flash
google · google/gemma-4-26b-a4b-it
google · google/gemma-4-31b-it
inception · inception/mercury-2
inception · inception/mercury-coder-small
interfaze · interfaze/interfaze-beta
kwaipilot · kwaipilot/kat-coder-pro-v1
kwaipilot · kwaipilot/kat-coder-pro-v2
meituan · meituan/longcat-flash-chat
meituan · meituan/longcat-flash-thinking-2601
meta · meta/llama-3.1-70b
meta · meta/llama-3.1-8b
meta · meta/llama-3.2-11b
meta · meta/llama-3.2-1b
meta · meta/llama-3.2-3b
meta · meta/llama-3.2-90b
meta · meta/llama-3.3-70b
meta · meta/llama-4-maverick
meta · meta/llama-4-scout
minimax · minimax/minimax-m2
minimax · minimax/minimax-m2.1
minimax · minimax/minimax-m2.1-lightning
minimax · minimax/minimax-m2.5
minimax · minimax/minimax-m2.5-highspeed
minimax · minimax/minimax-m2.7
minimax · minimax/minimax-m2.7-highspeed
minimax · minimax/minimax-m3
mistral · mistral/codestral
mistral · mistral/devstral-2
mistral · mistral/devstral-small
mistral · mistral/devstral-small-2
mistral · mistral/magistral-medium
mistral · mistral/magistral-small
mistral · mistral/ministral-14b
mistral · mistral/ministral-3b
mistral · mistral/ministral-8b
mistral · mistral/mistral-large-3
mistral · mistral/mistral-medium
mistral · mistral/mistral-medium-3.5
mistral · mistral/mistral-nemo
mistral · mistral/mistral-small
mistral · mistral/pixtral-12b
mistral · mistral/pixtral-large
moonshotai · moonshotai/kimi-k2
moonshotai · moonshotai/kimi-k2-thinking
moonshotai · moonshotai/kimi-k2.5
moonshotai · moonshotai/kimi-k2.6
moonshotai · moonshotai/kimi-k2.7-code
morph · morph/morph-v3-fast
morph · morph/morph-v3-large
nvidia · nvidia/nemotron-3-nano-30b-a3b
nvidia · nvidia/nemotron-3-super-120b-a12b
nvidia · nvidia/nemotron-3-ultra-550b-a55b
nvidia · nvidia/nemotron-nano-12b-v2-vl
nvidia · nvidia/nemotron-nano-9b-v2
openai · openai/gpt-3.5-turbo
openai · openai/gpt-3.5-turbo-instruct
openai · openai/gpt-4-turbo
openai · openai/gpt-4.1
openai · openai/gpt-4.1-mini
openai · openai/gpt-4.1-nano
openai · openai/gpt-4o
openai · openai/gpt-4o-mini
openai · openai/gpt-4o-mini-search-preview
openai · openai/gpt-5
openai · openai/gpt-5-chat
openai · openai/gpt-5-codex
openai · openai/gpt-5-mini
openai · openai/gpt-5-nano
openai · openai/gpt-5-pro
openai · openai/gpt-5.1-codex
openai · openai/gpt-5.1-codex-max
openai · openai/gpt-5.1-codex-mini
openai · openai/gpt-5.1-instant
openai · openai/gpt-5.1-thinking
openai · openai/gpt-5.2
openai · openai/gpt-5.2-chat
openai · openai/gpt-5.2-codex
openai · openai/gpt-5.2-pro
openai · openai/gpt-5.3-chat
openai · openai/gpt-5.3-codex
openai · openai/gpt-5.4
openai · openai/gpt-5.4-mini
openai · openai/gpt-5.4-nano
openai · openai/gpt-5.4-pro
openai · openai/gpt-5.5
openai · openai/gpt-5.5-pro
openai · openai/gpt-oss-120b
openai · openai/gpt-oss-20b
openai · openai/gpt-oss-safeguard-20b
openai · openai/o1
openai · openai/o3
openai · openai/o3-deep-research
openai · openai/o3-mini
openai · openai/o3-pro
openai · openai/o4-mini
perplexity · perplexity/sonar
perplexity · perplexity/sonar-pro
perplexity · perplexity/sonar-reasoning-pro
stepfun · stepfun/step-3.5-flash
stepfun · stepfun/step-3.7-flash
xai · xai/grok-4.1-fast-non-reasoning
xai · xai/grok-4.1-fast-reasoning
xai · xai/grok-4.20-multi-agent
xai · xai/grok-4.20-multi-agent-beta
xai · xai/grok-4.20-non-reasoning
xai · xai/grok-4.20-non-reasoning-beta
xai · xai/grok-4.20-reasoning
xai · xai/grok-4.20-reasoning-beta
xai · xai/grok-4.3
xai · xai/grok-build-0.1
xiaomi · xiaomi/mimo-v2-flash
xiaomi · xiaomi/mimo-v2-pro
xiaomi · xiaomi/mimo-v2.5
xiaomi · xiaomi/mimo-v2.5-pro
zai · zai/glm-4.5
zai · zai/glm-4.5-air
zai · zai/glm-4.5v
zai · zai/glm-4.6
zai · zai/glm-4.6v
zai · zai/glm-4.6v-flash
zai · zai/glm-4.7
zai · zai/glm-4.7-flash
zai · zai/glm-4.7-flashx
zai · zai/glm-5
zai · zai/glm-5-turbo
zai · zai/glm-5.1
zai · zai/glm-5v-turbo
194 in the gateway catalog snapshot.
o10 State of Inference Spend 2026 found up to 638× compliant price spread across venues for identical workloads. Route each workload to the cheapest compliant model tier after shadow-mode proof.
Unified inference gateway catalog snapshot — confirm live rates with your provider before enforce mode.
Paste a week of traffic. Get the number that books the audit.
See what you're overpaying →