Fastest AI models

The fastest AI models by output throughput (tokens generated per second) — useful for chat, agents and high-volume work. All available on anyAInow, pay-as-you-go.

#ModelSpeed
1GPT-5.4 mini
OpenAIFast·from $10 / 1M
924 tok/sTry →
2Mistral Small
MistralFast·from $5 / 1M
737 tok/sTry →
3Mistral Medium
MistralBalanced·from $20 / 1M
730 tok/sTry →
4GPT-5 nano
OpenAIFree·Free
500 tok/sTry →
5GPT-5.4
OpenAIBalanced·from $30 / 1M
467 tok/sTry →
6Gemini 2.5 Pro
GoogleFlagship·from $20 / 1M
394 tok/sTry →
7MiniMax M3
MiniMaxOpen·from $10 / 1M
380 tok/sTry →
8Claude Haiku 4.5
AnthropicFast·from $10 / 1M
346 tok/sTry →
9GPT-5.5
OpenAIFlagship·from $50 / 1M
341 tok/sTry →
10Mistral Large
MistralFlagship·from $20 / 1M
333 tok/sTry →
11Claude Sonnet 4.6
AnthropicBalanced·from $30 / 1M
332 tok/sTry →
12Gemini 3.1 Pro
GoogleFlagship·from $20 / 1M
266 tok/sTry →
13MiMo V2 Pro
Xiaomi (MiMo)Open·from $10 / 1M
245 tok/sTry →
14Claude Opus 4.8
AnthropicFlagship·from $40 / 1M
217 tok/sTry →
15GLM 5.2
Z.ai (GLM)Open·from $10 / 1M
206 tok/sTry →
16Claude Opus 4.7
AnthropicFlagship·from $40 / 1M
183 tok/sTry →
17Qwen 3.6 Plus
QwenOpen·from $10 / 1M
171 tok/sTry →
18Qwen 3.7 Plus
QwenOpen·from $10 / 1M
142 tok/sTry →
19Nova Lite
AmazonFast·from $5 / 1M
100 tok/sTry →
20Nova Pro
AmazonBalanced·from $10 / 1M
100 tok/sTry →
21Kimi K2
Moonshot (Kimi)Balanced·from $10 / 1M
87 tok/sTry →
22Llama 4 Maverick
Meta LlamaBalanced·from $10 / 1M
84 tok/sTry →
23Llama 4 Scout
Meta LlamaOpen·from $5 / 1M
76 tok/sTry →
24DeepSeek R1
DeepSeekReasoning·from $10 / 1M
45 tok/sTry →
25Llama 3.3 70B
Meta LlamaFree·Free
42 tok/sTry →
26Gemma 3 27B
GoogleOpen·from $10 / 1M
33 tok/sTry →
27Phi-4
MicrosoftOpen·from $10 / 1M
33 tok/sTry →

Throughput via llm-stats.com (7-day average), as of 28 Jun 2026. Prices shown are the anyAInow pay-as-you-go floor (per 1M tokens); free-tier models deduct nothing.

More rankings