527 models · updated May 31, 2026
Intelligence vs. Price
Higher and further left is the sweet spot. The line traces the efficiency frontier — models nothing else beats on both smarts and cost.
Speed
Output tokens per second — higher is faster.
Price
Blended $/1M tokens (3:1) — lower is cheaper.
Latency
Time to first token (s) — lower is snappier.
Tokens used
Output tokens to run the Intelligence Index.
All models
527 models · click a column to sort · cells shaded by rank — brighter is better
Released→
| # | Model | Creator | Intelligence ↓ | GPQA | LiveCodeBench | AIME | MMLU-Pro | HLE | Context | Inputs | Type | Speed | Latency | In $/1M | Out $/1M | Blended | Tokens used | Cost to run | Released |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Claude Opus 4.8 (Adaptive Reasoning, Max Effort) | Anthropic | 61.4 | 92.0% | — | — | — | 45.7% | 1M | T 🖼 | 59 t/s | 17.95 s | $6.25 | $25.00 | $10.94 | 112.0M | $4.7k | May 2026 | |
| 2 | GPT-5.5 (xhigh) | OpenAI | 60.2 | 93.5% | — | — | — | 44.3% | 922k | T 🖼 | 56 t/s | 73.96 s | $5.00 | $30.00 | $11.25 | 75.2M | $3.4k | Apr 2026 | |
| 3 | GPT-5.5 (high) | OpenAI | 58.9 | 93.2% | — | — | — | 43.0% | 922k | T 🖼 | 51 t/s | 17.22 s | $5.00 | $30.00 | $11.25 | 44.5M | $2.2k | Apr 2026 | |
| 4 | Claude Opus 4.7 (Adaptive Reasoning, Max Effort) | Anthropic | 57.3 | 91.4% | — | — | — | 39.6% | 1M | T 🖼 | 43 t/s | 9.91 s | $6.25 | $25.00 | $10.94 | 111.9M | $5.1k | Apr 2026 | |
| 5 | Gemini 3.1 Pro Preview | 57.2 | 94.1% | — | — | — | 44.7% | 1M | T 🖼 🔊 🎬 | 123 t/s | 19.64 s | $2.00 | $12.00 | $4.50 | 57.3M | $892 | Feb 2026 | ||
| 6 | GPT-5.4 (xhigh) | OpenAI | 56.8 | 92.0% | — | — | — | 41.6% | 1.1M | T 🖼 | 84 t/s | 182.61 s | $2.50 | $15.00 | $5.63 | 120.6M | $2.9k | Mar 2026 | |
| 7 | GPT-5.5 (medium) | OpenAI | 56.7 | 92.6% | — | — | — | 40.6% | 922k | T 🖼 | 48 t/s | 6.72 s | $5.00 | $30.00 | $11.25 | 22.5M | $1.2k | Apr 2026 | |
| 8 | Qwen3.7 Max | Alibaba | 56.6 | 92.3% | — | — | — | 38.1% | 1M | T | 189 t/s | 2.58 s | $2.50 | $7.50 | $3.75 | 96.7M | $1.2k | May 2026 | |
| 9 | Gemini 3.5 Flash (high) | 55.3 | 92.2% | — | — | — | 41.0% | 1M | T 🖼 🔊 🎬 | 183 t/s | 18.18 s | $1.50 | $9.00 | $3.38 | 72.6M | $1.6k | May 2026 | ||
| 10 | Gemini 3.5 Flash (medium) | 54.8 | 92.1% | — | — | — | 39.9% | 1M | T 🖼 🔊 🎬 | 174 t/s | 12.72 s | $1.50 | $9.00 | $3.38 | 56.7M | $1.4k | May 2026 | ||
| 11 | Kimi K2.6 | Kimi | 53.9 | 91.1% | — | — | — | 35.9% | 256k | T 🖼 🎬 | 44 t/s | 2.31 s | $0.950 | $4.00 | $1.71 | 165.5M | $948 | Apr 2026 | |
| 12 | MiMo-V2.5-Pro | Xiaomi | 53.8 | 86.6% | — | — | — | 33.8% | 1M | T | 50 t/s | 3.52 s | $0.435 | $0.870 | $0.544 | 91.9M | $161 | Apr 2026 | |
| 13 | GPT-5.3 Codex (xhigh) | OpenAI | 53.6 | 91.5% | — | — | — | 39.9% | 400k | T 🖼 | 76 t/s | 95.05 s | $1.75 | $14.00 | $4.81 | 77.5M | $1.6k | Feb 2026 | |
| 14 | Grok 4.3 (high) | xAI | 53.2 | 90.1% | — | — | — | 35.0% | 1M | T 🖼 | 141 t/s | 10.38 s | $1.25 | $2.50 | $1.56 | 88.0M | $395 | Apr 2026 | |
| 15 | Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 52.9 | 89.6% | — | — | — | 36.7% | 1M | T 🖼 | 41 t/s | 19.70 s | $6.25 | $25.00 | $10.94 | 157.0M | $5.2k | Feb 2026 | |
| 16 | Muse Spark | Meta | 52.2 | 88.4% | — | — | — | 39.9% | 262k | T 🖼 🔊 | — | — | — | — | — | 58.4M | $0.000 | Apr 2026 | |
| 17 | Claude Opus 4.7 (Non-reasoning, High Effort) | Anthropic | 51.8 | 88.5% | — | — | — | 31.2% | 1M | T 🖼 | 42 t/s | 1.17 s | $6.25 | $25.00 | $10.94 | 11.6M | $1.2k | Apr 2026 | |
| 18 | Qwen3.6 Max Preview | Alibaba | 51.8 | 88.8% | — | — | — | 28.9% | 256k | T | 39 t/s | 3.41 s | $1.30 | $7.80 | $2.92 | 73.9M | $861 | Apr 2026 | |
| 19 | Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 51.7 | 87.5% | — | — | — | 30.0% | 1M | T 🖼 | 47 t/s | 104.53 s | $3.75 | $15.00 | $6.56 | 198.2M | $4.2k | Feb 2026 | |
| 20 | DeepSeek V4 Pro (Reasoning, Max Effort) | DeepSeek | 51.5 | 88.8% | — | — | — | 35.9% | 1M | T | 48 t/s | 1.85 s | $0.435 | $0.870 | $0.544 | 187.2M | $268 | Apr 2026 | |
| 21 | GLM-5.1 (Reasoning) | Z AI | 51.4 | 86.8% | — | — | — | 28.0% | 200k | T | 62 t/s | 1.51 s | $1.40 | $4.40 | $2.15 | undefined | — | Apr 2026 | |
| 22 | GPT-5.2 (xhigh) | OpenAI | 51.3 | 90.3% | 88.9% | 99.0% | 87.4% | 35.4% | 400k | T 🖼 | 71 t/s | 114.99 s | $1.75 | $14.00 | $4.81 | 129.6M | $2.3k | Dec 2025 | |
| 23 | GPT-5.5 (low) | OpenAI | 50.8 | 91.0% | — | — | — | 31.0% | 922k | T 🖼 | 51 t/s | 1.67 s | $5.00 | $30.00 | $11.25 | 7.0M | $501 | Apr 2026 | |
| 24 | Qwen3.6 Plus | Alibaba | 50.0 | 88.2% | — | — | — | 25.7% | 1M | T 🖼 🎬 | 53 t/s | 2.90 s | $0.500 | $3.00 | $1.13 | 101.3M | $483 | Apr 2026 | |
| 25 | DeepSeek V4 Pro (Reasoning, High Effort) | DeepSeek | 49.8 | 90.5% | — | — | — | 33.5% | 1M | T | 46 t/s | 1.80 s | $0.435 | $0.870 | $0.544 | 103.7M | $173 | Apr 2026 | |
| 26 | GLM-5 (Reasoning) | Z AI | 49.8 | 82.0% | — | — | — | 27.2% | 200k | T | 68 t/s | 1.52 s | $1.00 | $3.20 | $1.55 | 109.3M | $547 | Feb 2026 | |
| 27 | Claude Opus 4.5 (Reasoning) | Anthropic | 49.7 | 86.6% | 87.1% | 91.3% | 89.5% | 28.4% | 200k | T 🖼 | 47 t/s | 12.49 s | $6.25 | $25.00 | $10.94 | 71.9M | $3.0k | Nov 2025 | |
| 28 | MiniMax-M2.7 | MiniMax | 49.6 | 87.4% | — | — | — | 28.1% | 205k | T | 101 t/s | 2.65 s | $0.300 | $1.20 | $0.525 | 86.9M | $176 | Mar 2026 | |
| 29 | Grok 4.20 0309 v2 (Reasoning) | xAI | 49.3 | 91.1% | — | — | — | 32.2% | 2M | T 🖼 | 192 t/s | 10.40 s | $2.00 | $6.00 | $3.00 | 60.9M | $514 | Apr 2026 | |
| 30 | MiMo-V2-Pro | Xiaomi | 49.2 | 87.0% | — | — | — | 28.3% | 1M | T | 54 t/s | 2.95 s | $1.00 | $3.00 | $1.50 | 77.1M | $351 | Mar 2026 | |
| 31 | GPT-5.2 Codex (xhigh) | OpenAI | 49.0 | 89.9% | — | — | — | 33.5% | 400k | T 🖼 | 111 t/s | 33.11 s | $1.75 | $14.00 | $4.81 | 201.7M | $3.2k | Dec 2025 | |
| 32 | MiMo-V2.5 | Xiaomi | 49.0 | 84.9% | — | — | — | 25.2% | 1M | T 🖼 | 92 t/s | 2.93 s | $0.140 | $0.280 | $0.175 | 74.4M | $49.30 | Apr 2026 | |
| 33 | GPT-5.4 mini (xhigh) | OpenAI | 48.9 | 87.5% | — | — | — | 26.6% | 400k | T 🖼 | 161 t/s | 5.09 s | $0.750 | $4.50 | $1.69 | 235.3M | $1.4k | Mar 2026 | |
| 34 | Grok 4.3 (medium) | xAI | 48.8 | 89.0% | — | — | — | 28.1% | 1M | T 🖼 | 142 t/s | 7.52 s | $1.25 | $2.50 | $1.56 | 28.2M | $161 | Apr 2026 | |
| 35 | Grok 4.20 0309 (Reasoning) | xAI | 48.5 | 88.5% | — | — | — | 30.0% | 2M | T 🖼 | 195 t/s | 9.26 s | $2.00 | $6.00 | $3.00 | 54.4M | $484 | Mar 2026 | |
| 36 | Gemini 3 Pro Preview (high) | 48.4 | 90.8% | 91.7% | 95.7% | 89.8% | 37.2% | 1M | T 🖼 🔊 🎬 | — | — | $2.00 | $12.00 | $4.50 | 55.8M | $820 | Nov 2025 | ||
| 37 | GPT-5.4 (low) | OpenAI | 47.9 | 87.1% | — | — | — | 28.9% | 1.1M | T 🖼 | 61 t/s | 2.03 s | $2.50 | $15.00 | $5.63 | 9.9M | $413 | Mar 2026 | |
| 38 | GPT-5.1 (high) | OpenAI | 47.7 | 87.3% | 86.8% | 94.0% | 87.0% | 26.5% | 272k | T 🖼 | 123 t/s | 22.90 s | $1.25 | $10.00 | $3.44 | 68.7M | $779 | Nov 2025 | |
| 39 | Kimi K2.5 (Reasoning) | Kimi | 46.8 | 87.9% | — | — | — | 29.4% | 256k | T 🖼 🎬 | 37 t/s | 3.04 s | $0.580 | $3.00 | $1.19 | 88.6M | $367 | Jan 2026 | |
| 40 | GLM-5-Turbo | Z AI | 46.8 | 84.7% | — | — | — | 25.4% | 200k | T | — | — | — | — | — | 94.4M | $0.000 | Mar 2026 | |
| 41 | GPT-5.2 (medium) | OpenAI | 46.6 | 86.4% | 89.4% | 96.7% | 85.9% | 24.9% | 400k | T 🖼 | — | — | $1.75 | $14.00 | $4.81 | 21.5M | $700 | Dec 2025 | |
| 42 | DeepSeek V4 Flash (Reasoning, Max Effort) | DeepSeek | 46.5 | 89.4% | — | — | — | 32.1% | 1M | T | 106 t/s | 1.24 s | $0.140 | $0.280 | $0.175 | 241.1M | $113 | Apr 2026 | |
| 43 | Claude Opus 4.6 (Non-reasoning, High Effort) | Anthropic | 46.5 | 84.0% | — | — | — | 18.6% | 1M | T 🖼 | 38 t/s | 1.28 s | $6.25 | $25.00 | $10.94 | 10.9M | $1.7k | Feb 2026 | |
| 44 | Gemini 3 Flash Preview (Reasoning) | 46.4 | 89.8% | 90.8% | 97.0% | 89.0% | 34.7% | 1M | T 🖼 🔊 🎬 | 162 t/s | 7.20 s | $0.500 | $3.00 | $1.13 | 72.0M | $278 | Dec 2025 | ||
| 45 | DeepSeek V4 Flash (Reasoning, High Effort) | DeepSeek | 46.0 | 86.7% | — | — | — | 27.8% | 1M | T | — | — | $0.140 | $0.280 | $0.175 | 98.7M | $57.43 | Apr 2026 | |
| 46 | Qwen3.6 27B (Reasoning) | Alibaba | 45.8 | 84.2% | — | — | — | 21.6% | 262k | T 🖼 🎬 | 57 t/s | 3.86 s | $0.600 | $3.60 | $1.35 | 144.5M | $659 | Apr 2026 | |
| 47 | Qwen3.5 397B A17B (Reasoning) | Alibaba | 45.0 | 89.3% | — | — | — | 27.3% | 262k | T 🖼 | 53 t/s | 2.58 s | $0.600 | $3.60 | $1.35 | 85.9M | $418 | Feb 2026 | |
| 48 | MiMo-V2-Omni-0327 | Xiaomi | 44.9 | 85.5% | — | — | — | 20.4% | 256k | T 🖼 | 92 t/s | 3.20 s | $0.400 | $2.00 | $0.800 | 86.5M | $218 | Mar 2026 | |
| 49 | GPT-5 Codex (high) | OpenAI | 44.6 | 83.7% | 84.0% | 98.7% | 86.5% | 25.6% | 400k | T 🖼 | 177 t/s | 5.74 s | $1.25 | $10.00 | $3.44 | 72.1M | $995 | Sep 2025 | |
| 50 | GPT-5 (high) | OpenAI | 44.6 | 85.4% | 84.6% | 94.3% | 87.1% | 26.5% | 400k | T 🖼 | 82 t/s | 71.91 s | $1.25 | $10.00 | $3.44 | 76.2M | $913 | Aug 2025 | |
| 51 | Claude Sonnet 4.6 (Non-reasoning, High Effort) | Anthropic | 44.4 | 79.9% | — | — | — | 13.2% | 1M | T 🖼 | 42 t/s | 1.20 s | $3.75 | $15.00 | $6.56 | 13.8M | $1.7k | Feb 2026 | |
| 52 | GPT-5.4 nano (xhigh) | OpenAI | 44.0 | 81.7% | — | — | — | 26.5% | 400k | T 🖼 | 154 t/s | 3.80 s | $0.200 | $1.25 | $0.463 | 208.6M | $363 | Mar 2026 | |
| 53 | Grok 4.3 (low) | xAI | 43.9 | 84.3% | — | — | — | 17.3% | 1M | T 🖼 | 113 t/s | 3.99 s | $1.25 | $2.50 | $1.56 | 12.3M | $98.68 | Apr 2026 | |
| 54 | KAT Coder Pro V2 | KwaiKAT | 43.8 | 85.5% | — | — | — | 16.0% | 256k | T | 112 t/s | 1.26 s | $0.300 | $1.20 | $0.525 | 8.7M | $73.49 | Mar 2026 | |
| 55 | GLM-5.1 (Non-reasoning) | Z AI | 43.8 | 83.9% | — | — | — | 25.6% | 200k | T | 49 t/s | 1.77 s | $1.40 | $4.40 | $2.15 | 75.8M | $618 | Apr 2026 | |
| 56 | Qwen3.6 35B A3B (Reasoning) | Alibaba | 43.5 | 84.1% | — | — | — | 20.2% | 262k | T 🖼 | 174 t/s | 2.41 s | $0.248 | $1.49 | $0.557 | 143.2M | $280 | Apr 2026 | |
| 57 | MiMo-V2-Omni | Xiaomi | 43.4 | 82.8% | — | — | — | 19.9% | 256k | T 🖼 | 90 t/s | 3.78 s | — | — | — | 87.4M | $0.000 | Mar 2026 | |
| 58 | Gemini 3.5 Flash (minimal) | 43.3 | 82.8% | — | — | — | 23.1% | 1M | T 🖼 🔊 🎬 | 169 t/s | 0.89 s | $1.50 | $9.00 | $3.38 | 11.5M | $750 | May 2026 | ||
| 59 | GPT-5.1 Codex (high) | OpenAI | 43.1 | 86.0% | 84.9% | 95.7% | 86.0% | 23.4% | 400k | T 🖼 | 168 t/s | 4.67 s | $1.25 | $10.00 | $3.44 | 56.7M | $892 | Nov 2025 | |
| 60 | Claude Opus 4.5 (Non-reasoning) | Anthropic | 43.1 | 81.0% | 73.8% | 62.7% | 88.9% | 12.9% | 200k | T 🖼 | 45 t/s | 1.63 s | $6.25 | $25.00 | $10.94 | 7.9M | $1.4k | Nov 2025 | |
| 61 | Claude 4.5 Sonnet (Reasoning) | Anthropic | 43.0 | 83.4% | 71.4% | 88.0% | 87.5% | 17.3% | 1M | T 🖼 | 42 t/s | 11.39 s | $3.75 | $15.00 | $6.56 | 63.7M | $1.6k | Sep 2025 | |
| 62 | Kimi K2.6 (Non-reasoning) | Kimi | 42.9 | 78.8% | — | — | — | 18.2% | 256k | T 🖼 🎬 | 45 t/s | 2.48 s | $0.950 | $4.00 | $1.71 | 27.2M | $505 | Apr 2026 | |
| 63 | GLM 5V Turbo (Reasoning) | Z AI | 42.9 | 80.9% | — | — | — | 15.8% | 200k | T 🖼 🎬 | — | — | — | — | — | 42.3M | $0.000 | Apr 2026 | |
| 64 | Claude Sonnet 4.6 (Non-reasoning, Low Effort) | Anthropic | 42.6 | 79.7% | — | — | — | 10.8% | 1M | T 🖼 | 42 t/s | 1.28 s | $3.75 | $15.00 | $6.56 | 7.2M | $666 | Feb 2026 | |
| 65 | GLM-4.7 (Reasoning) | Z AI | 42.1 | 85.9% | 89.4% | 95.0% | 85.6% | 25.1% | 200k | T | 84 t/s | 1.43 s | $0.600 | $2.20 | $1.00 | 167.5M | $478 | Dec 2025 | |
| 66 | Qwen3.5 27B (Reasoning) | Alibaba | 42.1 | 85.8% | — | — | — | 22.2% | 262k | T 🖼 | 81 t/s | 5.69 s | $0.300 | $2.40 | $0.825 | 97.9M | $299 | Feb 2026 | |
| 67 | GPT-5 (medium) | OpenAI | 42.0 | 84.2% | 70.3% | 91.7% | 86.7% | 23.5% | 400k | T 🖼 | 82 t/s | 31.70 s | $1.25 | $10.00 | $3.44 | 42.1M | $552 | Aug 2025 | |
| 68 | Claude 4.1 Opus (Reasoning) | Anthropic | 42.0 | 80.9% | 65.4% | 80.3% | 88.0% | 11.9% | 200k | T 🖼 | 31 t/s | 12.39 s | $18.75 | $75.00 | $32.81 | undefined | — | Aug 2025 | |
| 69 | MiniMax-M2.5 | MiniMax | 41.9 | 84.8% | — | — | — | 19.1% | 205k | T | 198 t/s | 3.06 s | $0.300 | $1.20 | $0.525 | 56.3M | $125 | Feb 2026 | |
| 70 | Hy3-preview (Reasoning) | Tencent | 41.9 | 86.7% | — | — | — | 25.5% | 256k | T | 98 t/s | 3.87 s | $0.123 | $0.430 | $0.200 | 123.6M | $84.43 | Apr 2026 | |
| 71 | GPT-5.5 Instant (May 2026) | OpenAI | 41.8 | 84.6% | — | — | — | 20.3% | 400k | T 🖼 | — | — | $5.00 | $30.00 | $11.25 | 2.3M | $368 | May 2026 | |
| 72 | DeepSeek V3.2 (Reasoning) | DeepSeek | 41.7 | 84.0% | 86.2% | 92.0% | 86.2% | 22.2% | 128k | T | — | — | $0.300 | $0.450 | $0.337 | 61.4M | $75.68 | Dec 2025 | |
| 73 | Qwen3.5 122B A10B (Reasoning) | Alibaba | 41.6 | 85.7% | — | — | — | 23.4% | 262k | T 🖼 | 140 t/s | 2.52 s | $0.400 | $3.20 | $1.10 | 91.3M | $354 | Feb 2026 | |
| 74 | Grok 4 | xAI | 41.5 | 87.7% | 81.9% | 92.7% | 86.6% | 23.9% | 256k | T 🖼 | — | — | $5.50 | $27.50 | $11.00 | 88.4M | $2.9k | Jul 2025 | |
| 75 | MiMo-V2-Flash (Feb 2026) | Xiaomi | 41.5 | 83.5% | — | — | — | 20.0% | 256k | T | 131 t/s | 2.07 s | $0.100 | $0.300 | $0.150 | 91.6M | $66.54 | Dec 2025 | |
| 76 | Gemini 3 Pro Preview (low) | 41.3 | 88.7% | 85.7% | 86.7% | 89.5% | 27.6% | 1M | T 🖼 🔊 🎬 | — | — | $2.00 | $12.00 | $4.50 | 16.4M | $355 | Nov 2025 | ||
| 77 | GPT-5 mini (high) | OpenAI | 41.2 | 82.8% | 83.8% | 90.7% | 83.7% | 19.7% | 400k | T 🖼 | 87 t/s | 78.81 s | $0.250 | $2.00 | $0.688 | 68.9M | $168 | Aug 2025 | |
| 78 | GPT-5.5 (Non-reasoning) | OpenAI | 40.9 | 76.8% | — | — | — | 12.6% | 922k | T 🖼 | 50 t/s | 0.97 s | $5.00 | $30.00 | $11.25 | 2.8M | $361 | Apr 2026 | |
| 79 | Kimi K2 Thinking | Kimi | 40.9 | 83.8% | 85.3% | 94.7% | 84.8% | 22.3% | 256k | T | 122 t/s | 1.49 s | $0.600 | $2.50 | $1.07 | 100.0M | $308 | Nov 2025 | |
| 80 | o3-pro | OpenAI | 40.7 | 84.5% | — | — | — | — | 200k | T 🖼 | 39 t/s | 77.30 s | $20.00 | $80.00 | $35.00 | undefined | — | Jun 2025 | |
| 81 | GLM-5 (Non-reasoning) | Z AI | 40.6 | 66.6% | — | — | — | 7.2% | 200k | T | 61 t/s | 1.61 s | $1.00 | $3.20 | $1.55 | 12.6M | $240 | Feb 2026 | |
| 82 | Qwen3.5 397B A17B (Non-reasoning) | Alibaba | 40.1 | 86.1% | — | — | — | 18.8% | 262k | T 🖼 | 53 t/s | 2.58 s | $0.600 | $3.60 | $1.35 | 20.0M | $186 | Feb 2026 | |
| 83 | Qwen3 Max Thinking | Alibaba | 39.8 | 86.1% | — | — | — | 26.2% | 256k | T | 43 t/s | 4.55 s | $1.20 | $6.00 | $2.40 | 85.7M | $669 | Jan 2026 | |
| 84 | MiniMax-M2.1 | MiniMax | 39.4 | 83.0% | 81.0% | 82.7% | 87.5% | 22.2% | 205k | T | 142 t/s | 2.97 s | $0.300 | $1.20 | $0.525 | 58.4M | $114 | Dec 2025 | |
| 85 | DeepSeek V4 Pro (Non-reasoning) | DeepSeek | 39.3 | 71.7% | — | — | — | 7.7% | 1M | T | 52 t/s | 1.98 s | $0.435 | $0.870 | $0.544 | 13.5M | $154 | Apr 2026 | |
| 86 | MiMo-V2-Flash (Reasoning) | Xiaomi | 39.2 | 84.6% | 86.8% | 96.3% | 84.3% | 21.1% | 256k | T | 125 t/s | 2.18 s | $0.100 | $0.300 | $0.150 | 98.1M | $47.49 | Dec 2025 | |
| 87 | Mistral Medium 3.5 | Mistral | 39.2 | 74.8% | — | — | — | 12.8% | 256k | T 🖼 | 148 t/s | 1.83 s | $1.50 | $7.50 | $3.00 | 89.9M | $1.0k | Apr 2026 | |
| 88 | GPT-5 (low) | OpenAI | 39.2 | 80.8% | 76.3% | 83.0% | 86.0% | 18.4% | 400k | T 🖼 | 76 t/s | 7.70 s | $1.25 | $10.00 | $3.44 | 15.0M | $228 | Aug 2025 | |
| 89 | Gemma 4 31B (Reasoning) | 39.2 | 85.7% | — | — | — | 22.7% | 256k | T 🖼 🎬 | 35 t/s | 1.07 s | — | — | — | 39.2M | $0.000 | Apr 2026 | ||
| 90 | Claude 4 Opus (Reasoning) | Anthropic | 39.0 | 79.6% | 63.6% | 73.3% | 87.3% | 11.7% | 200k | T 🖼 | 30 t/s | 7.89 s | $18.75 | $75.00 | $32.81 | undefined | — | May 2025 | |
| 91 | GPT-5 mini (medium) | OpenAI | 38.9 | 80.3% | 69.2% | 85.0% | 82.8% | 14.6% | 400k | T 🖼 | 93 t/s | 13.72 s | $0.250 | $2.00 | $0.688 | 20.6M | $61.91 | Aug 2025 | |
| 92 | Claude 4 Sonnet (Reasoning) | Anthropic | 38.7 | 77.7% | 65.5% | 74.3% | 84.2% | 9.6% | 1M | T 🖼 | 41 t/s | 12.72 s | $3.75 | $15.00 | $6.56 | 55.5M | $1.3k | May 2025 | |
| 93 | Qwen3.5 Omni Plus | Alibaba | 38.6 | 82.6% | — | — | — | 13.9% | 256k | T 🖼 🔊 🎬 | 55 t/s | 2.46 s | $0.400 | $4.80 | $1.50 | 15.6M | $150 | Mar 2026 | |
| 94 | GPT-5.1 Codex mini (high) | OpenAI | 38.6 | 81.3% | 83.6% | 91.7% | 82.0% | 16.9% | 400k | T 🖼 | 207 t/s | 3.76 s | $0.250 | $2.00 | $0.688 | 74.7M | $202 | Nov 2025 | |
| 95 | Grok 4.1 Fast (Reasoning) | xAI | 38.6 | 85.3% | 82.2% | 89.3% | 85.4% | 17.6% | 2M | T 🖼 | — | — | — | — | — | 52.6M | $0.000 | Nov 2025 | |
| 96 | Step 3.5 Flash 2603 | StepFun | 38.5 | 82.6% | — | — | — | 22.6% | 256k | T | 160 t/s | 1.20 s | — | — | — | 260.6M | $0.000 | Apr 2026 | |
| 97 | Ring-2.6-1T | InclusionAI | 38.5 | 85.7% | — | — | — | 18.3% | 262k | T | 120 t/s | 3.18 s | $0.300 | $2.50 | $0.850 | 104.9M | $334 | May 2026 | |
| 98 | o3 | OpenAI | 38.4 | 82.7% | 80.8% | 88.3% | 85.3% | 20.0% | 200k | T 🖼 | 126 t/s | 5.47 s | $2.00 | $8.00 | $3.50 | 48.4M | $1.0k | Apr 2025 | |
| 99 | GPT-5.4 nano (medium) | OpenAI | 38.1 | 76.1% | — | — | — | 14.7% | 400k | T 🖼 | 146 t/s | 2.94 s | $0.200 | $1.25 | $0.463 | 26.5M | $90.57 | Mar 2026 | |
| 100 | Step 3.5 Flash | StepFun | 37.8 | 83.1% | — | — | — | 19.1% | 256k | T | 163 t/s | 1.17 s | $0.100 | $0.300 | $0.150 | 202.8M | $74.80 | Feb 2026 | |
| 101 | GPT-5.4 mini (medium) | OpenAI | 37.7 | 82.3% | — | — | — | 17.1% | 400k | T 🖼 | 157 t/s | 3.83 s | $0.750 | $4.50 | $1.69 | 35.3M | $302 | Mar 2026 | |
| 102 | Kimi K2.5 (Non-reasoning) | Kimi | 37.3 | 78.9% | — | — | — | 12.3% | 256k | T 🖼 🎬 | 39 t/s | 3.43 s | $0.600 | $3.00 | $1.20 | 12.8M | $141 | Jan 2026 | |
| 103 | Qwen3.5 27B (Non-reasoning) | Alibaba | 37.2 | 84.2% | — | — | — | 13.2% | 262k | T 🖼 | 89 t/s | 5.69 s | $0.300 | $2.60 | $0.875 | 25.1M | $128 | Feb 2026 | |
| 104 | Command A+ | Cohere | 37.2 | 76.1% | — | — | — | 11.4% | 192k | T 🖼 | 223 t/s | 0.26 s | — | — | — | 66.4M | $0.000 | May 2026 | |
| 105 | Qwen3.6 27B (Non-reasoning) | Alibaba | 37.1 | 82.9% | — | — | — | 13.6% | 262k | T 🖼 🎬 | 56 t/s | 3.86 s | $0.600 | $3.60 | $1.35 | 22.3M | $234 | Apr 2026 | |
| 106 | Claude 4.5 Sonnet (Non-reasoning) | Anthropic | 37.1 | 72.7% | 59.0% | 37.0% | 86.0% | 7.1% | 1M | T 🖼 | 39 t/s | 1.52 s | $3.75 | $15.00 | $6.56 | 7.9M | $827 | Sep 2025 | |
| 107 | Qwen3.5 35B A3B (Reasoning) | Alibaba | 37.1 | 84.5% | — | — | — | 19.7% | 262k | T 🖼 | 145 t/s | 2.26 s | $0.250 | $2.00 | $0.688 | 100.5M | $302 | Feb 2026 | |
| 108 | Claude 4.5 Haiku (Reasoning) | Anthropic | 37.1 | 67.2% | 61.5% | 83.7% | 76.0% | 9.7% | 200k | T 🖼 | 94 t/s | 20.98 s | $1.25 | $5.00 | $2.19 | 87.3M | $620 | Oct 2025 | |
| 109 | DeepSeek V4 Flash (Non-reasoning) | DeepSeek | 36.5 | 71.6% | — | — | — | 7.0% | 1M | T | 106 t/s | 1.39 s | $0.140 | $0.280 | $0.175 | 10.9M | $40.05 | Apr 2026 | |
| 110 | JT-35B-Flash | China Mobile | 36.1 | 82.9% | — | — | — | 6.1% | 256k | T | — | — | — | — | — | 17.3M | $0.000 | May 2026 | |
| 111 | MiniMax-M2 | MiniMax | 36.1 | 77.7% | 82.6% | 78.3% | 82.0% | 12.5% | 205k | T | 108 t/s | 1.72 s | $0.300 | $1.20 | $0.525 | 67.5M | $116 | Oct 2025 | |
| 112 | KAT-Coder-Pro V1 | KwaiKAT | 36.0 | 76.4% | 74.7% | 94.7% | 81.3% | 33.4% | 256k | T | 113 t/s | 1.26 s | $0.300 | $1.20 | $0.525 | 4.5M | $76.16 | Nov 2025 | |
| 113 | Claude 4.1 Opus (Non-reasoning) | Anthropic | 36.0 | — | — | — | — | — | 200k | T 🖼 | 30 t/s | 2.06 s | $18.75 | $75.00 | $32.81 | undefined | — | Aug 2025 | |
| 114 | NVIDIA Nemotron 3 Super 120B A12B (Reasoning) | NVIDIA | 36.0 | 80.0% | — | — | — | 19.2% | 1M | T | 184 t/s | 1.82 s | $0.300 | $0.750 | $0.413 | 104.0M | $140 | Mar 2026 | |
| 115 | Qwen3.5 122B A10B (Non-reasoning) | Alibaba | 35.9 | 82.7% | — | — | — | 14.8% | 262k | T 🖼 | 162 t/s | 2.52 s | $0.400 | $3.20 | $1.10 | 29.3M | $166 | Feb 2026 | |
| 116 | Nova 2.0 Pro Preview (medium) | Amazon | 35.7 | 78.5% | 73.0% | 89.0% | 83.0% | 8.9% | 256k | T 🖼 | 114 t/s | 12.99 s | $1.25 | $10.00 | $3.44 | 36.0M | $467 | Nov 2025 | |
| 117 | MiMo-V2.5-Pro (Non-reasoning) | Xiaomi | 35.6 | 76.2% | — | — | — | 13.3% | 1M | T | 48 t/s | 3.23 s | $0.900 | $2.70 | $1.35 | 28.4M | $633 | Apr 2026 | |
| 118 | GPT-5.4 (Non-reasoning) | OpenAI | 35.4 | 74.8% | — | — | — | 10.6% | 1.1M | T 🖼 | 63 t/s | 0.83 s | $2.50 | $15.00 | $5.63 | 3.9M | $272 | Mar 2026 | |
| 119 | Grok 4 Fast (Reasoning) | xAI | 35.1 | 84.7% | 83.2% | 89.7% | 85.0% | 17.0% | 2M | T 🖼 | — | — | $0.200 | $0.500 | $0.275 | 43.4M | $35.99 | Sep 2025 | |
| 120 | Gemini 3 Flash Preview (Non-reasoning) | 35.0 | 81.2% | 79.7% | 55.7% | 88.2% | 14.1% | 1M | T 🖼 🔊 🎬 | 152 t/s | 8.80 s | $0.500 | $3.00 | $1.13 | 4.1M | $65.98 | Dec 2025 | ||
| 121 | Claude 3.7 Sonnet (Reasoning) | Anthropic | 34.7 | 77.2% | 47.3% | 56.3% | 83.7% | 10.3% | 200k | T 🖼 | — | — | — | — | — | 57.9M | $0.000 | Feb 2025 | |
| 122 | Gemini 2.5 Pro | 34.6 | 84.4% | 80.1% | 87.7% | 86.2% | 21.1% | 1M | T 🖼 🔊 🎬 | 123 t/s | 23.26 s | $1.25 | $10.00 | $3.44 | 54.6M | $648 | Jun 2025 | ||
| 123 | Nova 2.0 Lite (high) | Amazon | 34.5 | 81.1% | 71.1% | 94.3% | 81.8% | 10.9% | 1M | T 🖼 | 148 t/s | 14.33 s | $0.300 | $2.50 | $0.850 | undefined | — | Oct 2025 | |
| 124 | GLM-4.7 (Non-reasoning) | Z AI | 34.2 | 66.4% | 56.2% | 48.0% | 79.4% | 6.1% | 200k | T | 78 t/s | 1.36 s | $0.600 | $2.20 | $1.00 | 13.1M | $147 | Dec 2025 | |
| 125 | DeepSeek V3.1 Terminus (Reasoning) | DeepSeek | 33.9 | 79.2% | 79.8% | 89.7% | 85.1% | 15.2% | 128k | T | — | — | $1.64 | $2.75 | $1.91 | 42.5M | $278 | Sep 2025 | |
| 126 | Hy3-preview (Non-reasoning) | Tencent | 33.7 | 73.2% | — | — | — | 6.3% | 256k | T | 89 t/s | 4.01 s | $0.123 | $0.430 | $0.200 | 13.8M | $36.07 | Apr 2026 | |
| 127 | Ling-2.6-1T | InclusionAI | 33.6 | 75.2% | — | — | — | 8.2% | 262k | T | — | — | $0.300 | $2.50 | $0.850 | 15.7M | $95.05 | Apr 2026 | |
| 128 | GPT-5.2 (Non-reasoning) | OpenAI | 33.6 | 71.2% | 66.9% | 51.0% | 81.4% | 7.3% | 400k | T 🖼 | 63 t/s | 0.91 s | $1.75 | $14.00 | $4.81 | 3.8M | $225 | Dec 2025 | |
| 129 | Doubao Seed Code | ByteDance Seed | 33.5 | 76.4% | 76.6% | 79.3% | 85.4% | 13.3% | 256k | T 🖼 | — | — | — | — | — | 41.0M | $0.000 | Nov 2025 | |
| 130 | Gemini 3.1 Flash-Lite | 33.5 | 82.2% | — | — | — | 16.2% | 1M | T 🖼 🔊 🎬 | 256 t/s | 5.57 s | $0.250 | $1.50 | $0.563 | 52.6M | $93.60 | Mar 2026 | ||
| 131 | gpt-oss-120b (high) | OpenAI | 33.3 | 78.2% | 87.8% | 93.4% | 80.8% | 18.5% | 131k | T | 324 t/s | 0.86 s | $0.150 | $0.600 | $0.262 | 77.7M | $67.37 | Aug 2025 | |
| 132 | o4-mini (high) | OpenAI | 33.1 | 78.4% | 85.9% | 90.7% | 83.2% | 17.5% | 200k | T 🖼 | 153 t/s | 22.06 s | $1.10 | $4.40 | $1.93 | 82.4M | $461 | Apr 2025 | |
| 133 | Claude 4 Opus (Non-reasoning) | Anthropic | 33.0 | 70.1% | 54.2% | 36.3% | 86.0% | 5.9% | 200k | T 🖼 | 31 t/s | 2.00 s | $18.75 | $75.00 | $32.81 | undefined | — | May 2025 | |
| 134 | Claude 4 Sonnet (Non-reasoning) | Anthropic | 33.0 | 68.3% | 44.9% | 38.0% | 83.7% | 4.0% | 1M | T 🖼 | 38 t/s | 1.03 s | $3.75 | $15.00 | $6.56 | 6.0M | $568 | May 2025 | |
| 135 | DeepSeek V3.2 Exp (Reasoning) | DeepSeek | 32.9 | 79.7% | 78.9% | 87.7% | 85.0% | 13.8% | 128k | T | — | — | $0.275 | $0.415 | $0.310 | 39.7M | $39.05 | Sep 2025 | |
| 136 | Mercury 2 | Inception | 32.8 | 77.0% | — | — | — | 15.5% | 128k | T | 744 t/s | 2.54 s | $0.250 | $0.750 | $0.375 | 69.6M | $80.68 | Feb 2026 | |
| 137 | GLM-4.6 (Reasoning) | Z AI | 32.5 | 78.0% | 69.5% | 86.0% | 82.9% | 13.3% | 200k | T | 37 t/s | 3.46 s | $0.550 | $2.20 | $0.963 | 57.4M | $192 | Sep 2025 | |
| 138 | Qwen3 Max Thinking (Preview) | Alibaba | 32.5 | 77.6% | 53.5% | 82.3% | 82.4% | 12.0% | 262k | T | 49 t/s | 4.13 s | $1.20 | $6.00 | $2.40 | 30.5M | $293 | Nov 2025 | |
| 139 | Qwen3.5 9B (Reasoning) | Alibaba | 32.4 | 80.6% | — | — | — | 13.3% | 262k | T 🖼 🎬 | 68 t/s | 2.32 s | $0.100 | $0.150 | $0.113 | 201.8M | $82.06 | Mar 2026 | |
| 140 | Gemma 4 31B (Non-reasoning) | 32.3 | 76.3% | — | — | — | 11.5% | 256k | T 🖼 🎬 | 17 t/s | 1.39 s | $0.140 | $0.400 | $0.205 | 7.1M | $19.43 | Apr 2026 | ||
| 141 | K-EXAONE (Reasoning) | LG AI Research | 32.1 | 78.3% | 76.8% | 90.3% | 83.8% | 13.1% | 256k | T | — | — | — | — | — | 109.5M | $0.000 | Dec 2025 | |
| 142 | DeepSeek V3.2 (Non-reasoning) | DeepSeek | 32.1 | 75.1% | 59.3% | 59.0% | 83.7% | 10.5% | 128k | T | — | — | $0.500 | $1.60 | $0.775 | 15.3M | $197 | Dec 2025 | |
| 143 | Grok 3 mini Reasoning (high) | xAI | 32.1 | 79.1% | 69.6% | 84.7% | 82.8% | 11.1% | 1M | T | 77 t/s | 0.60 s | $0.300 | $0.500 | $0.350 | 99.9M | $105 | Feb 2025 | |
| 144 | Nova 2.0 Pro Preview (low) | Amazon | 31.9 | 75.1% | 63.8% | 63.3% | 82.2% | 5.2% | 256k | T 🖼 | 118 t/s | 11.16 s | $1.25 | $10.00 | $3.44 | 12.5M | $205 | Nov 2025 | |
| 145 | Trinity Large Thinking | Arcee AI | 31.9 | 75.2% | — | — | — | 14.7% | 512k | T | 157 t/s | 1.17 s | $0.235 | $0.875 | $0.395 | 154.8M | $175 | Apr 2026 | |
| 146 | Qwen3.6 35B A3B (Non-reasoning) | Alibaba | 31.5 | 81.7% | — | — | — | 12.5% | 262k | T 🖼 🎬 | 178 t/s | 2.52 s | $0.375 | $2.25 | $0.844 | 24.3M | $189 | Apr 2026 | |
| 147 | Qwen3 Max | Alibaba | 31.4 | 76.4% | 76.7% | 80.7% | 84.1% | 11.1% | 262k | T | 39 t/s | 2.46 s | $1.65 | $7.22 | $3.05 | 12.7M | $258 | Sep 2025 | |
| 148 | Gemma 4 26B A4B (Reasoning) | 31.2 | 79.2% | — | — | — | 18.3% | 256k | T 🖼 🎬 | — | — | $0.130 | $0.400 | $0.198 | 73.0M | $41.51 | Apr 2026 | ||
| 149 | Gemini 2.5 Flash Preview (Sep '25) (Reasoning) | 31.1 | 79.3% | 71.3% | 78.3% | 84.2% | 12.7% | 1M | T 🖼 🔊 🎬 | — | — | — | — | — | 42.9M | $0.000 | Sep 2025 | ||
| 150 | Claude 4.5 Haiku (Non-reasoning) | Anthropic | 31.0 | 64.6% | 51.1% | 39.0% | 80.0% | 4.3% | 200k | T 🖼 | 90 t/s | 0.81 s | $1.25 | $5.00 | $2.19 | 8.3M | $246 | Oct 2025 | |
| 151 | Grok 4.3 (Non-reasoning) | xAI | 31.0 | 65.8% | — | — | — | 6.5% | 1M | T 🖼 | 110 t/s | 0.61 s | $1.25 | $2.50 | $1.56 | 7.8M | $260 | Apr 2026 | |
| 152 | Kimi K2 0905 | Kimi | 30.9 | 76.7% | 61.0% | 57.3% | 81.9% | 6.3% | 256k | T | 24 t/s | 2.56 s | $0.600 | $2.50 | $1.07 | 7.9M | $127 | Sep 2025 | |
| 153 | Claude 3.7 Sonnet (Non-reasoning) | Anthropic | 30.8 | 65.6% | 39.4% | 21.0% | 80.3% | 4.8% | 200k | T 🖼 | — | — | $3.75 | $15.00 | $6.56 | 5.6M | $525 | Feb 2025 | |
| 154 | o1 | OpenAI | 30.7 | 74.7% | 67.9% | 72.3% | 84.1% | 7.7% | 200k | T 🖼 | 98 t/s | 29.01 s | $15.00 | $60.00 | $26.25 | 35.2M | $3.3k | Dec 2024 | |
| 155 | Qwen3.5 35B A3B (Non-reasoning) | Alibaba | 30.7 | 81.9% | — | — | — | 12.8% | 262k | T 🖼 | 155 t/s | 2.13 s | $0.250 | $2.00 | $0.688 | 36.6M | $126 | Feb 2026 | |
| 156 | MiMo-V2-Flash (Non-reasoning) | Xiaomi | 30.3 | 65.6% | 40.2% | 67.7% | 74.4% | 8.0% | 256k | T | 130 t/s | 2.07 s | $0.100 | $0.300 | $0.150 | 16.8M | $21.38 | Dec 2025 | |
| 157 | Gemini 2.5 Pro Preview (Mar' 25) | 30.3 | 83.6% | 77.8% | 87.0% | 85.8% | 17.1% | 1M | T 🖼 🔊 🎬 | — | — | — | — | — | undefined | — | Mar 2025 | ||
| 158 | GLM-4.6 (Non-reasoning) | Z AI | 30.2 | 63.2% | 56.1% | 44.3% | 78.4% | 5.2% | 200k | T | 43 t/s | 3.82 s | $0.600 | $2.20 | $1.00 | 8.7M | $111 | Sep 2025 | |
| 159 | EXAONE 4.5 33B | LG AI Research | 30.2 | 79.4% | — | — | — | 11.6% | 262k | T 🖼 | — | — | — | — | — | 149.2M | $0.000 | Apr 2026 | |
| 160 | GLM-4.7-Flash (Reasoning) | Z AI | 30.1 | 58.1% | — | — | — | 7.1% | 200k | T | 80 t/s | 1.14 s | $0.070 | $0.400 | $0.153 | 63.9M | $40.01 | Jan 2026 | |
| 161 | Nova 2.0 Lite (medium) | Amazon | 29.7 | 76.8% | 66.3% | 88.7% | 81.3% | 8.6% | 1M | T 🖼 | 147 t/s | 21.21 s | $0.300 | $2.50 | $0.850 | 62.0M | $183 | Oct 2025 | |
| 162 | Grok 4.20 0309 (Non-reasoning) | xAI | 29.7 | 78.5% | — | — | — | 22.5% | 2M | T 🖼 | 191 t/s | 0.60 s | $2.00 | $6.00 | $3.00 | 30.4M | $382 | Mar 2026 | |
| 163 | Gemini 2.5 Pro Preview (May' 25) | 29.5 | 82.2% | 77.0% | 84.3% | 83.7% | 15.4% | 1M | T 🔊 🎬 | — | — | $1.25 | $10.00 | $3.44 | undefined | — | May 2025 | ||
| 164 | Qwen3 235B A22B 2507 (Reasoning) | Alibaba | 29.5 | 79.0% | 78.8% | 91.0% | 84.3% | 15.0% | 256k | T | 59 t/s | 2.90 s | $0.400 | $2.15 | $0.838 | 63.3M | $158 | Jul 2025 | |
| 165 | DeepSeek V3.2 Speciale | DeepSeek | 29.4 | 87.1% | 89.6% | 96.7% | 86.3% | 26.1% | 128k | T | — | — | — | — | — | undefined | — | Dec 2025 | |
| 166 | ERNIE 5.0 Thinking Preview | Baidu | 29.1 | 77.7% | 81.2% | 85.0% | 83.0% | 12.7% | 128k | T 🖼 🎬 | — | — | — | — | — | 39.6M | $0.000 | Nov 2025 | |
| 167 | Grok 4.20 0309 v2 (Non-reasoning) | xAI | 29.0 | 77.6% | — | — | — | 24.2% | 2M | T 🖼 | 182 t/s | 0.68 s | $2.00 | $6.00 | $3.00 | 36.0M | $426 | Apr 2026 | |
| 168 | Grok Code Fast 1 | xAI | 28.7 | 72.7% | 65.7% | 43.3% | 79.3% | 7.5% | 256k | T | — | — | — | — | — | 63.5M | $0.000 | Aug 2025 | |
| 169 | DeepSeek V3.1 Terminus (Non-reasoning) | DeepSeek | 28.5 | 75.1% | 52.9% | 53.7% | 83.6% | 8.4% | 128k | T | — | — | $0.270 | $1.00 | $0.453 | 8.1M | $40.89 | Sep 2025 | |
| 170 | DeepSeek V3.2 Exp (Non-reasoning) | DeepSeek | 28.4 | 73.8% | 55.4% | 57.7% | 83.6% | 8.6% | 128k | T | — | — | $0.275 | $0.415 | $0.310 | 10.2M | $46.85 | Sep 2025 | |
| 171 | Nemotron Cascade 2 30B A3B | NVIDIA | 28.4 | 75.8% | — | — | — | 11.4% | 1M | T | — | — | — | — | — | 66.0M | $0.000 | Mar 2026 | |
| 172 | Apriel-v1.5-15B-Thinker | ServiceNow | 28.3 | 71.3% | 72.8% | 87.5% | 77.3% | 12.0% | 128k | T 🖼 | — | — | — | — | — | undefined | — | Sep 2025 | |
| 173 | Qwen3 Coder Next | Alibaba | 28.3 | 73.7% | — | — | — | 9.3% | 256k | T | 107 t/s | 1.62 s | $0.350 | $1.20 | $0.563 | 26.0M | $125 | Feb 2026 | |
| 174 | DeepSeek V3.1 (Non-reasoning) | DeepSeek | 28.1 | 73.5% | 57.7% | 49.7% | 83.3% | 6.3% | 128k | T | — | — | $0.555 | $1.67 | $0.834 | 9.2M | $99.61 | Aug 2025 | |
| 175 | Nova 2.0 Omni (medium) | Amazon | 28.0 | 76.0% | 66.0% | 89.7% | 80.9% | 6.8% | 1M | T 🖼 | — | — | $0.300 | $2.50 | $0.850 | 36.5M | $109 | Nov 2025 | |
| 176 | Mistral Small 4 (Reasoning) | Mistral | 27.8 | 76.9% | — | — | — | 9.5% | 256k | T 🖼 | 181 t/s | 0.71 s | $0.150 | $0.600 | $0.262 | 52.8M | $47.88 | Mar 2026 | |
| 177 | DeepSeek V3.1 (Reasoning) | DeepSeek | 27.7 | 77.9% | 78.4% | 89.7% | 85.1% | 13.0% | 128k | T | — | — | $0.590 | $1.69 | $0.865 | 59.7M | $484 | Aug 2025 | |
| 178 | Qwen3 VL 235B A22B (Reasoning) | Alibaba | 27.6 | 77.2% | 64.6% | 88.3% | 83.6% | 10.1% | 262k | T 🖼 | 32 t/s | 5.60 s | $0.840 | $6.18 | $2.17 | 42.8M | $346 | Sep 2025 | |
| 179 | Apriel-v1.6-15B-Thinker | ServiceNow | 27.6 | 73.3% | 80.7% | 88.0% | 79.0% | 9.8% | 128k | T 🖼 | — | — | — | — | — | 74.7M | $0.000 | Nov 2025 | |
| 180 | GPT-5.1 (Non-reasoning) | OpenAI | 27.4 | 64.3% | 49.4% | 38.0% | 80.1% | 5.2% | 400k | T 🖼 | 118 t/s | 1.05 s | $1.25 | $10.00 | $3.44 | 3.9M | $94.73 | Nov 2025 | |
| 181 | Qwen3.5 9B (Non-reasoning) | Alibaba | 27.3 | 78.6% | — | — | — | 8.6% | 262k | T 🖼 🎬 | — | — | — | — | — | 78.4M | $0.000 | Mar 2026 | |
| 182 | Magistral Medium 1.2 | Mistral | 27.1 | 73.9% | 75.0% | 82.0% | 81.5% | 9.6% | 128k | T 🖼 | 39 t/s | 1.90 s | $2.00 | $5.00 | $2.75 | 43.3M | $387 | Sep 2025 | |
| 183 | Gemma 4 26B A4B (Non-reasoning) | 27.1 | 71.4% | — | — | — | 10.7% | 256k | T 🖼 🎬 | 79 t/s | 1.59 s | $0.130 | $0.400 | $0.198 | 13.9M | $27.25 | Apr 2026 | ||
| 184 | Qwen3.5 4B (Reasoning) | Alibaba | 27.1 | 77.1% | — | — | — | 7.8% | 262k | T 🖼 🎬 | 194 t/s | 0.44 s | $0.030 | $0.150 | $0.060 | 236.7M | $44.87 | Mar 2026 | |
| 185 | DeepSeek R1 0528 (May '25) | DeepSeek | 27.1 | 81.3% | 77.0% | 76.0% | 84.9% | 14.9% | 128k | T | — | — | $1.35 | $4.20 | $2.06 | 49.1M | $280 | May 2025 | |
| 186 | Gemini 2.5 Flash (Reasoning) | 27.0 | 79.0% | 69.5% | 73.3% | 83.2% | 11.1% | 1M | T 🖼 🔊 🎬 | 207 t/s | 19.80 s | $0.300 | $2.50 | $0.850 | 52.3M | $172 | May 2025 | ||
| 187 | GPT-5 nano (high) | OpenAI | 26.8 | 67.6% | 78.9% | 83.7% | 78.0% | 8.2% | 400k | T 🖼 | 176 t/s | 77.30 s | $0.050 | $0.400 | $0.138 | 110.1M | $51.71 | Aug 2025 | |
| 188 | Qwen3 Next 80B A3B (Reasoning) | Alibaba | 26.7 | 75.9% | 78.4% | 84.3% | 82.4% | 11.7% | 262k | T | 136 t/s | 2.31 s | $0.500 | $6.00 | $1.88 | 51.3M | $357 | Sep 2025 | |
| 189 | GLM-4.5 (Reasoning) | Z AI | 26.4 | 78.2% | 73.8% | 73.7% | 83.5% | 12.2% | 128k | T | 49 t/s | 3.53 s | $0.600 | $2.20 | $1.00 | 60.8M | $408 | Jul 2025 | |
| 190 | Kimi K2 | Kimi | 26.3 | 76.6% | 55.6% | 57.0% | 82.4% | 7.0% | 128k | T | 26 t/s | 2.36 s | $0.585 | $2.40 | $1.04 | 11.5M | $181 | Jul 2025 | |
| 191 | GPT-4.1 | OpenAI | 26.3 | 66.6% | 45.7% | 34.7% | 80.6% | 4.6% | 1M | T 🖼 | 123 t/s | 1.09 s | $2.00 | $8.00 | $3.50 | 4.5M | $278 | Apr 2025 | |
| 192 | Ling 2.6 Flash | InclusionAI | 26.2 | 59.3% | — | — | — | 6.2% | 262k | T | — | — | $0.100 | $0.300 | $0.150 | 14.7M | $22.90 | Apr 2026 | |
| 193 | Qwen3 Max (Preview) | Alibaba | 26.1 | 76.4% | 65.1% | 75.0% | 83.8% | 9.3% | 262k | T | 54 t/s | 4.18 s | $1.20 | $6.00 | $2.40 | 7.9M | $250 | Sep 2025 | |
| 194 | GPT-5 nano (medium) | OpenAI | 25.9 | 67.0% | 76.3% | 78.3% | 77.2% | 7.6% | 400k | T 🖼 | 161 t/s | 36.98 s | $0.050 | $0.400 | $0.138 | 45.2M | $25.85 | Aug 2025 | |
| 195 | Solar Pro 3 | Upstage | 25.9 | 72.4% | — | — | — | 10.1% | 128k | T | — | — | — | — | — | 120.5M | $0.000 | Apr 2026 | |
| 196 | Qwen3.5 Omni Flash | Alibaba | 25.9 | 74.2% | — | — | — | 7.1% | 256k | T 🖼 🔊 🎬 | 241 t/s | 1.89 s | $0.100 | $0.800 | $0.275 | 19.4M | $38.80 | Mar 2026 | |
| 197 | o3-mini | OpenAI | 25.9 | 74.8% | 71.7% | 77.0% | 79.1% | 8.7% | 200k | T | 195 t/s | 5.42 s | $1.10 | $4.40 | $1.93 | undefined | — | Jan 2025 | |
| 198 | o1-pro | OpenAI | 25.8 | — | — | — | — | — | 200k | T 🖼 | — | — | $150 | $600 | $263 | undefined | — | Mar 2025 | |
| 199 | Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) | 25.7 | 76.6% | 62.5% | 56.7% | 83.6% | 7.8% | 1M | T 🖼 🔊 🎬 | — | — | — | — | — | 23.4M | $0.000 | Sep 2025 | ||
| 200 | JT-MINI | China Mobile | 25.4 | 67.6% | — | — | — | 6.6% | 128k | T | — | — | — | — | — | 17.6M | $0.000 | Apr 2026 | |
| 201 | o3-mini (high) | OpenAI | 25.2 | 77.3% | 73.4% | 86.0% | 80.2% | 12.3% | 200k | T | 176 t/s | 17.84 s | $1.10 | $4.40 | $1.93 | 61.1M | $315 | Jan 2025 | |
| 202 | Grok 3 | xAI | 25.2 | 69.3% | 42.5% | 58.0% | 79.9% | 5.1% | 1M | T | — | — | $4.00 | $20.00 | $8.00 | 5.3M | $416 | Feb 2025 | |
| 203 | Seed-OSS-36B-Instruct | ByteDance Seed | 25.2 | 72.6% | 76.5% | 84.7% | 81.5% | 9.1% | 512k | T | 37 t/s | 3.61 s | $0.210 | $0.570 | $0.300 | 53.8M | $53.05 | Aug 2025 | |
| 204 | Qwen3 235B A22B 2507 Instruct | Alibaba | 25.0 | 75.3% | 52.4% | 71.7% | 82.8% | 10.6% | 256k | T | 50 t/s | 2.40 s | $0.200 | $0.825 | $0.356 | 14.8M | $71.81 | Jul 2025 | |
| 205 | Qwen3 Coder 480B A35B Instruct | Alibaba | 24.8 | 61.8% | 58.5% | 39.3% | 78.8% | 4.4% | 262k | T | 59 t/s | 2.99 s | $0.300 | $1.80 | $0.675 | 9.0M | $159 | Jul 2025 | |
| 206 | Qwen3 VL 32B (Reasoning) | Alibaba | 24.7 | 73.3% | 73.8% | 84.7% | 81.8% | 9.6% | 256k | T 🖼 | 89 t/s | 2.76 s | $0.700 | $8.40 | $2.63 | 99.7M | $972 | Oct 2025 | |
| 207 | Sonar Reasoning Pro | Perplexity | 24.6 | — | — | 79.0% | — | — | 127k | T | — | — | — | — | — | undefined | — | Jan 2025 | |
| 208 | Nova 2.0 Lite (low) | Amazon | 24.6 | 69.8% | 46.9% | 46.7% | 78.8% | 4.2% | 1M | T 🖼 | 152 t/s | 9.42 s | $0.300 | $2.50 | $0.850 | 19.6M | $72.96 | Oct 2025 | |
| 209 | gpt-oss-20B (high) | OpenAI | 24.5 | 68.8% | 77.7% | 89.3% | 74.8% | 9.8% | 131k | T | 238 t/s | 0.74 s | $0.050 | $0.200 | $0.088 | 61.0M | $19.22 | Aug 2025 | |
| 210 | gpt-oss-120b (low) | OpenAI | 24.5 | 67.2% | 70.7% | 66.7% | 77.5% | 5.2% | 131k | T | 347 t/s | 0.87 s | $0.150 | $0.600 | $0.262 | 7.7M | $15.90 | Aug 2025 | |
| 211 | MiniMax M1 80k | MiniMax | 24.4 | 69.7% | 71.1% | 61.0% | 81.6% | 8.2% | 1M | T | — | — | $0.550 | $2.20 | $0.963 | 24.3M | $115 | Jun 2025 | |
| 212 | GPT-5.4 nano (Non-Reasoning) | OpenAI | 24.4 | 55.8% | — | — | — | 4.2% | 400k | T 🖼 | 148 t/s | 0.63 s | $0.200 | $1.25 | $0.463 | 3.7M | $23.55 | Mar 2026 | |
| 213 | Gemini 2.5 Flash Preview (Reasoning) | 24.3 | 69.8% | 50.5% | 84.3% | 80.0% | 11.6% | 1M | T 🖼 🔊 🎬 | — | — | — | — | — | undefined | — | Apr 2025 | ||
| 214 | NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | 24.3 | 75.7% | 74.1% | 91.0% | 79.4% | 10.2% | 1M | T | 132 t/s | 2.02 s | $0.055 | $0.220 | $0.096 | 143.4M | $42.08 | Dec 2025 | |
| 215 | K2 Think V2 | MBZUAI Institute of Foundation Models | 24.1 | 71.3% | — | — | — | 9.5% | 262k | T | — | — | — | — | — | 99.2M | $0.000 | Dec 2025 | |
| 216 | LongCat Flash Lite | LongCat | 23.9 | 63.6% | — | — | — | 6.0% | 256k | T | 86 t/s | 7.74 s | — | — | — | 15.0M | $0.000 | Jan 2026 | |
| 217 | GPT-5 (minimal) | OpenAI | 23.9 | 67.3% | 55.8% | 31.7% | 80.6% | 5.4% | 400k | T 🖼 | 78 t/s | 1.17 s | $1.25 | $10.00 | $3.44 | 4.0M | $152 | Aug 2025 | |
| 218 | o1-preview | OpenAI | 23.7 | — | — | — | — | — | 128k | — | — | — | $16.50 | $66.00 | $28.88 | undefined | — | Sep 2024 | |
| 219 | HyperCLOVA X SEED Think (32B) | Naver | 23.7 | 61.5% | 62.9% | 59.0% | 78.5% | 5.5% | 128k | T 🖼 🎬 | — | — | — | — | — | 22.8M | $0.000 | Dec 2025 | |
| 220 | Grok 4.1 Fast (Non-reasoning) | xAI | 23.6 | 63.7% | 39.9% | 34.3% | 74.3% | 5.0% | 2M | T 🖼 | — | — | — | — | — | 4.4M | $0.000 | Nov 2025 | |
| 221 | GLM-4.6V (Reasoning) | Z AI | 23.4 | 71.9% | 16.0% | 85.3% | 79.9% | 8.9% | 128k | T 🖼 | 64 t/s | 3.94 s | $0.300 | $0.900 | $0.450 | 90.1M | $185 | Dec 2025 | |
| 222 | K-EXAONE (Non-reasoning) | LG AI Research | 23.4 | 69.5% | — | 44.0% | 81.0% | 5.4% | 256k | T | — | — | — | — | — | 19.2M | $0.000 | Dec 2025 | |
| 223 | GPT-5.4 mini (Non-Reasoning) | OpenAI | 23.3 | 60.6% | — | — | — | 5.7% | 400k | T 🖼 | 143 t/s | 0.71 s | $0.750 | $4.50 | $1.69 | 2.4M | $56.36 | Mar 2026 | |
| 224 | Nova 2.0 Omni (low) | Amazon | 23.2 | 69.9% | 59.2% | 56.0% | 79.8% | 4.0% | 1M | T 🖼 | — | — | $0.300 | $2.50 | $0.850 | 17.6M | $75.63 | Nov 2025 | |
| 225 | GLM-4.5-Air | Z AI | 23.2 | 73.3% | 68.4% | 80.7% | 81.5% | 6.8% | 128k | T | 94 t/s | 3.00 s | $0.170 | $0.980 | $0.373 | 67.9M | $93.38 | Jul 2025 | |
| 226 | Grok 4 Fast (Non-reasoning) | xAI | 23.1 | 60.6% | 40.1% | 41.3% | 73.0% | 5.0% | 2M | T 🖼 | — | — | $0.200 | $0.500 | $0.275 | 4.3M | $17.45 | Sep 2025 | |
| 227 | Nova 2.0 Pro Preview (Non-reasoning) | Amazon | 23.1 | 63.6% | 47.3% | 30.7% | 77.2% | 4.0% | 256k | T 🖼 | 119 t/s | 1.08 s | $1.25 | $10.00 | $3.44 | 43.5M | $1.3k | Nov 2025 | |
| 228 | Mi:dm K 2.5 Pro | Korea Telecom | 23.1 | 70.1% | 65.6% | 76.7% | 80.9% | 7.7% | 128k | T | — | — | — | — | — | 60.9M | $0.000 | Dec 2025 | |
| 229 | GPT-4.1 mini | OpenAI | 22.9 | 66.4% | 48.3% | 46.3% | 78.1% | 4.6% | 1M | T 🖼 | 84 t/s | 0.81 s | $0.400 | $1.60 | $0.700 | 4.6M | $53.73 | Apr 2025 | |
| 230 | Mistral Large 3 | Mistral | 22.8 | 68.0% | 46.5% | 38.0% | 80.7% | 4.1% | 256k | T 🖼 | 52 t/s | 1.08 s | $0.500 | $1.50 | $0.750 | 5.2M | $38.67 | Dec 2025 | |
| 231 | Ring-1T | InclusionAI | 22.8 | 77.4% | 64.3% | 89.3% | 80.6% | 10.2% | 128k | T | — | — | — | — | — | 50.4M | $0.000 | Oct 2025 | |
| 232 | Qwen3.5 4B (Non-reasoning) | Alibaba | 22.6 | 71.2% | — | — | — | 7.5% | 262k | T 🖼 🎬 | 199 t/s | 0.45 s | $0.030 | $0.150 | $0.060 | 75.1M | $26.22 | Mar 2026 | |
| 233 | Qwen3 30B A3B 2507 (Reasoning) | Alibaba | 22.4 | 70.7% | 70.7% | 56.3% | 80.5% | 9.8% | 262k | T | 128 t/s | 2.45 s | $0.280 | $1.85 | $0.673 | 52.8M | $118 | Jul 2025 | |
| 234 | DeepSeek V3 0324 | DeepSeek | 22.3 | 65.5% | 40.5% | 41.0% | 81.9% | 5.2% | 128k | T | — | — | $1.19 | $1.25 | $1.21 | 4.0M | $59.25 | Mar 2025 | |
| 235 | INTELLECT-3 | Prime Intellect | 22.2 | 76.1% | 77.7% | 88.0% | 82.2% | 12.1% | 131k | T | — | — | — | — | — | 73.6M | $0.000 | Nov 2025 | |
| 236 | GLM-4.7-Flash (Non-reasoning) | Z AI | 22.1 | 45.2% | — | — | — | 4.9% | 200k | T | 102 t/s | 1.64 s | $0.070 | $0.400 | $0.153 | 19.8M | $22.19 | Jan 2026 | |
| 237 | Devstral 2 | Mistral | 22.0 | 59.4% | 44.8% | 36.7% | 76.2% | 3.6% | 256k | T | 64 t/s | 1.23 s | — | — | — | 7.4M | $0.000 | Dec 2025 | |
| 238 | GPT-5 (ChatGPT) | OpenAI | 21.8 | 68.6% | 54.3% | 48.3% | 82.0% | 5.8% | 128k | T 🖼 | 168 t/s | 0.80 s | $1.25 | $10.00 | $3.44 | undefined | — | Aug 2025 | |
| 239 | Solar Open 100B (Reasoning) | Upstage | 21.7 | 65.7% | — | — | — | 9.2% | 128k | T | — | — | — | — | — | 120.3M | $0.000 | Dec 2025 | |
| 240 | Grok 3 Reasoning Beta | xAI | 21.6 | — | — | — | — | — | 1M | T | — | — | — | — | — | undefined | — | Feb 2025 | |
| 241 | Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | 21.6 | 70.9% | 68.8% | 68.7% | 80.8% | 6.6% | 1M | T 🖼 🔊 🎬 | — | — | $0.100 | $0.400 | $0.175 | 44.5M | $38.21 | Sep 2025 | ||
| 242 | Nemotron 3 Nano Omni 30B A3B Reasoning | NVIDIA | 21.4 | 46.9% | — | — | — | 5.3% | 256k | T 🖼 🔊 🎬 | 299 t/s | 1.03 s | $0.075 | $0.300 | $0.131 | 78.5M | $39.37 | Apr 2026 | |
| 243 | Mistral Medium 3.1 | Mistral | 21.3 | 58.8% | 40.6% | 38.3% | 68.3% | 4.4% | 128k | T 🖼 | 73 t/s | 1.49 s | $0.400 | $2.00 | $0.800 | 7.6M | $51.65 | Aug 2025 | |
| 244 | MiniMax M1 40k | MiniMax | 20.9 | 68.2% | 65.7% | 13.7% | 80.8% | 7.5% | 1M | T | — | — | — | — | — | undefined | — | Jun 2025 | |
| 245 | gpt-oss-20B (low) | OpenAI | 20.8 | 61.1% | 65.2% | 62.3% | 71.8% | 5.1% | 131k | T | 242 t/s | 0.78 s | $0.060 | $0.200 | $0.095 | 9.7M | $7.68 | Aug 2025 | |
| 246 | Qwen3 VL 235B A22B Instruct | Alibaba | 20.8 | 71.2% | 59.4% | 70.7% | 82.3% | 6.3% | 262k | T 🖼 | 47 t/s | 2.68 s | $0.300 | $1.90 | $0.700 | 10.7M | $121 | Sep 2025 | |
| 247 | GPT-5 mini (minimal) | OpenAI | 20.7 | 68.7% | 54.5% | 46.7% | 77.5% | 5.0% | 400k | T 🖼 | 89 t/s | 0.87 s | $0.250 | $2.00 | $0.688 | 2.9M | $25.16 | Aug 2025 | |
| 248 | K2-V2 (high) | MBZUAI Institute of Foundation Models | 20.6 | 68.1% | 69.4% | 78.3% | 78.6% | 9.8% | 512k | T | — | — | — | — | — | 106.2M | $0.000 | Dec 2025 | |
| 249 | Gemini 2.5 Flash (Non-reasoning) | 20.6 | 68.3% | 49.5% | 60.3% | 80.9% | 5.1% | 1M | T 🖼 🔊 🎬 | 186 t/s | 0.64 s | $0.300 | $2.50 | $0.850 | 17.5M | $84.27 | May 2025 | ||
| 250 | o1-mini | OpenAI | 20.4 | 60.3% | 57.6% | 60.3% | 74.2% | 4.9% | 128k | — | — | — | — | — | — | undefined | — | Sep 2024 | |
| 251 | Qwen3 Next 80B A3B Instruct | Alibaba | 20.1 | 73.8% | 68.4% | 66.3% | 81.9% | 7.3% | 262k | T | 149 t/s | 2.29 s | $0.500 | $2.00 | $0.875 | 14.5M | $133 | Sep 2025 | |
| 252 | Tri-21B-think Preview | Trillion Labs | 20.0 | 53.8% | — | — | — | 5.7% | 32k | T | — | — | — | — | — | 128.5M | $0.000 | Feb 2026 | |
| 253 | Qwen3 Coder 30B A3B Instruct | Alibaba | 20.0 | 51.6% | 40.3% | 29.0% | 70.6% | 4.0% | 262k | T | 92 t/s | 2.69 s | $0.190 | $0.839 | $0.352 | 13.2M | $29.29 | Jul 2025 | |
| 254 | GPT-4.5 (Preview) | OpenAI | 20.0 | — | — | — | — | — | 128k | T 🖼 | — | — | — | — | — | undefined | — | Feb 2025 | |
| 255 | Qwen3 235B A22B (Reasoning) | Alibaba | 19.8 | 70.0% | 62.2% | 82.0% | 82.8% | 11.7% | 33k | T | 51 t/s | 2.84 s | $0.700 | $8.40 | $2.63 | 30.4M | $396 | Apr 2025 | |
| 256 | QwQ 32B | Alibaba | 19.7 | 59.3% | 63.1% | 29.0% | 76.4% | 8.2% | 131k | T | 30 t/s | 2.10 s | $0.660 | $1.00 | $0.745 | undefined | — | Mar 2025 | |
| 257 | Qwen3 VL 30B A3B (Reasoning) | Alibaba | 19.7 | 72.0% | 69.7% | 82.3% | 80.7% | 8.7% | 256k | T 🖼 | 110 t/s | 2.22 s | $0.200 | $0.750 | $0.338 | 42.7M | $86.23 | Oct 2025 | |
| 258 | Gemini 2.0 Flash Thinking Experimental (Jan '25) | 19.6 | 70.1% | 32.1% | 50.0% | 79.8% | 7.1% | 1M | T 🖼 🔊 🎬 | — | — | — | — | — | undefined | — | Jan 2025 | ||
| 259 | Devstral Small 2 | Mistral | 19.5 | 53.2% | 34.8% | 34.3% | 67.8% | 3.4% | 256k | T 🖼 | 68 t/s | 1.13 s | — | — | — | 8.6M | $0.000 | Dec 2025 | |
| 260 | Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | 19.4 | 65.1% | 64.1% | 46.7% | 79.6% | 4.6% | 1M | T 🖼 🔊 🎬 | — | — | $0.100 | $0.400 | $0.175 | 27.8M | $33.39 | Sep 2025 | ||
| 261 | Motif-2-12.7B-Reasoning | Motif Technologies | 19.1 | 69.5% | 65.1% | 80.3% | 79.6% | 8.2% | 128k | T | — | — | — | — | — | 125.2M | $0.000 | Dec 2025 | |
| 262 | Ling-1T | InclusionAI | 19.0 | 71.9% | 67.7% | 71.3% | 82.2% | 7.2% | 128k | T | — | — | — | — | — | 12.4M | $0.000 | Oct 2025 | |
| 263 | Nova Premier | Amazon | 19.0 | 56.9% | 31.7% | 17.3% | 73.3% | 4.7% | 1M | T | 35 t/s | 2.92 s | $2.50 | $12.50 | $5.00 | 4.2M | $376 | Apr 2025 | |
| 264 | DeepSeek R1 (Jan '25) | DeepSeek | 18.8 | 70.8% | 61.7% | 68.0% | 84.4% | 9.3% | 128k | T | — | — | $1.68 | $4.70 | $2.43 | 62.3M | $1.6k | Jan 2025 | |
| 265 | Solar Pro 2 (Preview) (Reasoning) | Upstage | 18.8 | 57.8% | 46.2% | 66.3% | 76.8% | 5.7% | 64k | T | — | — | — | — | — | undefined | — | May 2025 | |
| 266 | Magistral Medium 1 | Mistral | 18.8 | 67.9% | 52.7% | 40.3% | 75.3% | 9.5% | 40k | T | — | — | — | — | — | 50.0M | $0.000 | Jun 2025 | |
| 267 | Gemma 4 E4B (Reasoning) | 18.8 | 57.6% | — | — | — | 3.7% | 128k | T 🖼 🔊 🎬 | — | — | — | — | — | 22.3M | $0.000 | Apr 2026 | ||
| 268 | Mistral Medium 3 | Mistral | 18.8 | 57.8% | 40.0% | 30.3% | 76.0% | 4.3% | 128k | T 🖼 | 36 t/s | 1.54 s | $0.400 | $2.00 | $0.800 | 4.0M | $40.24 | May 2025 | |
| 269 | K2-V2 (medium) | MBZUAI Institute of Foundation Models | 18.7 | 59.8% | 54.1% | 64.7% | 76.1% | 4.4% | 512k | T | — | — | — | — | — | 15.9M | $0.000 | Dec 2025 | |
| 270 | Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | 18.7 | 74.8% | 73.7% | 76.7% | 81.4% | 6.8% | 128k | T | 47 t/s | 1.34 s | $0.100 | $0.400 | $0.175 | 53.0M | $37.18 | Jul 2025 | |
| 271 | Claude 3.5 Haiku | Anthropic | 18.7 | 40.8% | 31.4% | 3.3% | 63.4% | 3.5% | 200k | 🖼 | — | — | $1.00 | $4.00 | $1.75 | 2.3M | $56.52 | Oct 2024 | |
| 272 | Devstral Medium | Mistral | 18.7 | 49.2% | 33.7% | 4.7% | 70.8% | 3.8% | 256k | T | 87 t/s | 1.44 s | $0.400 | $2.00 | $0.800 | 3.4M | $67.39 | Jul 2025 | |
| 273 | GPT-4o (Aug '24) | OpenAI | 18.6 | 52.1% | 31.7% | 11.7% | — | 2.9% | 128k | 🖼 | 108 t/s | 1.11 s | $2.50 | $10.00 | $4.38 | 2.9M | $221 | Aug 2024 | |
| 274 | Mistral Small 4 (Non-reasoning) | Mistral | 18.6 | 57.1% | — | — | — | 3.7% | 256k | T 🖼 | 159 t/s | 0.69 s | $0.150 | $0.600 | $0.262 | 3.9M | $16.37 | Mar 2026 | |
| 275 | Tri-21B-Think | Trillion Labs | 18.6 | 60.1% | — | — | — | 6.1% | 32k | T | — | — | — | — | — | 57.9M | $0.000 | Feb 2026 | |
| 276 | GPT-4o (March 2025, chatgpt-4o-latest) | OpenAI | 18.6 | 65.5% | 42.5% | 25.7% | 80.3% | 5.0% | 128k | T 🖼 | — | — | — | — | — | undefined | — | Mar 2025 | |
| 277 | Hermes 4 - Llama-3.1 405B (Reasoning) | Nous Research | 18.6 | 72.7% | 68.6% | 69.7% | 82.9% | 10.3% | 128k | T | 39 t/s | 2.34 s | $1.00 | $3.00 | $1.50 | 39.0M | $294 | Aug 2025 | |
| 278 | Gemini 2.0 Flash (Feb '25) | 18.5 | 62.3% | 33.4% | 21.7% | 77.9% | 5.3% | 1M | T 🖼 🔊 🎬 | — | — | $0.150 | $0.600 | $0.262 | 5.0M | $18.86 | Feb 2025 | ||
| 279 | Llama 3.3 Nemotron Super 49B v1 (Reasoning) | NVIDIA | 18.5 | 64.3% | 27.7% | 54.7% | 78.5% | 6.5% | 128k | T | — | — | — | — | — | undefined | — | Mar 2025 | |
| 280 | Llama 4 Maverick | Meta | 18.4 | 67.1% | 39.7% | 19.3% | 80.9% | 4.8% | 1M | T 🖼 | 114 t/s | 0.99 s | $0.350 | $0.850 | $0.475 | 8.2M | $62.28 | Apr 2025 | |
| 281 | Qwen3 4B 2507 (Reasoning) | Alibaba | 18.2 | 66.7% | 64.1% | 82.7% | 74.3% | 5.9% | 262k | T | — | — | — | — | — | 52.9M | $0.000 | Aug 2025 | |
| 282 | Magistral Small 1.2 | Mistral | 18.2 | 66.3% | 72.3% | 80.3% | 76.8% | 6.1% | 128k | T 🖼 | 108 t/s | 0.81 s | $0.500 | $1.50 | $0.750 | 29.3M | $89.83 | Sep 2025 | |
| 283 | Sarvam 105B (high) | Sarvam | 18.2 | 73.8% | — | — | — | 10.1% | 128k | T | 94 t/s | 2.09 s | $0.042 | $0.170 | $0.074 | 70.6M | $18.66 | Mar 2026 | |
| 284 | Gemini 2.0 Pro Experimental (Feb '25) | 18.1 | 62.2% | 34.7% | 36.0% | 80.5% | 6.8% | 2M | T 🖼 🔊 🎬 | — | — | — | — | — | undefined | — | Feb 2025 | ||
| 285 | Nova 2.0 Lite (Non-reasoning) | Amazon | 18.0 | 60.3% | 34.6% | 33.7% | 74.3% | 3.0% | 1M | T 🖼 | 142 t/s | 1.34 s | $0.300 | $2.50 | $0.850 | 66.1M | $411 | Oct 2025 | |
| 286 | Devstral Small (May '25) | Mistral | 18.0 | 43.4% | 25.8% | 6.7% | 63.2% | 4.0% | 256k | T | — | — | — | — | — | 5.1M | $0.000 | May 2025 | |
| 287 | Claude 3 Opus | Anthropic | 18.0 | 48.9% | 27.9% | 3.3% | 69.6% | 3.1% | 200k | 🖼 | — | — | $18.75 | $75.00 | $32.81 | undefined | — | Mar 2024 | |
| 288 | MiniCPM5-1B (Non-reasoning) | OpenBMB | 17.9 | 26.9% | — | — | — | 4.6% | 128k | T | — | — | — | — | — | 12.6M | $0.000 | May 2026 | |
| 289 | Sonar Reasoning | Perplexity | 17.9 | 62.3% | — | 77.0% | — | — | 127k | T | — | — | — | — | — | undefined | — | Jan 2025 | |
| 290 | Gemini 2.5 Flash Preview (Non-reasoning) | 17.8 | 59.4% | 40.6% | 43.3% | 78.3% | 5.0% | 1M | T 🖼 🔊 🎬 | — | — | — | — | — | undefined | — | Apr 2025 | ||
| 291 | Hermes 4 - Llama-3.1 405B (Non-reasoning) | Nous Research | 17.6 | 53.6% | 54.6% | 15.3% | 72.9% | 4.2% | 128k | T | 39 t/s | 2.40 s | $1.00 | $3.00 | $1.50 | 3.9M | $179 | Aug 2025 | |
| 292 | Gemini 2.5 Flash-Lite (Reasoning) | 17.6 | 62.5% | 59.3% | 53.3% | 75.9% | 6.4% | 1M | T 🖼 🔊 🎬 | 240 t/s | 12.72 s | $0.100 | $0.400 | $0.175 | 107.6M | $77.86 | Jun 2025 | ||
| 293 | Llama 3.1 Instruct 405B | Meta | 17.4 | 51.5% | 30.5% | 3.0% | 73.2% | 4.2% | 128k | — | 40 t/s | 2.35 s | $2.75 | $6.50 | $3.69 | 3.9M | $838 | Jul 2024 | |
| 294 | GPT-4o (Nov '24) | OpenAI | 17.3 | 54.3% | 30.9% | 6.0% | 74.8% | 3.3% | 128k | 🖼 | 138 t/s | 0.89 s | $2.50 | $10.00 | $4.38 | 2.7M | $197 | Nov 2024 | |
| 295 | Qwen3 VL 32B Instruct | Alibaba | 17.2 | 67.1% | 51.4% | 68.3% | 79.1% | 6.3% | 256k | T 🖼 | 71 t/s | 2.70 s | $0.700 | $2.80 | $1.23 | 14.8M | $275 | Oct 2025 | |
| 296 | DeepSeek R1 Distill Qwen 32B | DeepSeek | 17.2 | 61.5% | 27.0% | 63.0% | 73.9% | 5.5% | 128k | T | — | — | — | — | — | undefined | — | Jan 2025 | |
| 297 | GLM-4.6V (Non-reasoning) | Z AI | 17.1 | 56.6% | 41.1% | 26.3% | 75.2% | 3.7% | 128k | T 🖼 | 69 t/s | 3.91 s | $0.300 | $0.900 | $0.450 | 6.4M | $61.16 | Dec 2025 | |
| 298 | Qwen3 235B A22B (Non-reasoning) | Alibaba | 17.0 | 61.3% | 34.3% | 23.7% | 76.2% | 4.7% | 33k | T | 55 t/s | 2.79 s | $0.450 | $1.80 | $0.787 | 4.1M | $104 | Apr 2025 | |
| 299 | Magistral Small 1 | Mistral | 16.8 | 64.1% | 51.4% | 41.3% | 74.6% | 7.2% | 40k | T | — | — | — | — | — | undefined | — | Jun 2025 | |
| 300 | Gemini 2.0 Flash (experimental) | 16.8 | 63.6% | 21.0% | 30.0% | 78.2% | 4.7% | 1M | 🖼 | — | — | — | — | — | undefined | — | Dec 2024 | ||
| 301 | EXAONE 4.0 32B (Reasoning) | LG AI Research | 16.7 | 73.9% | 74.7% | 80.0% | 81.8% | 10.5% | 131k | T | — | — | — | — | — | 61.0M | $0.000 | Jul 2025 | |
| 302 | Qwen3 VL 8B (Reasoning) | Alibaba | 16.7 | 57.9% | 35.3% | 30.7% | 74.9% | 3.3% | 256k | T 🖼 | 113 t/s | 2.39 s | $0.180 | $2.10 | $0.660 | 46.4M | $148 | Oct 2025 | |
| 303 | Nova 2.0 Omni (Non-reasoning) | Amazon | 16.6 | 55.5% | 30.5% | 37.0% | 71.9% | 3.9% | 1M | T 🖼 | — | — | $0.300 | $2.50 | $0.850 | 84.6M | $598 | Nov 2025 | |
| 304 | Qwen3 32B (Reasoning) | Alibaba | 16.5 | 66.8% | 54.6% | 73.0% | 79.8% | 8.3% | 33k | T | 88 t/s | 2.49 s | $0.195 | $0.520 | $0.276 | 29.9M | $35.08 | Apr 2025 | |
| 305 | DeepSeek V3 (Dec '24) | DeepSeek | 16.5 | 55.7% | 35.9% | 26.0% | 75.2% | 3.6% | 128k | T | — | — | $0.400 | $0.890 | $0.523 | 2.6M | $26.78 | Dec 2024 | |
| 306 | DeepSeek R1 0528 Qwen3 8B | DeepSeek | 16.4 | 61.2% | 51.3% | 63.7% | 73.9% | 5.6% | 33k | T | — | — | — | — | — | undefined | — | May 2025 | |
| 307 | Qwen3.5 2B (Reasoning) | Alibaba | 16.3 | 45.6% | — | — | — | 2.1% | 262k | T 🖼 | — | — | $0.020 | $0.100 | $0.040 | 389.5M | $45.61 | Mar 2026 | |
| 308 | Qwen2.5 Max | Alibaba | 16.3 | 58.7% | 35.9% | 23.3% | 76.2% | 4.5% | 32k | T | 43 t/s | 3.38 s | $1.60 | $6.40 | $2.80 | undefined | — | Jan 2025 | |
| 309 | Qwen3 14B (Reasoning) | Alibaba | 16.2 | 60.4% | 52.3% | 55.7% | 77.4% | 4.3% | 33k | T | 63 t/s | 2.83 s | $0.235 | $2.22 | $0.731 | 25.9M | $71.22 | Apr 2025 | |
| 310 | Nanbeige4.1-3B | Nanbeige | 16.1 | 84.9% | — | — | — | 10.0% | 256k | T | — | — | — | — | — | 146.2M | $0.000 | Feb 2026 | |
| 311 | Qwen3 VL 30B A3B Instruct | Alibaba | 16.0 | 69.5% | 47.6% | 72.3% | 76.4% | 6.4% | 256k | T 🖼 | 110 t/s | 2.29 s | $0.200 | $0.600 | $0.300 | 20.7M | $117 | Oct 2025 | |
| 312 | Hermes 4 - Llama-3.1 70B (Reasoning) | Nous Research | 16.0 | 69.9% | 65.3% | 68.7% | 81.1% | 7.9% | 128k | T | 85 t/s | 1.43 s | $0.130 | $0.400 | $0.198 | 49.0M | $47.99 | Aug 2025 | |
| 313 | Gemini 1.5 Pro (Sep '24) | 16.0 | 58.9% | 31.6% | 23.0% | 75.0% | 4.9% | 2M | 🖼 | — | — | — | — | — | undefined | — | Sep 2024 | ||
| 314 | Solar Pro 2 (Preview) (Non-reasoning) | Upstage | 16.0 | 54.4% | 38.5% | 29.7% | 72.5% | 3.8% | 64k | T | — | — | — | — | — | undefined | — | May 2025 | |
| 315 | Ministral 3 14B | Mistral | 16.0 | 57.2% | 35.1% | 30.0% | 69.3% | 4.6% | 256k | T 🖼 | 77 t/s | 0.80 s | $0.200 | $0.200 | $0.200 | 10.8M | $23.19 | Dec 2025 | |
| 316 | DeepSeek R1 Distill Llama 70B | DeepSeek | 16.0 | 40.2% | 26.6% | 53.7% | 79.5% | 6.1% | 128k | T | 45 t/s | 1.65 s | $0.700 | $1.05 | $0.787 | undefined | — | Jan 2025 | |
| 317 | Claude 3.5 Sonnet (Oct '24) | Anthropic | 15.9 | 59.9% | 38.1% | 15.7% | 77.2% | 3.9% | 200k | 🖼 | — | — | $3.75 | $15.00 | $6.56 | undefined | — | Oct 2024 | |
| 318 | DeepSeek R1 Distill Qwen 14B | DeepSeek | 15.8 | 48.4% | 37.6% | 55.7% | 74.0% | 4.4% | 128k | T | — | — | — | — | — | undefined | — | Jan 2025 | |
| 319 | Falcon-H1R-7B | TII UAE | 15.8 | 66.1% | 72.4% | 80.0% | 72.5% | 10.8% | 256k | T | — | — | — | — | — | 143.7M | $0.000 | Jan 2026 | |
| 320 | Ling-flash-2.0 | InclusionAI | 15.7 | 65.7% | 58.9% | 65.3% | 77.7% | 6.3% | 128k | T | 80 t/s | 2.53 s | $0.140 | $0.570 | $0.248 | 15.5M | $16.73 | Sep 2025 | |
| 321 | Qwen3 Omni 30B A3B (Reasoning) | Alibaba | 15.6 | 72.6% | 67.9% | 74.0% | 79.2% | 7.3% | 66k | T 🖼 🔊 🎬 | 89 t/s | 1.96 s | $0.250 | $0.970 | $0.430 | 74.7M | $132 | Sep 2025 | |
| 322 | Qwen2.5 Instruct 72B | Alibaba | 15.6 | 49.1% | 27.6% | 14.0% | 72.0% | 4.2% | 131k | — | 28 t/s | 3.50 s | $0.360 | $0.400 | $0.370 | undefined | — | Sep 2024 | |
| 323 | Sonar | Perplexity | 15.5 | 47.1% | 29.5% | 48.7% | 68.9% | 7.3% | 127k | T | — | — | — | — | — | undefined | — | Jan 2025 | |
| 324 | Step3 VL 10B | StepFun | 15.5 | 69.0% | — | — | — | 10.2% | 66k | T 🖼 | — | — | — | — | — | 81.6M | $0.000 | Jan 2026 | |
| 325 | Qwen3 30B A3B (Reasoning) | Alibaba | 15.3 | 61.6% | 50.6% | 72.3% | 77.7% | 6.6% | 33k | T | 62 t/s | 2.44 s | $0.090 | $0.450 | $0.180 | 32.1M | $23.58 | Apr 2025 | |
| 326 | Sonar Pro | Perplexity | 15.2 | 57.8% | 27.5% | 29.0% | 75.5% | 7.9% | 200k | T | — | — | — | — | — | undefined | — | Jan 2025 | |
| 327 | Devstral Small (Jul '25) | Mistral | 15.2 | 41.4% | 25.4% | 29.3% | 62.2% | 3.7% | 256k | T | 201 t/s | 0.69 s | $0.100 | $0.300 | $0.150 | 4.4M | $23.10 | Jul 2025 | |
| 328 | Gemma 4 E2B (Reasoning) | 15.2 | 43.3% | — | — | — | 4.8% | 128k | T 🖼 🔊 🎬 | — | — | — | — | — | 20.0M | $0.000 | Apr 2026 | ||
| 329 | QwQ 32B-Preview | Alibaba | 15.2 | 55.7% | 33.7% | 45.3% | 64.8% | 4.8% | 33k | — | — | — | — | — | — | undefined | — | Nov 2024 | |
| 330 | GLM-4.5V (Reasoning) | Z AI | 15.1 | 68.4% | 60.4% | 73.0% | 78.8% | 5.9% | 64k | T 🖼 | 23 t/s | 3.50 s | $0.600 | $1.80 | $0.900 | 30.7M | $227 | Aug 2025 | |
| 331 | Mistral Large 2 (Nov '24) | Mistral | 15.1 | 48.6% | 29.3% | 14.0% | 69.7% | 4.0% | 128k | — | 32 t/s | 1.71 s | $2.00 | $6.00 | $3.00 | 2.6M | $124 | Nov 2024 | |
| 332 | Mistral Small 3.2 | Mistral | 15.1 | 50.5% | 27.5% | 27.0% | 68.1% | 4.3% | 128k | T 🖼 | 125 t/s | 0.65 s | $0.087 | $0.250 | $0.128 | 4.5M | $19.17 | Jun 2025 | |
| 333 | Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | NVIDIA | 15.0 | 72.8% | 64.1% | 63.7% | 82.5% | 8.1% | 128k | T | 52 t/s | 2.42 s | $0.600 | $1.80 | $0.900 | 47.4M | $487 | Apr 2025 | |
| 334 | Qwen3 30B A3B 2507 Instruct | Alibaba | 15.0 | 65.9% | 51.5% | 66.3% | 77.7% | 6.8% | 262k | T | 115 t/s | 2.00 s | $0.150 | $0.400 | $0.213 | 14.7M | $53.64 | Jul 2025 | |
| 335 | ERNIE 4.5 300B A47B | Baidu | 15.0 | 81.1% | 46.7% | 41.3% | 77.6% | 3.5% | 131k | T | 25 t/s | 3.57 s | $0.280 | $1.10 | $0.485 | undefined | — | Jun 2025 | |
| 336 | Solar Pro 2 (Reasoning) | Upstage | 14.9 | 68.7% | 61.6% | 61.3% | 80.5% | 7.0% | 66k | T | — | — | — | — | — | 32.1M | $0.000 | Jul 2025 | |
| 337 | NVIDIA Nemotron Nano 12B v2 VL (Reasoning) | NVIDIA | 14.9 | 57.2% | 69.4% | 75.0% | 75.9% | 5.3% | 128k | T 🖼 | — | — | $0.200 | $0.600 | $0.300 | 68.5M | $123 | Oct 2025 | |
| 338 | Ministral 3 8B | Mistral | 14.8 | 47.1% | 30.3% | 31.7% | 64.2% | 4.3% | 256k | T 🖼 | 98 t/s | 0.65 s | $0.150 | $0.150 | $0.150 | 13.4M | $21.45 | Dec 2025 | |
| 339 | Gemma 4 E4B (Non-reasoning) | 14.8 | 54.9% | — | — | — | 4.7% | 128k | T 🖼 🔊 🎬 | — | — | — | — | — | 7.9M | $0.000 | Apr 2026 | ||
| 340 | NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | 14.8 | 57.0% | 72.4% | 69.7% | 74.2% | 4.6% | 131k | T | 122 t/s | 0.70 s | $0.040 | $0.160 | $0.070 | 33.0M | $10.53 | Aug 2025 | |
| 341 | Gemini 2.0 Flash-Lite (Feb '25) | 14.7 | 53.5% | 18.5% | 27.7% | 72.4% | 3.6% | 1M | T 🖼 🔊 🎬 | — | — | — | — | — | undefined | — | Feb 2025 | ||
| 342 | Granite 4.1 30B | IBM | 14.7 | 48.1% | — | — | — | 4.2% | 131k | T | — | — | — | — | — | 4.6M | $0.000 | Apr 2026 | |
| 343 | NVIDIA Nemotron 3 Nano 4B | NVIDIA | 14.7 | 51.3% | — | — | — | 4.8% | 262k | T | — | — | — | — | — | 21.8M | $0.000 | Mar 2026 | |
| 344 | Qwen3.5 2B (Non-reasoning) | Alibaba | 14.7 | 43.8% | — | — | — | 4.9% | 262k | T 🖼 🎬 | 247 t/s | 0.42 s | $0.020 | $0.100 | $0.040 | 100.2M | $13.35 | Mar 2026 | |
| 345 | Llama Nemotron Super 49B v1.5 (Non-reasoning) | NVIDIA | 14.6 | 48.1% | 29.0% | 8.0% | 69.2% | 4.3% | 128k | T | 48 t/s | 1.30 s | $0.100 | $0.400 | $0.175 | 6.5M | $20.69 | Jul 2025 | |
| 346 | Qwen3 32B (Non-reasoning) | Alibaba | 14.5 | 53.5% | 28.8% | 19.7% | 72.7% | 4.3% | 33k | T | 89 t/s | 2.57 s | $0.150 | $0.590 | $0.260 | undefined | — | Apr 2025 | |
| 347 | GPT-4o (May '24) | OpenAI | 14.5 | 52.6% | 33.4% | 11.0% | 74.0% | 2.8% | 128k | 🖼 | 114 t/s | 1.12 s | $5.00 | $15.00 | $7.50 | undefined | — | May 2024 | |
| 348 | Llama 3.3 Instruct 70B | Meta | 14.5 | 49.8% | 28.8% | 7.7% | 71.3% | 4.0% | 128k | — | 81 t/s | 1.60 s | $0.585 | $0.710 | $0.616 | 3.8M | $86.26 | Dec 2024 | |
| 349 | Gemini 2.0 Flash-Lite (Preview) | 14.5 | 54.2% | 17.9% | 30.3% | — | 4.4% | 1M | T 🖼 🔊 🎬 | — | — | — | — | — | undefined | — | Feb 2025 | ||
| 350 | Mistral Small 3.1 | Mistral | 14.5 | 45.4% | 21.2% | 3.7% | 65.9% | 4.8% | 128k | T 🖼 | 165 t/s | 0.69 s | $0.105 | $0.235 | $0.138 | 4.7M | $48.02 | Mar 2025 | |
| 351 | K2-V2 (low) | MBZUAI Institute of Foundation Models | 14.4 | 54.1% | 39.3% | 35.3% | 71.3% | 3.9% | 512k | T | — | — | — | — | — | 8.4M | $0.000 | Dec 2025 | |
| 352 | Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) | NVIDIA | 14.4 | 40.8% | 49.3% | 50.0% | 55.6% | 5.1% | 128k | T | — | — | — | — | — | undefined | — | May 2025 | |
| 353 | Kimi Linear 48B A3B Instruct | Kimi | 14.4 | 41.2% | 37.8% | 36.3% | 58.5% | 2.7% | 1M | T | — | — | — | — | — | undefined | — | Oct 2025 | |
| 354 | Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) | NVIDIA | 14.3 | 51.7% | 28.0% | 7.7% | 69.8% | 3.5% | 128k | T | — | — | — | — | — | undefined | — | Mar 2025 | |
| 355 | Qwen3 VL 8B Instruct | Alibaba | 14.3 | 42.7% | 33.2% | 27.3% | 68.6% | 2.9% | 256k | T 🖼 | 120 t/s | 2.28 s | $0.180 | $0.700 | $0.310 | 26.3M | $72.73 | Oct 2025 | |
| 356 | Qwen3 4B (Reasoning) | Alibaba | 14.2 | 52.2% | 46.5% | 22.3% | 69.6% | 5.1% | 32k | T | 96 t/s | 2.44 s | $0.110 | $1.26 | $0.398 | undefined | — | Apr 2025 | |
| 357 | Claude 3.5 Sonnet (June '24) | Anthropic | 14.2 | 56.0% | — | 9.7% | 75.1% | 3.7% | 200k | 🖼 | — | — | $3.75 | $15.00 | $6.56 | undefined | — | Jun 2024 | |
| 358 | Llama 3.1 Tulu3 405B | Allen Institute for AI | 14.1 | 51.6% | 29.1% | 13.3% | 71.6% | 3.5% | 128k | T | — | — | — | — | — | undefined | — | Jan 2025 | |
| 359 | GPT-4o (ChatGPT) | OpenAI | 14.1 | 51.1% | — | 10.3% | 77.3% | 3.7% | 128k | T 🖼 | — | — | — | — | — | undefined | — | Feb 2025 | |
| 360 | Ring-flash-2.0 | InclusionAI | 14.0 | 72.5% | 62.8% | 83.7% | 79.3% | 8.9% | 128k | T | — | — | $0.140 | $0.570 | $0.248 | undefined | — | Sep 2025 | |
| 361 | Pixtral Large | Mistral | 14.0 | 50.5% | 26.1% | 2.3% | 70.1% | 3.6% | 128k | 🖼 | 52 t/s | 1.64 s | $2.00 | $6.00 | $3.00 | undefined | — | Nov 2024 | |
| 362 | Olmo 3.1 32B Think | Allen Institute for AI | 13.9 | 59.1% | 69.5% | 77.3% | 76.3% | 6.0% | 66k | T | — | — | — | — | — | undefined | — | Dec 2025 | |
| 363 | Grok 2 (Dec '24) | xAI | 13.9 | 51.0% | 26.7% | 13.3% | 70.9% | 3.8% | 131k | T | — | — | — | — | — | undefined | — | Dec 2024 | |
| 364 | GPT-5 nano (minimal) | OpenAI | 13.8 | 42.8% | 47.0% | 27.3% | 55.6% | 4.1% | 400k | T 🖼 | 165 t/s | 0.77 s | $0.050 | $0.400 | $0.138 | 9.0M | $24.88 | Aug 2025 | |
| 365 | Gemini 1.5 Flash (Sep '24) | 13.8 | 46.3% | 27.3% | 18.0% | 68.0% | 3.5% | 1M | 🖼 | — | — | — | — | — | undefined | — | Sep 2024 | ||
| 366 | Qwen3 VL 4B (Reasoning) | Alibaba | 13.7 | 49.4% | 32.0% | 25.7% | 70.0% | 4.4% | 256k | T 🖼 | — | — | — | — | — | 62.2M | $0.000 | Oct 2025 | |
| 367 | GPT-4 Turbo | OpenAI | 13.7 | — | 29.1% | 15.0% | 69.4% | 3.3% | 128k | 🖼 | 26 t/s | 3.24 s | $10.00 | $30.00 | $15.00 | undefined | — | Nov 2023 | |
| 368 | Solar Pro 2 (Non-reasoning) | Upstage | 13.6 | 56.1% | 42.4% | 30.0% | 75.0% | 3.8% | 66k | T | — | — | — | — | — | 6.7M | $0.000 | Jul 2025 | |
| 369 | Llama 4 Scout | Meta | 13.5 | 58.7% | 29.9% | 14.0% | 75.2% | 4.3% | 10M | T 🖼 | 106 t/s | 0.86 s | $0.170 | $0.660 | $0.293 | 7.3M | $26.42 | Apr 2025 | |
| 370 | Nova Pro | Amazon | 13.5 | 49.9% | 23.3% | 7.0% | 69.1% | 3.4% | 300k | T 🖼 | — | — | $0.800 | $3.20 | $1.40 | 4.4M | $201 | Dec 2024 | |
| 371 | Command A | Cohere | 13.5 | 52.7% | 28.7% | 13.0% | 71.2% | 4.6% | 256k | T | 54 t/s | 1.76 s | $2.50 | $10.00 | $4.38 | 3.8M | $299 | Mar 2025 | |
| 372 | Llama 3.1 Nemotron Instruct 70B | NVIDIA | 13.4 | 46.5% | 16.9% | 11.0% | 69.0% | 4.6% | 128k | — | 292 t/s | 0.51 s | $1.20 | $1.20 | $1.20 | 3.8M | $73.54 | Oct 2024 | |
| 373 | Grok Beta | xAI | 13.3 | 47.1% | 24.1% | 10.3% | 70.3% | 4.7% | 128k | — | — | — | — | — | — | undefined | — | Aug 2024 | |
| 374 | Qwen2.5 Instruct 32B | Alibaba | 13.2 | 46.6% | 24.8% | 11.0% | 69.7% | 3.8% | 128k | T | — | — | — | — | — | undefined | — | Sep 2024 | |
| 375 | Qwen3 8B (Reasoning) | Alibaba | 13.2 | 58.9% | 40.6% | 19.0% | 74.3% | 4.2% | 131k | T | 39 t/s | 3.73 s | $0.110 | $1.15 | $0.370 | 38.5M | $52.46 | Apr 2025 | |
| 376 | NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) | NVIDIA | 13.2 | 39.9% | 36.0% | 13.3% | 57.9% | 4.6% | 1M | T | 87 t/s | 0.43 s | $0.050 | $0.200 | $0.088 | 12.7M | $13.73 | Dec 2025 | |
| 377 | NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | NVIDIA | 13.2 | 55.7% | 70.1% | 62.3% | 73.9% | 4.0% | 131k | T | 142 t/s | 1.03 s | $0.050 | $0.195 | $0.086 | 22.9M | $15.16 | Aug 2025 | |
| 378 | GPT-4.1 nano | OpenAI | 13.0 | 51.2% | 32.6% | 24.0% | 65.7% | 3.9% | 1M | T 🖼 | 187 t/s | 0.62 s | $0.100 | $0.400 | $0.175 | 5.2M | $31.03 | Apr 2025 | |
| 379 | Mistral Large 2 (Jul '24) | Mistral | 13.0 | 47.2% | 26.7% | 0.0% | 68.3% | 3.2% | 128k | — | — | — | $2.00 | $6.00 | $3.00 | undefined | — | Jul 2024 | |
| 380 | Qwen3 4B 2507 Instruct | Alibaba | 12.9 | 51.7% | 37.7% | 52.3% | 67.2% | 4.7% | 262k | T | — | — | — | — | — | 21.8M | $0.000 | Aug 2025 | |
| 381 | Qwen2.5 Coder Instruct 32B | Alibaba | 12.9 | 41.7% | 29.5% | 12.0% | 63.5% | 3.8% | 131k | — | — | — | — | — | — | undefined | — | Nov 2024 | |
| 382 | Qwen3 14B (Non-reasoning) | Alibaba | 12.8 | 47.0% | 28.0% | 58.0% | 67.5% | 4.2% | 33k | T | 63 t/s | 2.75 s | $0.235 | $0.820 | $0.381 | 4.8M | $18.79 | Apr 2025 | |
| 383 | GPT-4 | OpenAI | 12.8 | — | — | — | — | — | 8k | — | — | — | $30.00 | $60.00 | $37.50 | undefined | — | Mar 2023 | |
| 384 | GLM-4.5V (Non-reasoning) | Z AI | 12.7 | 57.3% | 35.2% | 15.3% | 75.1% | 3.6% | 64k | T 🖼 | 29 t/s | 40.65 s | $0.600 | $1.80 | $0.900 | 7.0M | $203 | Aug 2025 | |
| 385 | Mistral Small 3 | Mistral | 12.7 | 46.2% | 25.2% | 4.3% | 65.2% | 4.1% | 32k | T | 166 t/s | 0.67 s | $0.075 | $0.190 | $0.104 | undefined | — | Jan 2025 | |
| 386 | Gemini 2.5 Flash-Lite (Non-reasoning) | 12.7 | 47.4% | 40.0% | 35.3% | 72.4% | 3.7% | 1M | T 🖼 🔊 🎬 | 227 t/s | 0.60 s | $0.100 | $0.400 | $0.175 | 35.6M | $29.87 | Jun 2025 | ||
| 387 | MiniCPM-V 4.6 1.3B | OpenBMB | 12.7 | 30.5% | — | — | — | 4.9% | 262k | T 🖼 🎬 | — | — | — | — | — | 5.4M | $0.000 | May 2026 | |
| 388 | Nova Lite | Amazon | 12.7 | 43.3% | 16.7% | 7.0% | 59.0% | 4.6% | 300k | 🖼 | 164 t/s | 1.01 s | $0.060 | $0.240 | $0.105 | 7.0M | $21.16 | Dec 2024 | |
| 389 | GPT-4o mini | OpenAI | 12.6 | 42.6% | 23.4% | 14.7% | 64.8% | 4.0% | 128k | 🖼 | 64 t/s | 1.18 s | $0.150 | $0.600 | $0.262 | undefined | — | Jul 2024 | |
| 390 | Hermes 4 - Llama-3.1 70B (Non-reasoning) | Nous Research | 12.6 | 49.1% | 26.9% | 11.3% | 66.4% | 3.6% | 128k | T | 89 t/s | 1.38 s | $0.130 | $0.400 | $0.198 | 6.4M | $29.85 | Aug 2025 | |
| 391 | Qwen3 30B A3B (Non-reasoning) | Alibaba | 12.5 | 51.5% | 32.2% | 21.7% | 71.0% | 4.6% | 33k | T | 60 t/s | 2.55 s | $0.080 | $0.290 | $0.133 | 3.1M | $10.78 | Apr 2025 | |
| 392 | DeepSeek-V2.5 (Dec '24) | DeepSeek | 12.5 | — | — | — | — | — | 128k | — | — | — | — | — | — | undefined | — | Dec 2024 | |
| 393 | Qwen3 4B (Non-reasoning) | Alibaba | 12.5 | 39.8% | 23.3% | 21.3% | 58.6% | 3.7% | 32k | T | 95 t/s | 2.35 s | $0.110 | $0.420 | $0.188 | undefined | — | Apr 2025 | |
| 394 | Llama 3.1 Instruct 70B | Meta | 12.5 | 40.9% | 23.2% | 4.0% | 67.6% | 4.6% | 128k | — | 33 t/s | 1.89 s | $0.560 | $0.560 | $0.560 | 4.7M | $138 | Jul 2024 | |
| 395 | Granite 4.1 8B | IBM | 12.4 | 43.3% | — | — | — | 3.8% | 131k | T | 108 t/s | 0.78 s | $0.050 | $0.100 | $0.063 | 4.0M | $7.48 | Apr 2026 | |
| 396 | Sarvam 30B (high) | Sarvam | 12.3 | 63.3% | — | — | — | 7.0% | 66k | T | 163 t/s | 1.93 s | $0.026 | $0.110 | $0.047 | 76.6M | $15.26 | Mar 2026 | |
| 397 | Gemini 2.0 Flash Thinking Experimental (Dec '24) | 12.3 | — | — | — | — | — | 2M | T 🖼 | — | — | — | — | — | undefined | — | Dec 2024 | ||
| 398 | DeepSeek-V2.5 | DeepSeek | 12.3 | — | — | — | — | — | 128k | — | — | — | — | — | — | undefined | — | Sep 2024 | |
| 399 | Claude 3 Haiku | Anthropic | 12.3 | 37.4% | 15.4% | 1.0% | — | 3.9% | 200k | 🖼 | — | — | $0.250 | $1.25 | $0.500 | 3.1M | $36.44 | Mar 2024 | |
| 400 | Olmo 3.1 32B Instruct | Allen Institute for AI | 12.2 | 53.9% | — | — | — | 4.9% | 66k | T | — | — | — | — | — | 17.6M | $0.000 | Jan 2026 | |
| 401 | Mistral Saba | Mistral | 12.1 | 42.4% | — | 13.0% | 61.1% | 4.1% | 32k | T | — | — | — | — | — | undefined | — | Feb 2025 | |
| 402 | DeepSeek R1 Distill Llama 8B | DeepSeek | 12.1 | 30.2% | 23.3% | 41.3% | 54.3% | 4.2% | 128k | T | — | — | — | — | — | undefined | — | Jan 2025 | |
| 403 | Gemma 4 E2B (Non-reasoning) | 12.1 | 40.5% | — | — | — | 4.5% | 128k | T 🖼 🔊 🎬 | — | — | — | — | — | 8.3M | $0.000 | Apr 2026 | ||
| 404 | Olmo 3 32B Think | Allen Institute for AI | 12.1 | 61.0% | 67.2% | 73.7% | 75.9% | 5.9% | 66k | T | — | — | — | — | — | undefined | — | Nov 2025 | |
| 405 | Gemini 1.5 Pro (May '24) | 12.0 | 37.1% | 24.4% | 8.0% | 65.7% | 3.9% | 2M | 🖼 | — | — | — | — | — | undefined | — | May 2024 | ||
| 406 | R1 1776 | Perplexity | 12.0 | — | — | — | — | — | 128k | T | — | — | — | — | — | undefined | — | Feb 2025 | |
| 407 | Qwen2.5 Turbo | Alibaba | 12.0 | 41.0% | 16.3% | 12.0% | 63.3% | 4.2% | 1M | T | 66 t/s | 2.42 s | $0.050 | $0.200 | $0.088 | undefined | — | Nov 2024 | |
| 408 | Reka Flash (Sep '24) | Reka AI | 12.0 | — | — | — | — | — | 128k | 🖼 | 57 t/s | 2.47 s | $0.200 | $0.800 | $0.350 | undefined | — | Oct 2024 | |
| 409 | Llama 3.2 Instruct 90B (Vision) | Meta | 11.9 | 43.2% | 21.4% | 5.0% | 67.1% | 4.9% | 128k | 🖼 | 58 t/s | 1.23 s | $1.38 | $1.38 | $1.38 | undefined | — | Sep 2024 | |
| 410 | Solar Mini | Upstage | 11.9 | — | — | — | — | — | 4k | — | — | — | $0.150 | $0.150 | $0.150 | undefined | — | Jan 2024 | |
| 411 | Llama 3.1 Instruct 8B | Meta | 11.8 | 25.9% | 11.6% | 4.3% | 47.6% | 5.1% | 128k | — | 157 t/s | 0.81 s | $0.100 | $0.100 | $0.100 | 5.2M | $27.71 | Jul 2024 | |
| 412 | Grok-1 | xAI | 11.7 | — | — | — | — | — | 8k | — | — | — | — | — | — | undefined | — | Mar 2024 | |
| 413 | Qwen2 Instruct 72B | Alibaba | 11.7 | 37.1% | 15.9% | 14.7% | 62.2% | 3.7% | 131k | — | — | — | — | — | — | undefined | — | Jun 2024 | |
| 414 | EXAONE 4.0 32B (Non-reasoning) | LG AI Research | 11.7 | 62.8% | 47.2% | 39.3% | 76.8% | 4.9% | 131k | T | — | — | — | — | — | 4.3M | $0.000 | Jul 2025 | |
| 415 | Ministral 3 3B | Mistral | 11.2 | 35.8% | 24.7% | 22.0% | 52.4% | 5.3% | 256k | T 🖼 | 199 t/s | 0.51 s | $0.100 | $0.100 | $0.100 | 15.5M | $15.93 | Dec 2025 | |
| 416 | Gemini 1.5 Flash-8B | 11.1 | 35.9% | 21.7% | 3.3% | 56.9% | 4.5% | 1.0M | 🖼 | — | — | — | — | — | undefined | — | Oct 2024 | ||
| 417 | DeepHermes 3 - Mistral 24B Preview (Non-reasoning) | Nous Research | 10.9 | 38.2% | 19.5% | 4.7% | 58.0% | 3.9% | 32k | T | — | — | — | — | — | undefined | — | Mar 2025 | |
| 418 | Jamba 1.7 Large | AI21 Labs | 10.9 | 39.0% | 18.1% | 2.3% | 57.7% | 3.8% | 256k | T | 62 t/s | 1.61 s | $2.00 | $8.00 | $3.50 | 8.1M | $965 | Jul 2025 | |
| 419 | Granite 4.0 H Small | IBM | 10.8 | 41.6% | 25.1% | 13.7% | 62.4% | 3.7% | 128k | T | 350 t/s | 10.40 s | $0.060 | $0.250 | $0.108 | 2.3M | $4.48 | Sep 2025 | |
| 420 | Jamba 1.5 Large | AI21 Labs | 10.7 | 42.7% | 14.3% | 4.7% | 57.2% | 4.0% | 256k | — | — | — | $2.00 | $8.00 | $3.50 | undefined | — | Aug 2024 | |
| 421 | Qwen3 Omni 30B A3B Instruct | Alibaba | 10.7 | 62.0% | 42.2% | 52.3% | 72.5% | 5.1% | 66k | T 🖼 🔊 🎬 | 96 t/s | 2.06 s | $0.250 | $0.970 | $0.430 | 36.5M | $85.07 | Sep 2025 | |
| 422 | Hermes 3 - Llama-3.1 70B | Nous Research | 10.6 | 40.1% | 18.8% | 2.3% | 57.1% | 4.1% | 128k | — | 33 t/s | 1.91 s | $0.300 | $0.300 | $0.300 | undefined | — | Aug 2024 | |
| 423 | Qwen3 8B (Non-reasoning) | Alibaba | 10.6 | 45.2% | 20.2% | 24.3% | 64.3% | 2.8% | 33k | T | 38 t/s | 3.91 s | $0.180 | $0.200 | $0.185 | 11.8M | $34.73 | Apr 2025 | |
| 424 | DeepSeek-Coder-V2 | DeepSeek | 10.6 | — | — | — | — | — | 128k | — | — | — | — | — | — | undefined | — | Jun 2024 | |
| 425 | OLMo 2 32B | Allen Institute for AI | 10.6 | 32.8% | 6.8% | 3.3% | 51.1% | 3.7% | 4k | T | — | — | — | — | — | undefined | — | Mar 2025 | |
| 426 | Jamba 1.6 Large | AI21 Labs | 10.6 | 38.7% | 17.2% | 4.7% | 56.5% | 4.0% | 256k | T | 62 t/s | 1.63 s | $2.00 | $8.00 | $3.50 | undefined | — | Mar 2025 | |
| 427 | Qwen3.5 0.8B (Reasoning) | Alibaba | 10.5 | 11.1% | — | — | — | 1.2% | 262k | T 🖼 🎬 | — | — | $0.010 | $0.050 | $0.020 | 232.8M | $14.66 | Mar 2026 | |
| 428 | LFM2 24B A2B | Liquid AI | 10.5 | 47.4% | — | — | — | 4.4% | 33k | T | 120 t/s | 0.58 s | $0.030 | $0.120 | $0.053 | 11.2M | $8.78 | Feb 2026 | |
| 429 | Gemini 1.5 Flash (May '24) | 10.5 | 32.4% | 19.6% | 9.3% | 57.4% | 4.2% | 1M | 🖼 | — | — | — | — | — | undefined | — | May 2024 | ||
| 430 | Phi-4 | Microsoft | 10.4 | 57.5% | 23.1% | 18.0% | 71.4% | 4.1% | 16k | — | 38 t/s | 2.04 s | $0.125 | $0.500 | $0.219 | undefined | — | Dec 2024 | |
| 431 | Gemma 3 27B Instruct | 10.3 | 42.8% | 13.7% | 20.7% | 66.9% | 4.7% | 128k | T 🖼 | — | — | $0.110 | $0.250 | $0.145 | 4.0M | $19.52 | Mar 2025 | ||
| 432 | Claude 3 Sonnet | Anthropic | 10.3 | 40.0% | 17.5% | 4.7% | 57.9% | 3.8% | 200k | 🖼 | — | — | $3.00 | $15.00 | $6.00 | undefined | — | Mar 2024 | |
| 433 | Nova Micro | Amazon | 10.3 | 35.8% | 14.0% | 6.0% | 53.1% | 4.7% | 130k | — | 290 t/s | 0.92 s | $0.035 | $0.140 | $0.061 | 7.1M | $6.38 | Dec 2024 | |
| 434 | Mistral Small (Sep '24) | Mistral | 10.2 | 38.1% | 14.1% | 6.3% | 52.9% | 4.3% | 33k | — | 166 t/s | 0.72 s | $0.200 | $0.600 | $0.300 | undefined | — | Sep 2024 | |
| 435 | Gemini 1.0 Ultra | 10.1 | — | — | — | — | — | 33k | T | — | — | — | — | — | undefined | — | Dec 2023 | ||
| 436 | Phi-3 Mini Instruct 3.8B | Microsoft | 10.1 | 31.9% | 11.6% | 0.3% | 43.5% | 4.4% | 4k | — | — | — | — | — | — | undefined | — | Apr 2024 | |
| 437 | NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) | NVIDIA | 10.1 | 43.9% | 34.5% | 26.7% | 64.9% | 4.5% | 128k | T 🖼 | 227 t/s | 1.06 s | $0.200 | $0.600 | $0.300 | 7.8M | $78.55 | Oct 2025 | |
| 438 | Gemma 3n E4B Instruct Preview (May '25) | 10.1 | 27.8% | 13.8% | 10.7% | 48.3% | 4.9% | 32k | T | — | — | — | — | — | undefined | — | May 2025 | ||
| 439 | Phi-4 Multimodal Instruct | Microsoft | 10.0 | 31.5% | 13.1% | 9.3% | 48.5% | 4.4% | 128k | T 🖼 🔊 | 16 t/s | 0.89 s | — | — | — | undefined | — | Feb 2025 | |
| 440 | Qwen2.5 Coder Instruct 7B | Alibaba | 10.0 | 33.9% | 12.6% | 5.3% | 47.3% | 4.8% | 131k | — | — | — | — | — | — | undefined | — | Sep 2024 | |
| 441 | Qwen3.5 0.8B (Non-reasoning) | Alibaba | 9.9 | 23.6% | — | — | — | 4.9% | 262k | T 🖼 🎬 | 74 t/s | 0.44 s | $0.010 | $0.050 | $0.020 | 101.5M | $6.67 | Mar 2026 | |
| 442 | Mistral Large (Feb '24) | Mistral | 9.9 | 35.1% | 17.8% | 0.0% | 51.5% | 3.4% | 33k | — | — | — | $4.00 | $12.00 | $6.00 | undefined | — | Feb 2024 | |
| 443 | Mixtral 8x22B Instruct | Mistral | 9.8 | 33.2% | 14.8% | 0.0% | 53.7% | 4.1% | 65k | — | — | — | — | — | — | undefined | — | Apr 2024 | |
| 444 | Llama 2 Chat 7B | Meta | 9.7 | 22.7% | 0.2% | 0.0% | 16.4% | 5.8% | 4k | — | — | — | $0.050 | $0.250 | $0.100 | undefined | — | Jul 2023 | |
| 445 | Llama 3.2 Instruct 3B | Meta | 9.7 | 25.5% | 8.3% | 3.3% | 34.7% | 5.2% | 128k | — | 51 t/s | 1.10 s | $0.150 | $0.150 | $0.150 | undefined | — | Sep 2024 | |
| 446 | Jamba Reasoning 3B | AI21 Labs | 9.6 | 33.3% | 21.0% | 10.7% | 57.7% | 4.6% | 262k | T | — | — | — | — | — | 31.9M | $0.000 | Oct 2025 | |
| 447 | Qwen3 VL 4B Instruct | Alibaba | 9.6 | 37.1% | 29.0% | 37.0% | 63.4% | 3.7% | 256k | T 🖼 | — | — | — | — | — | 37.4M | $0.000 | Oct 2025 | |
| 448 | Qwen1.5 Chat 110B | Alibaba | 9.5 | 28.9% | — | — | — | — | 32k | T | — | — | — | — | — | undefined | — | Apr 2024 | |
| 449 | Reka Flash 3 | Reka AI | 9.5 | 52.9% | 43.5% | 33.7% | 66.9% | 5.1% | 128k | T | — | — | $0.200 | $0.800 | $0.350 | undefined | — | Mar 2025 | |
| 450 | Olmo 3 7B Think | Allen Institute for AI | 9.4 | 51.6% | 61.7% | 70.7% | 65.5% | 5.7% | 66k | T | — | — | — | — | — | undefined | — | Nov 2025 | |
| 451 | Claude 2.1 | Anthropic | 9.3 | 31.9% | 19.5% | 3.3% | 49.5% | 4.2% | 200k | — | — | — | — | — | — | undefined | — | Nov 2023 | |
| 452 | OLMo 2 7B | Allen Institute for AI | 9.3 | 28.8% | 4.1% | 0.7% | 28.2% | 5.5% | 4k | T | — | — | — | — | — | undefined | — | Nov 2024 | |
| 453 | Molmo 7B-D | Allen Institute for AI | 9.2 | 24.0% | 3.9% | 0.0% | 37.1% | 5.1% | 4k | T 🖼 | — | — | — | — | — | undefined | — | Sep 2024 | |
| 454 | Ling-mini-2.0 | InclusionAI | 9.2 | 56.2% | 42.9% | 49.3% | 67.1% | 5.0% | 131k | T | — | — | — | — | — | 23.6M | $0.000 | Sep 2025 | |
| 455 | DeepSeek R1 Distill Qwen 1.5B | DeepSeek | 9.1 | 9.8% | 7.0% | 22.0% | 26.9% | 3.3% | 128k | T | — | — | — | — | — | undefined | — | Jan 2025 | |
| 456 | Claude 2.0 | Anthropic | 9.1 | 34.4% | 17.1% | 0.0% | 48.6% | — | 100k | — | — | — | — | — | — | undefined | — | Jul 2023 | |
| 457 | DeepSeek-V2-Chat | DeepSeek | 9.1 | — | — | — | — | — | 128k | — | — | — | — | — | — | undefined | — | May 2024 | |
| 458 | Mistral Small (Feb '24) | Mistral | 9.0 | 30.2% | 11.1% | 0.7% | 41.9% | 4.4% | 33k | — | 170 t/s | 0.81 s | $1.00 | $3.00 | $1.50 | undefined | — | Feb 2024 | |
| 459 | Mistral Medium | Mistral | 9.0 | 34.9% | 9.9% | 3.7% | 49.1% | 3.4% | 33k | — | 70 t/s | 1.53 s | $2.75 | $8.10 | $4.09 | undefined | — | Dec 2023 | |
| 460 | GPT-3.5 Turbo | OpenAI | 9.0 | 29.7% | — | — | 46.2% | — | 4k | — | — | — | $0.500 | $1.50 | $0.750 | undefined | — | Nov 2022 | |
| 461 | Llama 3 Instruct 70B | Meta | 8.9 | 37.9% | 19.8% | 0.0% | 57.4% | 4.4% | 8k | — | — | — | $0.650 | $2.75 | $1.18 | undefined | — | Apr 2024 | |
| 462 | Arctic Instruct | Snowflake | 8.8 | — | — | — | — | — | 4k | — | — | — | — | — | — | undefined | — | Apr 2024 | |
| 463 | Qwen Chat 72B | Alibaba | 8.8 | — | — | — | — | — | 34k | T | — | — | — | — | — | undefined | — | Nov 2023 | |
| 464 | Gemma 3 12B Instruct | 8.8 | 34.9% | 13.7% | 18.3% | 59.5% | 4.8% | 128k | T 🖼 | — | — | $0.090 | $0.290 | $0.140 | 5.3M | $20.16 | Mar 2025 | ||
| 465 | LFM 40B | Liquid AI | 8.8 | 32.7% | 9.6% | 2.3% | 42.5% | 4.9% | 32k | 🖼 | — | — | — | — | — | undefined | — | Sep 2024 | |
| 466 | Llama 3.2 Instruct 11B (Vision) | Meta | 8.7 | 22.1% | 11.0% | 1.7% | 46.4% | 5.2% | 128k | 🖼 | 53 t/s | 0.70 s | $0.245 | $0.245 | $0.245 | undefined | — | Sep 2024 | |
| 467 | PALM-2 | 8.6 | — | — | — | — | — | 8k | T | — | — | — | — | — | undefined | — | May 2023 | ||
| 468 | Granite 4.1 3B | IBM | 8.5 | 31.4% | — | — | — | 3.4% | 131k | T | — | — | — | — | — | 2.7M | $0.000 | Apr 2026 | |
| 469 | Gemini 1.0 Pro | 8.5 | 27.7% | 11.6% | 0.7% | 43.1% | 4.6% | 33k | 🖼 | — | — | — | — | — | undefined | — | Dec 2023 | ||
| 470 | DeepSeek Coder V2 Lite Instruct | DeepSeek | 8.5 | 31.9% | 15.8% | — | 42.9% | 5.3% | 128k | — | — | — | — | — | — | undefined | — | Jun 2024 | |
| 471 | Sarvam M (Reasoning) | Sarvam | 8.4 | 41.6% | 29.5% | 20.3% | 69.6% | 3.3% | 33k | T | — | — | — | — | — | undefined | — | May 2025 | |
| 472 | Phi-4 Mini Instruct | Microsoft | 8.4 | 33.1% | 12.6% | 6.7% | 46.5% | 4.2% | 128k | — | — | — | — | — | — | 30.5M | $0.000 | Feb 2024 | |
| 473 | Llama 2 Chat 70B | Meta | 8.4 | 32.7% | 9.8% | 0.0% | 40.6% | 5.0% | 4k | — | — | — | — | — | — | undefined | — | Jul 2023 | |
| 474 | DeepSeek LLM 67B Chat (V1) | DeepSeek | 8.4 | — | — | — | — | — | 4k | T | — | — | — | — | — | undefined | — | Nov 2023 | |
| 475 | Llama 2 Chat 13B | Meta | 8.4 | 32.1% | 9.8% | 1.7% | 40.6% | 4.7% | 4k | — | — | — | — | — | — | undefined | — | Jul 2023 | |
| 476 | Command-R+ (Apr '24) | Cohere | 8.3 | 32.3% | 12.2% | 0.7% | 43.2% | 4.5% | 128k | — | — | — | $3.00 | $15.00 | $6.00 | undefined | — | Apr 2024 | |
| 477 | OpenChat 3.5 (1210) | OpenChat | 8.3 | 23.0% | 11.5% | 0.0% | 31.0% | 4.8% | 8k | — | — | — | — | — | — | undefined | — | Dec 2023 | |
| 478 | DBRX Instruct | Databricks | 8.3 | 33.1% | 9.3% | 3.0% | 39.7% | 6.6% | 33k | — | — | — | — | — | — | undefined | — | Mar 2024 | |
| 479 | Exaone 4.0 1.2B (Reasoning) | LG AI Research | 8.3 | 51.5% | 51.6% | 50.3% | 58.8% | 5.8% | 64k | T | — | — | — | — | — | 29.1M | $0.000 | Jul 2025 | |
| 480 | Olmo 3 7B Instruct | Allen Institute for AI | 8.1 | 40.0% | 26.6% | 41.3% | 52.2% | 5.8% | 66k | T | — | — | $0.100 | $0.200 | $0.125 | 11.3M | $29.68 | Nov 2025 | |
| 481 | Exaone 4.0 1.2B (Non-reasoning) | LG AI Research | 8.1 | 42.4% | 29.3% | 24.0% | 50.0% | 5.8% | 64k | T | — | — | — | — | — | 2.0M | $0.000 | Jul 2025 | |
| 482 | LFM2.5-1.2B-Thinking | Liquid AI | 8.1 | 33.9% | — | — | — | 6.1% | 32k | T | — | — | — | — | — | 31.1M | $0.000 | Jan 2026 | |
| 483 | Jamba 1.7 Mini | AI21 Labs | 8.1 | 32.2% | 6.1% | 0.3% | 38.8% | 4.5% | 258k | T | — | — | — | — | — | 5.3M | $0.000 | Jul 2025 | |
| 484 | LFM2 2.6B | Liquid AI | 8.0 | 30.6% | 8.1% | 8.3% | 29.8% | 5.2% | 33k | T | — | — | — | — | — | 7.8M | $0.000 | Sep 2025 | |
| 485 | LFM2.5-1.2B-Instruct | Liquid AI | 8.0 | 32.6% | — | — | — | 6.8% | 32k | T | — | — | — | — | — | 4.6M | $0.000 | Jan 2026 | |
| 486 | Jamba 1.5 Mini | AI21 Labs | 8.0 | 30.2% | 6.2% | 1.0% | 37.1% | 5.1% | 256k | — | — | — | $0.200 | $0.400 | $0.250 | undefined | — | Aug 2024 | |
| 487 | Granite 4.0 H 1B | IBM | 8.0 | 26.3% | 11.5% | 6.3% | 27.7% | 5.0% | 128k | T | — | — | — | — | — | 2.9M | $0.000 | Oct 2025 | |
| 488 | Qwen3 1.7B (Reasoning) | Alibaba | 8.0 | 35.6% | 30.8% | 38.7% | 57.0% | 4.8% | 32k | T | 128 t/s | 1.84 s | $0.110 | $1.26 | $0.398 | 36.7M | $52.53 | Apr 2025 | |
| 489 | Jamba 1.6 Mini | AI21 Labs | 7.9 | 30.0% | 7.1% | 3.3% | 36.7% | 4.6% | 256k | T | 185 t/s | 1.02 s | $0.200 | $0.400 | $0.250 | undefined | — | Mar 2025 | |
| 490 | Mixtral 8x7B Instruct | Mistral | 7.7 | 29.2% | 6.6% | 0.0% | 38.7% | 4.5% | 33k | — | — | — | $0.450 | $0.700 | $0.513 | undefined | — | Dec 2023 | |
| 491 | Gemma 3 270M | 7.7 | 22.4% | 0.3% | 2.3% | 5.5% | 4.2% | 32k | T | — | — | — | — | — | undefined | — | Aug 2025 | ||
| 492 | Apertus 70B Instruct | Swiss AI Initiative | 7.7 | 27.2% | — | — | — | 5.5% | 66k | T | — | — | $0.820 | $2.92 | $1.34 | undefined | — | Sep 2025 | |
| 493 | Granite 4.0 Micro | IBM | 7.7 | 33.6% | 18.0% | 6.0% | 44.7% | 5.1% | 128k | T | — | — | — | — | — | 6.6M | $0.000 | Sep 2025 | |
| 494 | DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) | Nous Research | 7.6 | 27.0% | 8.5% | 0.0% | 36.5% | 4.3% | 128k | T | — | — | — | — | — | undefined | — | Feb 2025 | |
| 495 | Llama 65B | Meta | 7.4 | — | — | — | — | — | 2k | T | — | — | — | — | — | undefined | — | Feb 2023 | |
| 496 | Qwen Chat 14B | Alibaba | 7.4 | — | — | — | — | — | 8k | T | — | — | — | — | — | undefined | — | Sep 2023 | |
| 497 | Claude Instant | Anthropic | 7.4 | 33.0% | 10.9% | 0.0% | 43.4% | 3.8% | 100k | — | — | — | — | — | — | undefined | — | Mar 2023 | |
| 498 | Mistral 7B Instruct | Mistral | 7.4 | 17.7% | 4.6% | 0.0% | 24.5% | 4.3% | 8k | — | 106 t/s | 0.66 s | $0.200 | $0.225 | $0.206 | undefined | — | Sep 2023 | |
| 499 | Command-R (Mar '24) | Cohere | 7.4 | 28.4% | 4.8% | 0.7% | 33.8% | 4.8% | 128k | — | — | — | $0.500 | $1.50 | $0.750 | undefined | — | Mar 2024 | |
| 500 | Granite 4.0 1B | IBM | 7.3 | 28.1% | 4.7% | 6.3% | 32.5% | 5.1% | 128k | T | — | — | — | — | — | 3.9M | $0.000 | Oct 2025 | |
| 501 | Molmo2-8B | Allen Institute for AI | 7.3 | 42.5% | — | — | — | 4.4% | 37k | T 🖼 🎬 | — | — | — | — | — | undefined | — | Dec 2025 | |
| 502 | LFM2 8B A1B | Liquid AI | 7.0 | 34.4% | 15.1% | 25.3% | 50.5% | 4.9% | 33k | T | — | — | — | — | — | 7.8M | $0.000 | Oct 2025 | |
| 503 | Granite 3.3 8B (Non-reasoning) | IBM | 7.0 | 33.8% | 12.7% | 6.7% | 46.8% | 4.2% | 128k | T | 336 t/s | 26.24 s | $0.030 | $0.250 | $0.085 | 8.3M | $13.36 | Apr 2025 | |
| 504 | Qwen3 1.7B (Non-reasoning) | Alibaba | 6.8 | 28.3% | 12.6% | 7.3% | 41.1% | 5.2% | 32k | T | 128 t/s | 1.78 s | $0.110 | $0.420 | $0.188 | 4.4M | $15.77 | Apr 2025 | |
| 505 | Qwen3 0.6B (Reasoning) | Alibaba | 6.5 | 23.9% | 12.1% | 18.0% | 34.7% | 5.7% | 32k | T | 191 t/s | 1.74 s | $0.110 | $1.26 | $0.398 | 19.5M | $30.42 | Apr 2025 | |
| 506 | Llama 3 Instruct 8B | Meta | 6.4 | 29.6% | 9.6% | 0.0% | 40.5% | 5.1% | 8k | — | — | — | $0.045 | $0.145 | $0.070 | undefined | — | Apr 2024 | |
| 507 | Gemma 3n E4B Instruct | 6.4 | 29.6% | 14.6% | 14.3% | 48.8% | 4.4% | 32k | T 🖼 | 50 t/s | 1.42 s | $0.020 | $0.040 | $0.025 | 10.5M | $5.72 | Jun 2025 | ||
| 508 | LFM2 1.2B | Liquid AI | 6.3 | 22.8% | 2.0% | 3.3% | 25.7% | 5.7% | 33k | T | — | — | — | — | — | 18.6M | $0.000 | Jul 2025 | |
| 509 | Gemma 3 4B Instruct | 6.3 | 29.1% | 11.2% | 12.7% | 41.7% | 5.2% | 128k | T 🖼 | — | — | $0.040 | $0.080 | $0.050 | 3.8M | $6.52 | Mar 2025 | ||
| 510 | Llama 3.2 Instruct 1B | Meta | 6.3 | 19.6% | 1.9% | 0.0% | 20.0% | 5.3% | 128k | — | 87 t/s | 0.93 s | $0.050 | $0.050 | $0.050 | undefined | — | Sep 2024 | |
| 511 | LFM2.5-VL-1.6B | Liquid AI | 6.2 | 28.9% | — | — | — | 5.1% | 32k | T 🖼 | — | — | — | — | — | 8.2M | $0.000 | Jan 2026 | |
| 512 | Granite 4.0 350M | IBM | 6.1 | 26.1% | 2.4% | 0.0% | 12.4% | 5.7% | 33k | T | — | — | — | — | — | 7.5M | $0.000 | Oct 2025 | |
| 513 | Apertus 8B Instruct | Swiss AI Initiative | 5.9 | 25.6% | — | — | — | 5.0% | 66k | T | — | — | $0.100 | $0.200 | $0.125 | undefined | — | Sep 2025 | |
| 514 | Qwen3 0.6B (Non-reasoning) | Alibaba | 5.7 | 23.1% | 7.3% | 10.3% | 23.1% | 5.2% | 32k | T | 189 t/s | 1.67 s | $0.110 | $0.420 | $0.188 | 2.9M | $21.34 | Apr 2025 | |
| 515 | Gemma 3 1B Instruct | 5.6 | 23.7% | 1.7% | 3.3% | 13.5% | 5.2% | 32k | T | — | — | — | — | — | undefined | — | Mar 2025 | ||
| 516 | Granite 4.0 H 350M | IBM | 5.4 | 25.7% | 1.9% | 1.3% | 12.7% | 6.4% | 33k | T | — | — | — | — | — | 1.8M | $0.000 | Oct 2025 | |
| 517 | Gemma 3n E2B Instruct | 4.8 | 22.9% | 9.5% | 10.3% | 37.8% | 4.0% | 32k | T 🖼 | — | — | — | — | — | undefined | — | Jun 2025 | ||
| 518 | Tiny Aya Global | Cohere | 4.7 | 30.5% | — | — | — | 5.2% | 8k | T | — | — | — | — | — | undefined | — | Feb 2026 | |
| 519 | GPT-5.5 Pro (xhigh) | OpenAI | — | — | — | — | — | — | 922k | T 🖼 | — | — | — | — | — | undefined | — | Apr 2026 | |
| 520 | Gemini 3 Deep Think | — | — | — | — | — | — | 128k | T | — | — | — | — | — | undefined | — | Feb 2026 | ||
| 521 | EXAONE 4.5 33B (Non-reasoning) | LG AI Research | — | — | — | — | — | — | 262k | T 🖼 | — | — | — | — | — | undefined | — | Apr 2026 | |
| 522 | Cogito v2.1 (Reasoning) | Deep Cogito | — | 76.8% | 68.8% | 72.7% | 84.9% | 11.0% | 128k | T | 77 t/s | 0.90 s | $1.25 | $1.25 | $1.25 | undefined | — | Nov 2025 | |
| 523 | Mi:dm K 2.5 Pro Preview | Korea Telecom | — | 72.2% | 57.6% | 78.7% | 81.3% | 8.8% | 128k | T | — | — | — | — | — | undefined | — | Dec 2025 | |
| 524 | GPT-4o mini Realtime (Dec '24) | OpenAI | — | — | — | — | — | — | 128k | — | — | — | — | — | — | undefined | — | Dec 2024 | |
| 525 | GPT-5.4 Pro (xhigh) | OpenAI | — | — | — | — | — | — | 1.1M | T 🖼 | — | — | $30.00 | $180 | $67.50 | undefined | — | Mar 2026 | |
| 526 | GPT-3.5 Turbo (0613) | OpenAI | — | — | — | — | — | — | 4k | T | — | — | — | — | — | undefined | — | Jun 2023 | |
| 527 | GPT-4o Realtime (Dec '24) | OpenAI | — | — | — | — | — | — | 128k | T | — | — | — | — | — | undefined | — | Dec 2024 |