DeepSeek V3.1
DeepSeek / DeepSeek
29 scores
19 benchmarks
Cost (input / output): $0.15 / $0.75 per 1M tokens
Metadata
DeepSeek Open source
Aliases: deepseek-chat-v3.1, deepseek-deepseek-chat-v3.1, deepseek/deepseek-chat-v3.1
| Benchmark | Category | Rank | Score | Sampled |
|---|---|---|---|---|
| MCP-Universe | Agentic | 17 | 22.08 | 2026-05-06 |
| Tau2-Bench Telecom | Agentic | 207 | 37.4% | 2026-05-11 |
| Tau2-Bench Telecom | Agentic | 216 | 34.8% | 2026-05-11 |
| Terminal-Bench Hard | Agentic | 118 | 25% | 2026-05-11 |
| Terminal-Bench Hard | Agentic | 123 | 24.2% | 2026-05-11 |
| Codeforces | Coding | 12 | 0.697 | 2026-05-28 |
| SciCode | Coding | 117 | 39.1% | 2026-05-11 |
| SciCode | Coding | 159 | 36.7% | 2026-05-11 |
| FinChain | Finance | 12 | 56.76 ChainEval | 2026-05-28 |
| BenchLM | General Knowledge | 90 | 30 | 2026-05-06 |
| MMLU-Redux | General Knowledge | 18 | 0.92 | 2026-05-06 |
| AIIQ Composite IQ | Intelligence | 34 | 95 | 2026-05-12 |
| Artificial Analysis Intelligence Index | Intelligence | 163 | 28.13 | 2026-05-11 |
| Artificial Analysis Intelligence Index | Intelligence | 166 | 27.71 | 2026-05-11 |
| Humanity's Last Exam | Intelligence | 112 | 13% | 2026-05-11 |
| Humanity's Last Exam | Intelligence | 231 | 6.3% | 2026-05-11 |
| MMLU-Pro | Intelligence | 30 | 85.1% | 2026-05-11 |
| MMLU-Pro | Intelligence | 56 | 83.3% | 2026-05-11 |
| AIME 2025 | Math | 26 | 89.7% | 2026-05-11 |
| AIME 2025 | Math | 141 | 49.7% | 2026-05-11 |
| IneqMath | Math | 13 | 15.50 | 2026-05-06 |
| IneqMath | Math | 16 | 12 | 2026-05-06 |
| HMMT 2025 | Math | 31 | 0.34 | 2026-05-06 |
| LiveMedBench | Medical | 24 | 0.0959 | 2026-05-27 |
| GPQA Diamond | Reasoning | 123 | 77.9% | 2026-05-11 |
| GPQA Diamond | Reasoning | 172 | 73.5% | 2026-05-11 |
| CritPt | Science | 58 | 2% | 2026-05-11 |
| CritPt | Science | 171 | 0% | 2026-05-11 |
| BrowseComp-zh | Search | 10 | 0.49 | 2026-05-06 |
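
The listed price of $0.15 per 1M input tokens and $0.75 per 1M output tokens can be turned into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost_usd` and the example token counts are assumptions rather than part of this listing, and actual billing may differ by provider.

```python
# Listed prices for DeepSeek V3.1, in USD per 1M tokens (taken from the listing above).
INPUT_USD_PER_MILLION = 0.15
OUTPUT_USD_PER_MILLION = 0.75


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts.

    Hypothetical helper: it applies the two listed rates linearly and
    ignores any provider-specific discounts or minimum charges.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example request: 2,000 prompt tokens and 500 completion tokens (made-up numbers).
    print(f"${estimate_cost_usd(2_000, 500):.6f}")  # prints $0.000675
```

At these rates output tokens cost five times as much as input tokens, so long completions dominate the estimate even when the prompt is larger.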