IFBench
Instruction Following Benchmark evaluating model's ability to follow complex instructions
22rows
scoreprimary metric
2026-05-28sampled
Metadata
Metrics
Score, Normalized Score
Showing 2 latest source slices.
| Rank | Subject | Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Qwen3.7 Max | 79.1% | Qwen3.7 Max qwen-qwen3.7-max | Self-reported | 2026-05-28 |
| 2 | DeepSeek V4 Pro Max | 77% | DeepSeek V4 Pro deepseek-deepseek-v4-pro | Self-reported | 2026-05-28 |
| 3 | GLM-5.1 Thinking | 76% | GLM 5.1 z-ai-glm-5.1 | Self-reported | 2026-05-28 |
| 4 | Kimi K2.6 Thinking | 76% | MoonshotAI: Kimi K2.6 moonshotai-kimi-k2.6 | Self-reported | 2026-05-28 |
| 5 | Qwen3.6 Plus | 74.2% | Qwen3.6 Plus qwen-qwen3.6-plus | Self-reported | 2026-05-28 |
| 6 | Claude Opus 4.6 Max | 62.5% | Claude Opus 4.6 anthropic-claude-opus-4.6 | Self-reported | 2026-05-28 |
| 1 | Hermes 3 70B | 0.81 | — | Self-reported | 2026-05-06 |
| 2 | Qwen3.5-397B-A17B | 0.77 | Qwen3.5 397B A17B qwen-qwen3.5-397b-a17b | Self-reported | 2026-05-06 |
| 2 | Qwen3.5-27B | 0.77 | Qwen3.5-27B qwen-qwen3.5-27b | Self-reported | 2026-05-06 |
| 4 | Qwen3.5-122B-A10B | 0.76 | Qwen3.5-122B-A10B qwen-qwen3.5-122b-a10b | Self-reported | 2026-05-06 |
| 5 | Qwen3.6 Plus | 0.74 | Qwen3.6 Plus qwen-qwen3.6-plus | Self-reported | 2026-05-06 |
| 6 | Nemotron 3 Super (120B A12B) | 0.73 | Nemotron 3 Super nvidia-nemotron-3-super-120b-a12b | Self-reported | 2026-05-06 |
| 7 | Mercury 2 | 0.71 | Mercury 2 inception-mercury-2 | Self-reported | 2026-05-06 |
| 8 | Qwen3.5-35B-A3B | 0.70 | Qwen3.5-35B-A3B qwen-qwen3.5-35b-a3b | Self-reported | 2026-05-06 |
| 9 | MiniMax M2.1 | 0.70 | MiniMax M2.1 minimax-minimax-m2.1 | Self-reported | 2026-05-06 |
| 10 | GPT OSS 120B High | 0.69 | — | Self-reported | 2026-05-06 |
| 11 | K-EXAONE-236B-A23B | 0.67 | — | Self-reported | 2026-05-06 |
| 12 | Qwen3.5-9B | 0.65 | Qwen3.5-9B qwen-qwen3.5-9b | Self-reported | 2026-05-06 |
| 13 | Qwen3.5-4B | 0.59 | — | Self-reported | 2026-05-06 |
| 14 | Mistral Small 4 | 0.48 | Mistral: Mistral Small 4 mistralai-mistral-small-2603 | Self-reported | 2026-05-06 |
| 15 | Qwen3.5-2B | 0.41 | — | Self-reported | 2026-05-06 |
| 16 | Qwen3.5-0.8B | 0.21 | — | Self-reported | 2026-05-06 |
No matching rows.