HUMAINE
A human-centered LLM benchmark from Prolific that measures real-world user success, satisfaction, engagement, and preference patterns using demographically broad feedback.
Rows: 45
Primary metric: overall_score
Sampled: 2026-05-06
Metadata
Metrics: Overall score, Average success, Average satisfaction, Average engagement, Success rate, Conversation count
| Rank | Subject | Overall score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | deepseek/deepseek-r1-0528 | 3.79 | R1 0528 deepseek-deepseek-r1-0528 | Imported | 2026-05-06 |
| 2 | openai/o3 | 3.79 | o3 openai-o3 | Imported | 2026-05-06 |
| 3 | google/gemini-2.5-pro | 3.76 | Gemini 2.5 Pro google-gemini-2.5-pro | Imported | 2026-05-06 |
| 4 | google/gemini-3.1-pro-preview | 3.73 | Gemini 3.1 Pro Preview google-gemini-3.1-pro-preview | Imported | 2026-05-06 |
| 5 | x-ai/grok-4 | 3.72 | Grok 4 x-ai-grok-4 | Imported | 2026-05-06 |
| 6 | moonshotai/kimi-k2 | 3.71 | MoonshotAI: Kimi K2 0711 moonshotai-kimi-k2 | Imported | 2026-05-06 |
| 7 | openai/gpt-5.4 | 3.70 | GPT-5.4 openai-gpt-5.4 | Imported | 2026-05-06 |
| 8 | openai/gpt-5.5 | 3.70 | GPT-5.5 openai-gpt-5.5 | Imported | 2026-05-06 |
| 9 | mistralai/magistral-medium-2506 | 3.69 | — | Imported | 2026-05-06 |
| 10 | anthropic/claude-opus-4.7 | 3.68 | Claude Opus 4.7 anthropic-claude-opus-4.7 | Imported | 2026-05-06 |
| 11 | x-ai/grok-3 | 3.67 | Grok 3 xaigrok-3 | Imported | 2026-05-06 |
| 12 | deepseek/deepseek-v4-flash | 3.66 | DeepSeek V4 Flash deepseek-deepseek-v4-flash | Imported | 2026-05-06 |
| 13 | google/gemini-2.5-flash | 3.66 | Gemini 2.5 Flash google-gemini-2.5-flash | Imported | 2026-05-06 |
| 14 | deepseek/deepseek-chat-v3-0324 | 3.64 | DeepSeek V3 0324 deepseek-deepseek-chat-v3-0324 | Imported | 2026-05-06 |
| 15 | openai/gpt-5-mini | 3.63 | GPT-5 Mini openai-gpt-5-mini | Imported | 2026-05-06 |
| 16 | openai/gpt-5 | 3.61 | GPT-5 openai-gpt-5 | Imported | 2026-05-06 |
| 17 | x-ai/grok-4.20-beta | 3.60 | Grok 4.20 x-ai-grok-4.20 | Imported | 2026-05-06 |
| 18 | openai/o4-mini | 3.58 | o4 Mini openai-o4-mini | Imported | 2026-05-06 |
| 19 | google/gemma-3-27b-it | 3.57 | Gemma 3 27B google-gemma-3-27b-it | Imported | 2026-05-06 |
| 20 | openai/gpt-5.2-chat | 3.57 | GPT-5.2 Chat openai-gpt-5.2-chat | Imported | 2026-05-06 |
| 21 | openai/o1-mini | 3.57 | — | Imported | 2026-05-06 |
| 22 | google/gemini-2.0-flash-001 | 3.55 | Gemini 2.0 Flash google-gemini-2.0-flash | Imported | 2026-05-06 |
| 23 | moonshotai/kimi-k2.5 | 3.55 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 24 | openai/gpt-4.1 | 3.53 | GPT-4.1 openai-gpt-4.1 | Imported | 2026-05-06 |
| 25 | anthropic/claude-opus-4 | 3.50 | Claude Opus 4 anthropic-claude-opus-4 | Imported | 2026-05-06 |
| 26 | anthropic/claude-sonnet-4 | 3.50 | Claude Sonnet 4 anthropic-claude-sonnet-4 | Imported | 2026-05-06 |
| 27 | anthropic/claude-sonnet-4.5 | 3.49 | Claude Sonnet 4.5 anthropic-claude-sonnet-4.5 | Imported | 2026-05-06 |
| 28 | anthropic/claude-opus-4.6 | 3.48 | Claude Opus 4.6 anthropic-claude-opus-4.6 | Imported | 2026-05-06 |
| 29 | z-ai/glm-4.7 | 3.47 | GLM 4.7 z-ai-glm-4.7 | Imported | 2026-05-06 |
| 30 | openai/o1 | 3.44 | o1 openai-o1 | Imported | 2026-05-06 |
| 31 | anthropic/claude-3.7-sonnet | 3.40 | Claude 3.7 Sonnet anthropic-claude-3.7-sonnet | Imported | 2026-05-06 |
| 32 | cohere/command-a | 3.40 | Command A cohere-command-a | Imported | 2026-05-06 |
| 33 | qwen/qwen3-235b-a22b-2507 | 3.40 | Qwen3 235B A22B Instruct 2507 qwen-qwen3-235b-a22b-2507 | Imported | 2026-05-06 |
| 34 | deepseek/deepseek-v3.2 | 3.39 | DeepSeek V3.2 deepseek-deepseek-v3.2 | Imported | 2026-05-06 |
| 35 | openai/o3-mini | 3.38 | o3-mini openai-o3-mini | Imported | 2026-05-06 |
| 36 | allenai/olmo-3.1-32b-instruct | 3.37 | Olmo 3.1 32B Instruct allenai-olmo-3.1-32b-instruct | Imported | 2026-05-06 |
| 37 | meta-llama/llama-3.3-70b-instruct | 3.37 | Llama 3.3 70B Instruct meta-llama-llama-3.3-70b-instruct | Imported | 2026-05-06 |
| 38 | google/gemini-3-pro | 3.34 | Gemini 3 google-gemini-3 | Imported | 2026-05-06 |
| 39 | minimax/minimax-m2.1 | 3.34 | MiniMax M2.1 minimax-minimax-m2.1 | Imported | 2026-05-06 |
| 40 | openai/gpt-4o | 3.33 | GPT-4o openai-gpt-4o | Imported | 2026-05-06 |
| 41 | mistralai/mistral-large-3 | 3.31 | — | Imported | 2026-05-06 |
| 42 | mistralai/mistral-nemo | 3.30 | Mistral: Mistral Nemo mistralai-mistral-nemo | Imported | 2026-05-06 |
| 43 | meta-llama/llama-4-maverick | 3.27 | Llama 4 Maverick meta-llama-4-maverick | Imported | 2026-05-06 |
| 44 | anthropic/claude-opus-4.5 | 3.25 | Claude Opus 4.5 anthropic-claude-opus-4.5 | Imported | 2026-05-06 |
| 45 | cohere/command-r7b-12-2024 | 3.13 | Command R7B (12-2024) cohere-command-r7b-12-2024 | Imported | 2026-05-06 |