HindiGen v1
Hindi generative-task benchmark for chat and instruct models, evaluated with the 3C3H rubric across Hindi QA, grammar, and safety tasks.
35rows
3c3h_scoreprimary metric
2026-05-06sampled
Metadata
Metrics
3C3H Score, Correctness, Completeness, Conciseness, Helpfulness, Honesty, Harmlessness, Question Answering (QA), Grammar, Safety
| Rank | Subject | 3C3H Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | o3-2025-04-16 | 85.56 | o3 openai-o3 | Imported | 2026-05-06 |
| 2 | o1-2024-12-17 | 79.64 | o1 openai-o1 | Imported | 2026-05-06 |
| 3 | claude-3-5-sonnet-20241022 | 77.47 | Claude 3.5 Sonnet anthropic-claude-3.5-sonnet | Imported | 2026-05-06 |
| 4 | o4-mini-2025-04-16 | 75.52 | o4 Mini openai-o4-mini | Imported | 2026-05-06 |
| 5 | claude-opus-4-20250514 | 74.49 | Claude Opus 4 anthropic-claude-opus-4 | Imported | 2026-05-06 |
| 6 | gpt-4o-2024-08-06 | 74.45 | GPT-4o (2024-08-06) openai-gpt-4o-2024-08-06 | Imported | 2026-05-06 |
| 7 | gpt-4o-2024-05-13 | 73.98 | GPT-4o (2024-05-13) openai-gpt-4o-2024-05-13 | Imported | 2026-05-06 |
| 8 | gemini-2.5-flash-preview-05-20 | 73.65 | — | Imported | 2026-05-06 |
| 9 | gpt-4.1-2025-04-14 | 73.37 | GPT-4.1 openai-gpt-4.1 | Imported | 2026-05-06 |
| 10 | gpt-4o-2024-11-20 | 72.44 | GPT-4o (2024-11-20) openai-gpt-4o-2024-11-20 | Imported | 2026-05-06 |
| 11 | gemini-2.5-pro-preview-05-06 | 71.77 | Gemini 2.5 Pro Preview 05-06 google-gemini-2.5-pro-preview-05-06 | Imported | 2026-05-06 |
| 12 | claude-3-7-sonnet-20250219 | 70.77 | Claude 3.7 Sonnet anthropic-claude-3.7-sonnet | Imported | 2026-05-06 |
| 13 | meta-llama/Llama-3.1-70B-Instruct | 70.45 | Llama 3.1 70B Instruct meta-llama-llama-3.1-70b-instruct | Imported | 2026-05-06 |
| 14 | claude-sonnet-4-20250514 | 69.75 | Claude Sonnet 4 anthropic-claude-sonnet-4 | Imported | 2026-05-06 |
| 15 | gpt-4o-mini-2024-07-18 | 65.50 | GPT-4o-mini (2024-07-18) openai-gpt-4o-mini-2024-07-18 | Imported | 2026-05-06 |
| 16 | gpt-4.1-mini-2025-04-14 | 65.02 | GPT-4.1 Mini openai-gpt-4.1-mini | Imported | 2026-05-06 |
| 17 | claude-3-5-haiku-20241022 | 61.25 | — | Imported | 2026-05-06 |
| 18 | o1-mini-2024-09-12 | 60 | — | Imported | 2026-05-06 |
| 19 | krutrim-ai-labs/Krutrim-2-instruct | 59.62 | — | Imported | 2026-05-06 |
| 20 | gpt-4.1-nano-2025-04-14 | 56.89 | GPT-4.1 Nano openai-gpt-4.1-nano | Imported | 2026-05-06 |
| 21 | claude-3-haiku-20240307 | 55.32 | Claude 3 Haiku anthropic-claude-3-haiku | Imported | 2026-05-06 |
| 22 | o3-mini-2025-01-31 | 55.14 | o3-mini openai-o3-mini | Imported | 2026-05-06 |
| 23 | tiiuae/Falcon-H1-34B-Instruct | 53.61 | — | Imported | 2026-05-06 |
| 24 | google/gemma-3-27b-it | 50.32 | Gemma 3 27B google-gemma-3-27b-it | Imported | 2026-05-06 |
| 25 | mistralai/Mistral-Small-24B-Instruct-2501 | 50.29 | Mistral: Mistral Small 3 mistralai-mistral-small-24b-instruct-2501 | Imported | 2026-05-06 |
| 26 | gpt-3.5-turbo-0125 | 49.10 | GPT-3.5 Turbo openai-gpt-3.5-turbo | Imported | 2026-05-06 |
| 27 | tiiuae/Falcon-H1-7B-Instruct | 34.97 | — | Imported | 2026-05-06 |
| 28 | MBZUAI/Llama-3-Nanda-10B-Chat | 31.73 | — | Imported | 2026-05-06 |
| 29 | sarvamai/sarvam-m | 22.92 | — | Imported | 2026-05-06 |
| 30 | gpt-4.5-preview-2025-02-27 | 15.46 | GPT-4.5 openai-gpt-4.5-preview | Imported | 2026-05-06 |
| 31 | tiiuae/Falcon-H1-3B-Instruct | 14.44 | — | Imported | 2026-05-06 |
| 32 | GenVRadmin/AryaBhatta-GemmaOrca-Merged | 13.83 | — | Imported | 2026-05-06 |
| 33 | GenVRadmin/AryaBhatta-GemmaUltra-Merged | 11.07 | — | Imported | 2026-05-06 |
| 34 | ai4bharat/Airavata | 9.57 | — | Imported | 2026-05-06 |
| 35 | tiiuae/Falcon-H1-1.5B-Instruct | 6.35 | — | Imported | 2026-05-06 |
No matching rows.