C-Eval
C-Eval: Evaluates broad language-model knowledge, reasoning, commonsense, instruction following, or exam-style accuracy.
143rows
average_accuracyprimary metric
2026-05-27sampled
Metadata
Metrics
Average, Average (Hard), STEM, Social Science, Humanities, Other
| Rank | Subject | Average | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | 海信星海 | 92.3% | — | Imported | 2026-05-27 |
| 2 | BlueLM | 92.2% | — | Imported | 2026-05-27 |
| 3 | 讯飞星火认知大模型(Spark4.0 Max) | 91.8% | — | Imported | 2026-05-27 |
| 4 | DiMind | 91.3% | — | Imported | 2026-05-27 |
| 5 | MagicLM | 90.1% | — | Imported | 2026-05-27 |
| 6 | Mind GPT | 89.7% | — | Imported | 2026-05-27 |
| 7 | 云天天书 | 89.5% | — | Imported | 2026-05-27 |
| 8 | QuarkLLM | 89% | — | Imported | 2026-05-27 |
| 9 | YAYI-Ultra | 87.7% | — | Imported | 2026-05-27 |
| 10 | 卓睦鸟医疗大模型 | 87.4% | — | Imported | 2026-05-27 |
| 11 | 声智科技医院大模型--AzeroGPT | 87.2% | — | Imported | 2026-05-27 |
| 12 | HZ60 | 87% | — | Imported | 2026-05-27 |
| 13 | TeleChat2-115B | 86.9% | — | Imported | 2026-05-27 |
| 14 | FanttecLM | 86.7% | — | Imported | 2026-05-27 |
| 15 | TuringMM-V2-Chat | 86.4% | — | Imported | 2026-05-27 |
| 16 | 讯飞星火认知大模型(Spark3.5 Max) | 85.9% | — | Imported | 2026-05-27 |
| 17 | Qwen | 85.7% | — | Imported | 2026-05-27 |
| 18 | UniGPT3.0(山海) | 84.5% | — | Imported | 2026-05-27 |
| 19 | Jiutian-大模型 | 84.2% | — | Imported | 2026-05-27 |
| 20 | DZJ-13B | 83.6% | — | Imported | 2026-05-27 |
| 21 | 砭石•医学 | 83.5% | — | Imported | 2026-05-27 |
| 22 | Yi-1.5-34B | 83.1% | — | Imported | 2026-05-27 |
| 23 | CW-MLM | 83% | — | Imported | 2026-05-27 |
| 24 | Qwen-72B | 82.8% | — | Imported | 2026-05-27 |
| 25 | JIUTIAN-57B | 82.4% | — | Imported | 2026-05-27 |
| 26 | Youyuanjian(中文名:邮远见) | 82.2% | — | Imported | 2026-05-27 |
| 27 | Yi-34B | 81.4% | — | Imported | 2026-05-27 |
| 28 | APUS-xDAN 大模型4.0(MoE)136B | 81.3% | — | Imported | 2026-05-27 |
| 29 | TuringMM-34B-Chat | 80.7% | — | Imported | 2026-05-27 |
| 30 | Linly-Chinese-LLaMA2-70B | 80.6% | — | Imported | 2026-05-27 |
| 31 | PCI-TransGPT | 80.4% | — | Imported | 2026-05-27 |
| 32 | HZ30 | 80.1% | — | Imported | 2026-05-27 |
| 33 | Taichu-70B | 80.1% | — | Imported | 2026-05-27 |
| 34 | Yan | 80% | — | Imported | 2026-05-27 |
| 35 | AndesGPT-7B | 79.9% | — | Imported | 2026-05-27 |
| 36 | 砭石•中医 | 78.9% | — | Imported | 2026-05-27 |
| 37 | OrionStar-Yi-34B-Chat | 78.1% | — | Imported | 2026-05-27 |
| 38 | 云天书 | 77.1% | — | Imported | 2026-05-27 |
| 39 | XuanYuan-13B | 76.8% | — | Imported | 2026-05-27 |
| 40 | EnableLLM | 76.3% | — | Imported | 2026-05-27 |
| 41 | YAYI2-30B | 75.3% | — | Imported | 2026-05-27 |
| 42 | 砭石 | 74.8% | — | Imported | 2026-05-27 |
| 43 | XuanYuan-6B | 74.4% | — | Imported | 2026-05-27 |
| 44 | xDAN-L2-Chat-lite-v1.0 | 74.3% | — | Imported | 2026-05-27 |
| 45 | ZhiLu-2-8B-Instruct | 74.2% | — | Imported | 2026-05-27 |
| 46 | Galaxy | 73.7% | — | Imported | 2026-05-27 |
| 47 | KwaiYii-66B | 73.7% | — | Imported | 2026-05-27 |
| 48 | BlueLM-7B | 73.3% | — | Imported | 2026-05-27 |
| 49 | INDICS-MIND-13B | 72.9% | — | Imported | 2026-05-27 |
| 50 | UniGPT2.0(山海) | 72.9% | — | Imported | 2026-05-27 |
| 51 | XuanYuan2-70B | 72.7% | — | Imported | 2026-05-27 |
| 52 | XVERSE-65B-2 | 72.4% | — | Imported | 2026-05-27 |
| 53 | Qwen-14B | 72.1% | — | Imported | 2026-05-27 |
| 54 | Yi-6B | 72% | — | Imported | 2026-05-27 |
| 55 | XuanYuan-70B | 71.9% | — | Imported | 2026-05-27 |
| 56 | YaYi | 71.8% | — | Imported | 2026-05-27 |
| 57 | AiLMe-100B v3 | 71.6% | — | Imported | 2026-05-27 |
| 58 | Mengzi | 71.5% | — | Imported | 2026-05-27 |
| 59 | DFM2.0 | 71.2% | — | Imported | 2026-05-27 |
| 60 | ChatGLM2 | 71.1% | — | Imported | 2026-05-27 |
| 61 | HZ20 | 70.4% | — | Imported | 2026-05-27 |
| 62 | 星云通信大模型 ZTE TelcoGPT | 70.4% | — | Imported | 2026-05-27 |
| 63 | ChatDD-FM | 69.1% | — | Imported | 2026-05-27 |
| 64 | 360GPT-S2 | 69% | — | Imported | 2026-05-27 |
| 65 | ChatGLM3-6B-base | 69% | — | Imported | 2026-05-27 |
| 66 | InternLM-123B | 68.8% | — | Imported | 2026-05-27 |
| 67 | GPT-4 | 68.7% | GPT-4 openai-gpt-4 | Imported | 2026-05-27 |
| 68 | XVERSE-65B | 68.6% | — | Imported | 2026-05-27 |
| 69 | HITsz-Lychee-Base-11B-V0.1 | 67% | — | Imported | 2026-05-27 |
| 70 | Aquila2-70B-Expr | 66.8% | — | Imported | 2026-05-27 |
| 71 | CW-MLM-13B | 66.7% | — | Imported | 2026-05-27 |
| 72 | GS-LLM-Beta | 66.7% | — | Imported | 2026-05-27 |
| 73 | SageGPT-V0.2 | 66.6% | — | Imported | 2026-05-27 |
| 74 | SenseChat | 66.1% | — | Imported | 2026-05-27 |
| 75 | CHAOS_LM-7B-4bit | 65.3% | — | Imported | 2026-05-27 |
| 76 | CHAOS_LM-7B | 65% | — | Imported | 2026-05-27 |
| 77 | Mengzi-7B | 64.9% | — | Imported | 2026-05-27 |
| 78 | GS-LLM-Beta-Mini | 64.6% | — | Imported | 2026-05-27 |
| 79 | Atom-13B | 64.5% | — | Imported | 2026-05-27 |
| 80 | 赤兔 | 64.1% | — | Imported | 2026-05-27 |
| 81 | 支点-1.5B | 64% | — | Imported | 2026-05-27 |
| 82 | Nanbeige-16B-Base | 63.8% | — | Imported | 2026-05-27 |
| 83 | LingoWhale-8B | 63.6% | — | Imported | 2026-05-27 |
| 84 | Qwen-7B v1.1 | 63.5% | — | Imported | 2026-05-27 |
| 85 | XVERSE-13B-2 | 63.5% | — | Imported | 2026-05-27 |
| 86 | TeleChat | 63.1% | — | Imported | 2026-05-27 |
| 87 | Alaya-7B-Base | 62.8% | — | Imported | 2026-05-27 |
| 88 | InternLM | 62.7% | — | Imported | 2026-05-27 |
| 89 | KwaiYii-13B | 62.6% | — | Imported | 2026-05-27 |
| 90 | 万语-50M(Wanyv-50M) | 62% | — | Imported | 2026-05-27 |
| 91 | ChatGLM2-12B | 61.6% | — | Imported | 2026-05-27 |
| 92 | Erlangshen-UniMC-1.3B | 61% | — | Imported | 2026-05-27 |
| 93 | Dolphin | 60.4% | — | Imported | 2026-05-27 |
| 94 | UniGPT | 60.3% | — | Imported | 2026-05-27 |
| 95 | MiLM-6B | 60.2% | — | Imported | 2026-05-27 |
| 96 | Qwen-7B | 59.6% | — | Imported | 2026-05-27 |
| 97 | BatGPT-15b-sirius-v2 | 57.4% | — | Imported | 2026-05-27 |
| 98 | XVERSE-7B | 57.1% | — | Imported | 2026-05-27 |
| 99 | Instruct-DLM-v2 | 56.8% | — | Imported | 2026-05-27 |
| 100 | GS-LLM-Alpha | 55.6% | — | Imported | 2026-05-27 |
| 101 | AquilaChat2-34B v1.2 | 55.5% | — | Imported | 2026-05-27 |
| 102 | Qwen-1.8B | 54.7% | — | Imported | 2026-05-27 |
| 103 | XVERSE-13B | 54.7% | — | Imported | 2026-05-27 |
| 104 | EduChat | 54.6% | — | Imported | 2026-05-27 |
| 105 | ChatGPT | 54.4% | — | Imported | 2026-05-27 |
| 106 | Claude-v1.3 | 54.2% | — | Imported | 2026-05-27 |
| 107 | TeleChat-E | 54.2% | — | Imported | 2026-05-27 |
| 108 | CPM | 54.1% | — | Imported | 2026-05-27 |
| 109 | Baichuan-13B | 53.6% | — | Imported | 2026-05-27 |
| 110 | DLM-v2 | 53.5% | — | Imported | 2026-05-27 |
| 111 | InternLM-7B | 52.8% | — | Imported | 2026-05-27 |
| 112 | ChatGLM2-6B | 51.7% | — | Imported | 2026-05-27 |
| 113 | yami-13B | 51.4% | — | Imported | 2026-05-27 |
| 114 | JiuZhou 九州 | 51.3% | — | Imported | 2026-05-27 |
| 115 | nagin-13b | 50.8% | — | Imported | 2026-05-27 |
| 116 | nagin-7B | 50.7% | — | Imported | 2026-05-27 |
| 117 | BR-LLM-7B | 50.5% | — | Imported | 2026-05-27 |
| 118 | Colossal-LLaMA-2-7B-base | 50.2% | — | Imported | 2026-05-27 |
| 119 | ZhiLu-13B-Instruct | 50.1% | — | Imported | 2026-05-27 |
| 120 | 咪姆 | 48.8% | — | Imported | 2026-05-27 |
| 121 | cucumberearth | 47.7% | — | Imported | 2026-05-27 |
| 122 | AndesLM-13B | 46% | — | Imported | 2026-05-27 |
| 123 | Claude-instant-v1.0 | 45.9% | — | Imported | 2026-05-27 |
| 124 | MiLM-1.3B | 45.8% | — | Imported | 2026-05-27 |
| 125 | WestlakeLM-19B | 44.6% | — | Imported | 2026-05-27 |
| 126 | bloomz-mt-176B | 44.3% | — | Imported | 2026-05-27 |
| 127 | 玉言 | 44.3% | — | Imported | 2026-05-27 |
| 128 | GLM-130B | 44% | — | Imported | 2026-05-27 |
| 129 | baichuan-7B | 42.8% | — | Imported | 2026-05-27 |
| 130 | YuLan-Chat-2-13B | 42.6% | — | Imported | 2026-05-27 |
| 131 | CubeLM-13B | 42.5% | — | Imported | 2026-05-27 |
| 132 | Chinese-Alpaca-33B | 41.6% | — | Imported | 2026-05-27 |
| 133 | Chinese-Alpaca-Plus-13B | 41.5% | — | Imported | 2026-05-27 |
| 134 | Yuren-13b | 40.4% | — | Imported | 2026-05-27 |
| 135 | MoYu | 40.3% | — | Imported | 2026-05-27 |
| 136 | ChatGLM-6B | 38.9% | — | Imported | 2026-05-27 |
| 137 | LLaMA-65B | 38.8% | — | Imported | 2026-05-27 |
| 138 | CCNU-7B | 35.7% | — | Imported | 2026-05-27 |
| 139 | Llama2-Moses-7B | 34.5% | — | Imported | 2026-05-27 |
| 140 | Chinese LLaMA-13B | 33.3% | — | Imported | 2026-05-27 |
| 141 | MOSS | 33.1% | — | Imported | 2026-05-27 |
| 142 | Camalama | 32.6% | — | Imported | 2026-05-27 |
| 143 | Chinese Alpaca-13B | 30.9% | — | Imported | 2026-05-27 |
No matching rows.