C-Eval

C-Eval is a Chinese-language benchmark of exam-style multiple-choice questions that evaluates a language model's broad knowledge and reasoning.

Rows: 143
Primary metric: average_accuracy
Sampled: 2026-05-27

Metadata

Metrics

Average, Average (Hard), STEM, Social Science, Humanities, Other

Latest Results

Rows are parsed from the public C-Eval static leaderboard JS payload. C-Eval states that the leaderboard stopped receiving updates after the test set was released in July 2025.

Rank Subject Average Model Match Provenance Sampled
1 海信星海 92.3% Imported 2026-05-27
2 BlueLM 92.2% Imported 2026-05-27
3 讯飞星火认知大模型(Spark4.0 Max) 91.8% Imported 2026-05-27
4 DiMind 91.3% Imported 2026-05-27
5 MagicLM 90.1% Imported 2026-05-27
6 Mind GPT 89.7% Imported 2026-05-27
7 云天天书 89.5% Imported 2026-05-27
8 QuarkLLM 89% Imported 2026-05-27
9 YAYI-Ultra 87.7% Imported 2026-05-27
10 卓睦鸟医疗大模型 87.4% Imported 2026-05-27
11 声智科技医院大模型--AzeroGPT 87.2% Imported 2026-05-27
12 HZ60 87% Imported 2026-05-27
13 TeleChat2-115B 86.9% Imported 2026-05-27
14 FanttecLM 86.7% Imported 2026-05-27
15 TuringMM-V2-Chat 86.4% Imported 2026-05-27
16 讯飞星火认知大模型(Spark3.5 Max) 85.9% Imported 2026-05-27
17 Qwen 85.7% Imported 2026-05-27
18 UniGPT3.0(山海) 84.5% Imported 2026-05-27
19 Jiutian-大模型 84.2% Imported 2026-05-27
20 DZJ-13B 83.6% Imported 2026-05-27
21 砭石•医学 83.5% Imported 2026-05-27
22 Yi-1.5-34B 83.1% Imported 2026-05-27
23 CW-MLM 83% Imported 2026-05-27
24 Qwen-72B 82.8% Imported 2026-05-27
25 JIUTIAN-57B 82.4% Imported 2026-05-27
26 Youyuanjian (Chinese name: 邮远见) 82.2% Imported 2026-05-27
27 Yi-34B 81.4% Imported 2026-05-27
28 APUS-xDAN 大模型4.0(MoE)136B 81.3% Imported 2026-05-27
29 TuringMM-34B-Chat 80.7% Imported 2026-05-27
30 Linly-Chinese-LLaMA2-70B 80.6% Imported 2026-05-27
31 PCI-TransGPT 80.4% Imported 2026-05-27
32 HZ30 80.1% Imported 2026-05-27
33 Taichu-70B 80.1% Imported 2026-05-27
34 Yan 80% Imported 2026-05-27
35 AndesGPT-7B 79.9% Imported 2026-05-27
36 砭石•中医 78.9% Imported 2026-05-27
37 OrionStar-Yi-34B-Chat 78.1% Imported 2026-05-27
38 云天书 77.1% Imported 2026-05-27
39 XuanYuan-13B 76.8% Imported 2026-05-27
40 EnableLLM 76.3% Imported 2026-05-27
41 YAYI2-30B 75.3% Imported 2026-05-27
42 砭石 74.8% Imported 2026-05-27
43 XuanYuan-6B 74.4% Imported 2026-05-27
44 xDAN-L2-Chat-lite-v1.0 74.3% Imported 2026-05-27
45 ZhiLu-2-8B-Instruct 74.2% Imported 2026-05-27
46 Galaxy 73.7% Imported 2026-05-27
47 KwaiYii-66B 73.7% Imported 2026-05-27
48 BlueLM-7B 73.3% Imported 2026-05-27
49 INDICS-MIND-13B 72.9% Imported 2026-05-27
50 UniGPT2.0(山海) 72.9% Imported 2026-05-27
51 XuanYuan2-70B 72.7% Imported 2026-05-27
52 XVERSE-65B-2 72.4% Imported 2026-05-27
53 Qwen-14B 72.1% Imported 2026-05-27
54 Yi-6B 72% Imported 2026-05-27
55 XuanYuan-70B 71.9% Imported 2026-05-27
56 YaYi 71.8% Imported 2026-05-27
57 AiLMe-100B v3 71.6% Imported 2026-05-27
58 Mengzi 71.5% Imported 2026-05-27
59 DFM2.0 71.2% Imported 2026-05-27
60 ChatGLM2 71.1% Imported 2026-05-27
61 HZ20 70.4% Imported 2026-05-27
62 星云通信大模型 ZTE TelcoGPT 70.4% Imported 2026-05-27
63 ChatDD-FM 69.1% Imported 2026-05-27
64 360GPT-S2 69% Imported 2026-05-27
65 ChatGLM3-6B-base 69% Imported 2026-05-27
66 InternLM-123B 68.8% Imported 2026-05-27
67 GPT-4 68.7% GPT-4 (openai-gpt-4) Imported 2026-05-27
68 XVERSE-65B 68.6% Imported 2026-05-27
69 HITsz-Lychee-Base-11B-V0.1 67% Imported 2026-05-27
70 Aquila2-70B-Expr 66.8% Imported 2026-05-27
71 CW-MLM-13B 66.7% Imported 2026-05-27
72 GS-LLM-Beta 66.7% Imported 2026-05-27
73 SageGPT-V0.2 66.6% Imported 2026-05-27
74 SenseChat 66.1% Imported 2026-05-27
75 CHAOS_LM-7B-4bit 65.3% Imported 2026-05-27
76 CHAOS_LM-7B 65% Imported 2026-05-27
77 Mengzi-7B 64.9% Imported 2026-05-27
78 GS-LLM-Beta-Mini 64.6% Imported 2026-05-27
79 Atom-13B 64.5% Imported 2026-05-27
80 赤兔 64.1% Imported 2026-05-27
81 支点-1.5B 64% Imported 2026-05-27
82 Nanbeige-16B-Base 63.8% Imported 2026-05-27
83 LingoWhale-8B 63.6% Imported 2026-05-27
84 Qwen-7B v1.1 63.5% Imported 2026-05-27
85 XVERSE-13B-2 63.5% Imported 2026-05-27
86 TeleChat 63.1% Imported 2026-05-27
87 Alaya-7B-Base 62.8% Imported 2026-05-27
88 InternLM 62.7% Imported 2026-05-27
89 KwaiYii-13B 62.6% Imported 2026-05-27
90 万语-50M(Wanyv-50M) 62% Imported 2026-05-27
91 ChatGLM2-12B 61.6% Imported 2026-05-27
92 Erlangshen-UniMC-1.3B 61% Imported 2026-05-27
93 Dolphin 60.4% Imported 2026-05-27
94 UniGPT 60.3% Imported 2026-05-27
95 MiLM-6B 60.2% Imported 2026-05-27
96 Qwen-7B 59.6% Imported 2026-05-27
97 BatGPT-15b-sirius-v2 57.4% Imported 2026-05-27
98 XVERSE-7B 57.1% Imported 2026-05-27
99 Instruct-DLM-v2 56.8% Imported 2026-05-27
100 GS-LLM-Alpha 55.6% Imported 2026-05-27
101 AquilaChat2-34B v1.2 55.5% Imported 2026-05-27
102 Qwen-1.8B 54.7% Imported 2026-05-27
103 XVERSE-13B 54.7% Imported 2026-05-27
104 EduChat 54.6% Imported 2026-05-27
105 ChatGPT 54.4% Imported 2026-05-27
106 Claude-v1.3 54.2% Imported 2026-05-27
107 TeleChat-E 54.2% Imported 2026-05-27
108 CPM 54.1% Imported 2026-05-27
109 Baichuan-13B 53.6% Imported 2026-05-27
110 DLM-v2 53.5% Imported 2026-05-27
111 InternLM-7B 52.8% Imported 2026-05-27
112 ChatGLM2-6B 51.7% Imported 2026-05-27
113 yami-13B 51.4% Imported 2026-05-27
114 JiuZhou 九州 51.3% Imported 2026-05-27
115 nagin-13b 50.8% Imported 2026-05-27
116 nagin-7B 50.7% Imported 2026-05-27
117 BR-LLM-7B 50.5% Imported 2026-05-27
118 Colossal-LLaMA-2-7B-base 50.2% Imported 2026-05-27
119 ZhiLu-13B-Instruct 50.1% Imported 2026-05-27
120 咪姆 48.8% Imported 2026-05-27
121 cucumberearth 47.7% Imported 2026-05-27
122 AndesLM-13B 46% Imported 2026-05-27
123 Claude-instant-v1.0 45.9% Imported 2026-05-27
124 MiLM-1.3B 45.8% Imported 2026-05-27
125 WestlakeLM-19B 44.6% Imported 2026-05-27
126 bloomz-mt-176B 44.3% Imported 2026-05-27
127 玉言 44.3% Imported 2026-05-27
128 GLM-130B 44% Imported 2026-05-27
129 baichuan-7B 42.8% Imported 2026-05-27
130 YuLan-Chat-2-13B 42.6% Imported 2026-05-27
131 CubeLM-13B 42.5% Imported 2026-05-27
132 Chinese-Alpaca-33B 41.6% Imported 2026-05-27
133 Chinese-Alpaca-Plus-13B 41.5% Imported 2026-05-27
134 Yuren-13b 40.4% Imported 2026-05-27
135 MoYu 40.3% Imported 2026-05-27
136 ChatGLM-6B 38.9% Imported 2026-05-27
137 LLaMA-65B 38.8% Imported 2026-05-27
138 CCNU-7B 35.7% Imported 2026-05-27
139 Llama2-Moses-7B 34.5% Imported 2026-05-27
140 Chinese LLaMA-13B 33.3% Imported 2026-05-27
141 MOSS 33.1% Imported 2026-05-27
142 Camalama 32.6% Imported 2026-05-27
143 Chinese Alpaca-13B 30.9% Imported 2026-05-27
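
Each row above follows the column order given in the table header: rank, subject, average, an optional model match, provenance, and sampled date. For readers who want to work with the numbers, the following is a minimal Python sketch that splits one such text row into fields. It is not the parser used to build this page, which reads the leaderboard JS payload; the names `ROW_RE`, `LeaderboardRow`, and `parse_row` are hypothetical, and the sketch assumes the average is always the first percentage on the line and the sampled date is always an ISO date at the end.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Assumed row layout, taken from the table header:
# rank, subject, average, optional model match, provenance, sampled date.
ROW_RE = re.compile(
    r"^(?P<rank>\d+)\s+"
    r"(?P<subject>.+?)\s+"                # subject may contain spaces and digits
    r"(?P<average>\d+(?:\.\d+)?)%\s+"     # e.g. "92.3%" or "89%"
    r"(?:(?P<match>.+?)\s+)?"             # model match is empty for most rows
    r"(?P<provenance>\S+)\s+"
    r"(?P<sampled>\d{4}-\d{2}-\d{2})$"
)


@dataclass
class LeaderboardRow:
    rank: int
    subject: str
    average: float               # percentage points, e.g. 92.3
    model_match: Optional[str]   # None when the column is empty
    provenance: str
    sampled: str                 # ISO date string, kept as text


def parse_row(line: str) -> Optional[LeaderboardRow]:
    """Return the parsed row, or None if the line does not look like a row."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    return LeaderboardRow(
        rank=int(m.group("rank")),
        subject=m.group("subject"),
        average=float(m.group("average")),
        model_match=m.group("match"),
        provenance=m.group("provenance"),
        sampled=m.group("sampled"),
    )


if __name__ == "__main__":
    samples = [
        "1 海信星海 92.3% Imported 2026-05-27",
        "8 QuarkLLM 89% Imported 2026-05-27",
        "57 AiLMe-100B v3 71.6% Imported 2026-05-27",
    ]
    for text in samples:
        print(parse_row(text))
```

Because the subject is matched lazily up to the first percentage, names that contain spaces or digits (for example `AiLMe-100B v3`) stay intact, and non-row lines such as the header return `None`.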