Open Chinese LLM Leaderboard

BAAI leaderboard for Chinese-oriented LLM evaluation across C-ARC, C-HellaSwag, C-TruthfulQA, C-Winogrande, C-GSM8K, C-SEM, C-MMLU, and CLCC-H.

177rows
averageprimary metric
2026-05-06sampled

Metadata

Metrics

Average, Evaluated Tasks, C-ARC, C-HellaSwag, C-TruthfulQA, C-Winogrande, C-GSM8K, C-SEM, C-MMLU, CLCC-H

Latest Results

Rank Subject Average Model Match Provenance Sampled
1 Qwen/Qwen2-72B-Instruct 75.67 Imported 2026-05-06
2 abacusai/Smaug-72B-v0.1 73.16 Imported 2026-05-06
3 Qwen/Qwen2-72B 72.66 Imported 2026-05-06
4 MTSAIR/MultiVerse_70B 72.08 Imported 2026-05-06
5 CombinHorizon/YiSM-blossom5.1-34B-SLERP 70.89 Imported 2026-05-06
6 Weyaxi/Bagel-Hermes-34B-Slerp 70.82 Imported 2026-05-06
7 abacusai/Smaug-34B-v0.1 70.61 Imported 2026-05-06
8 cloudyu/Yi-34Bx2-MoE-60B-DPO 70.61 Imported 2026-05-06
9 Qwen/Qwen1.5-72B-Chat 70.48 Imported 2026-05-06
10 cloudyu/Yi-34Bx2-MoE-60B 70.44 Imported 2026-05-06
11 ConvexAI/Luminex-34B-v0.2 70.44 Imported 2026-05-06
12 ConvexAI/Luminex-34B-v0.1 70.29 Imported 2026-05-06
13 cloudyu/Yi-34Bx2-MOE-200K 69.63 Imported 2026-05-06
14 OpenBuddy/openbuddy-yi1.5-34b-v21.3-32k 69.60 Imported 2026-05-06
15 cloudyu/Mixtral_34Bx2_MoE_60B 69.41 Imported 2026-05-06
16 bosonai/Higgs-Llama-3-70B 69.37 Imported 2026-05-06
17 OpenBuddy/openbuddy-deepseek-67b-v18.1-4k 69.32 Imported 2026-05-06
18 abacusai/Smaug-Llama-3-70B-Instruct 69.11 Imported 2026-05-06
19 01-ai/Yi-1.5-34B-Chat 68.92 Imported 2026-05-06
20 OpenBuddy/openbuddy-llama3-70b-v21.2-32k 68.90 Imported 2026-05-06
21 Qwen/Qwen2-57B-A14B-Instruct 68.71 Imported 2026-05-06
22 Azure99/blossom-v5.1-34b 68.67 Imported 2026-05-06
23 altomek/YiSM-34B-0rn 68.36 Imported 2026-05-06
24 cognitivecomputations/dolphin-2.9.1-yi-1.5-34b 68.25 Imported 2026-05-06
25 Qwen/Qwen1.5-72B 68.14 Imported 2026-05-06
26 cloudyu/TomGrc_FusionNet_34Bx2_MoE_v0.1_DPO_f16 67.81 Imported 2026-05-06
27 OpenBuddy/openbuddy-zero-56b-v21.2-32k 67.64 Imported 2026-05-06
28 chujiezheng/Llama3-70B-Chinese-Chat-ExPO 67.62 Imported 2026-05-06
29 chujiezheng/Smaug-34B-v0.1-ExPO 67.55 Imported 2026-05-06
30 MaziyarPanahi/Llama-3-70B-Instruct-DPO-v0.2 67.54 Imported 2026-05-06
31 Weyaxi/Nous-Hermes-2-SUS-Chat-34B-Slerp 67.32 Imported 2026-05-06
32 brucethemoose/Yi-34B-200K-DARE-merge-v7 67.09 Imported 2026-05-06
33 abacusai/Smaug-Llama-3-70B-Instruct-32K 67.04 Imported 2026-05-06
34 chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 66.98 Imported 2026-05-06
35 01-ai/Yi-1.5-34B-Chat-16K 66.91 Imported 2026-05-06
36 01-ai/Yi-1.5-34B-32K 66.84 Imported 2026-05-06
37 01-ai/Yi-1.5-34B 66.69 Imported 2026-05-06
38 Qwen/Qwen1.5-32B 66.63 Imported 2026-05-06
39 Qwen/Qwen2-57B-A14B 66.47 Imported 2026-05-06
40 cognitivecomputations/dolphin-2.9.1-llama-3-70b 66.14 Imported 2026-05-06
41 Qwen/Qwen1.5-14B-Chat 66.05 Imported 2026-05-06
42 Danielbrdz/Barcenas-14b-Phi-3-medium-ORPO 65.67 Imported 2026-05-06
43 BAAI/Infinity-Instruct-3M-0625-Yi-1.5-9B 65.64 Imported 2026-05-06
44 VAGOsolutions/SauerkrautLM-Phi-3-medium 65.42 Imported 2026-05-06
45 01-ai/Yi-34B 65.31 Imported 2026-05-06
46 ValiantLabs/Llama3-70B-ShiningValiant2 65.30 Imported 2026-05-06
47 CausalLM/34b-beta 64.70 Imported 2026-05-06
48 OpenBuddy/openbuddy-yi1.5-34b-v21.6-32k-fp16 64.63 Imported 2026-05-06
49 01-ai/Yi-1.5-9B-Chat 64.28 Imported 2026-05-06
50 Duxiaoman-DI/XuanYuan-70B 64.23 Imported 2026-05-06
51 alpindale/WizardLM-2-8x22B 64.21 Imported 2026-05-06
52 NousResearch/Nous-Hermes-2-Yi-34B 63.89 Imported 2026-05-06
53 NLPark/AnFeng_v3_Avocet 63.67 Imported 2026-05-06
54 Qwen/Qwen2-7B 63.51 Imported 2026-05-06
55 byroneverson/Yi-1.5-9B-Chat-16K-abliterated 63.48 Imported 2026-05-06
56 TIGER-Lab/MAmmoTH2-8x7B-Plus 63.17 Imported 2026-05-06
57 OpenBuddy/openbuddy-yi1.5-9b-v21.1-32k 62.97 Imported 2026-05-06
58 byroneverson/Yi-1.5-9B-Chat-abliterated 62.94 Imported 2026-05-06
59 NLPark/Shi-Ci_v3-Robin 62.90 Imported 2026-05-06
60 Qwen/Qwen1.5-32B-Chat 62.60 Imported 2026-05-06
61 01-ai/Yi-1.5-9B-Chat-16K 62.54 Imported 2026-05-06
62 01-ai/Yi-1.5-9B-32K 61.93 Imported 2026-05-06
63 01-ai/Yi-1.5-9B 61.80 Imported 2026-05-06
64 Qwen/Qwen2-7B-Instruct 61.79 Imported 2026-05-06
65 Qwen/Qwen1.5-14B 61.65 Imported 2026-05-06
66 chujiezheng/internlm2-chat-20b-ExPO 61.26 Imported 2026-05-06
67 ValiantLabs/Llama3-70B-Fireplace 60.83 Imported 2026-05-06
68 BAAI/Infinity-Instruct-7M-0729-Llama3_1-8B 60.65 Imported 2026-05-06
69 byroneverson/Mistral-Small-Instruct-2409-abliterated 60.50 Imported 2026-05-06
70 Duxiaoman-DI/XuanYuan-70B-Chat 60.45 Imported 2026-05-06
71 NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO 60.17 Imported 2026-05-06
72 GritLM/GritLM-8x7B-KTO 60.06 Imported 2026-05-06
73 BAAI/Infinity-Instruct-3M-0625-Llama3-8B 59.83 Imported 2026-05-06
74 Langboat/Mengzi3-8B-Chat 59.73 Imported 2026-05-06
75 chujiezheng/LLaMA3-iterative-DPO-final-ExPO 59.69 Imported 2026-05-06
76 CofeAI/Tele-FLM 59.59 Imported 2026-05-06
77 01-ai/Yi-34B-Chat 59.44 Imported 2026-05-06
78 abhishek/autotrain-llama3-70b-orpo-v1 59.07 Imported 2026-05-06
79 NotAiLOL/Yi-1.5-dolphin-9B 58.93 Imported 2026-05-06
80 VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct 58.68 Imported 2026-05-06
81 DeepMount00/Llama-3-8b-Ita 58.68 Imported 2026-05-06
82 MaziyarPanahi/Llama-3-8B-Instruct-v0.8 58.64 Imported 2026-05-06
83 Kukedlc/NeuralLLaMa-3-8b-DT-v0.1 58.59 Imported 2026-05-06
84 GritLM/GritLM-8x7B 58.54 Imported 2026-05-06
85 BAAI/Infinity-Instruct-7M-Gen-Llama3_1-8B 58.42 Imported 2026-05-06
86 01-ai/Yi-9B 58.34 Imported 2026-05-06
87 MaziyarPanahi/Llama-3-8B-Instruct-v0.10 58.32 Imported 2026-05-06
88 01-ai/Yi-9B-200K 58.07 Imported 2026-05-06
89 OpenBuddy/openbuddy-llama3-8b-v21.2-32k 57.96 Imported 2026-05-06
90 Danielbrdz/Barcenas-Llama3-8b-ORPO 57.95 Imported 2026-05-06
91 MaziyarPanahi/Llama-3-8B-Instruct-v0.9 57.95 Imported 2026-05-06
92 UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3 57.90 Imported 2026-05-06
93 Kukedlc/NeuralLLaMa-3-8b-ORPO-v0.3 57.87 Imported 2026-05-06
94 01-ai/Yi-34B-200K 57.87 Imported 2026-05-06
95 UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2 57.83 Imported 2026-05-06
96 NLPark/Test1_SLIDE 57.49 Imported 2026-05-06
97 RubielLabarta/LogoS-7Bx2-MoE-13B-v0.2 57.27 Imported 2026-05-06
98 Magpie-Align/Llama-3-8B-Magpie-Align-v0.3 57.23 Imported 2026-05-06
99 chujiezheng/Llama3-8B-Chinese-Chat-ExPO 57.20 Imported 2026-05-06
100 01-ai/Yi-6B-Chat 56.97 Imported 2026-05-06
101 Kukedlc/NeuralSynthesis-7B-v0.3 56.96 Imported 2026-05-06
102 Kquant03/CognitiveFusion2-4x7B-BF16 56.87 Imported 2026-05-06
103 Kukedlc/NeuralSynthesis-7B-v0.1 56.74 Imported 2026-05-06
104 Kukedlc/NeuralSynthesis-7b-v0.4-slerp 56.69 Imported 2026-05-06
105 cognitivecomputations/WestLake-7B-v2-laser 56.67 Imported 2026-05-06
106 MaziyarPanahi/Calme-4x7B-MoE-v0.1 56.65 Imported 2026-05-06
107 allknowingroger/MultiverseEx26-7B-slerp 56.62 Imported 2026-05-06
108 Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3 56.61 Imported 2026-05-06
109 automerger/YamshadowExperiment28-7B 56.60 Imported 2026-05-06
110 Qwen/Qwen1.5-7B 56.59 Imported 2026-05-06
111 NousResearch/Hermes-2-Theta-Llama-3-8B 56.56 Imported 2026-05-06
112 MaziyarPanahi/Calme-4x7B-MoE-v0.2 56.47 Imported 2026-05-06
113 TIGER-Lab/MAmmoTH2-8B-Plus 56.41 Imported 2026-05-06
114 CultriX/NeuralMona_MoE-4x7B 56.35 Imported 2026-05-06
115 chujiezheng/internlm2-chat-7b-ExPO 56.22 Imported 2026-05-06
116 Kukedlc/NeuralLLaMa-3-8b-ORPO-v0.4 56.17 Imported 2026-05-06
117 01-ai/Yi-1.5-6B 56.04 Imported 2026-05-06
118 Kukedlc/NeuralExperiment-7b-MagicCoder-v7.5 55.95 Imported 2026-05-06
119 MaziyarPanahi/Topxtral-4x7B-v0.1 55.93 Imported 2026-05-06
120 chujiezheng/tulu-2-dpo-70b-ExPO 55.85 Imported 2026-05-06
121 abacusai/Smaug-Mixtral-v0.1 55.83 Imported 2026-05-06
122 OpenBuddy/openbuddy-zero-14b-v22.3-32k 55.61 Imported 2026-05-06
123 abacusai/Slerp-CM-mist-dpo 55.25 Imported 2026-05-06
124 NousResearch/Meta-Llama-3-8B-Instruct 55.09 Imported 2026-05-06
125 Qwen/Qwen1.5-MoE-A2.7B 54.82 Imported 2026-05-06
126 NousResearch/Nous-Hermes-2-SOLAR-10.7B 54.76 Imported 2026-05-06
127 HIT-SCIR/Chinese-Mixtral-8x7B 54.68 Imported 2026-05-06
128 Weyaxi/Einstein-v6.1-Llama3-8B 54.07 Imported 2026-05-06
129 chujiezheng/Starling-LM-7B-beta-ExPO 53.72 Imported 2026-05-06
130 abacusai/Llama-3-Smaug-8B 53.66 Imported 2026-05-06
131 cognitivecomputations/Llama-3-8B-Instruct-abliterated-v2 53.65 Imported 2026-05-06
132 TIGER-Lab/MAmmoTH2-7B-Plus 53.57 Imported 2026-05-06
133 Artples/L-MChat-7b 53.30 Imported 2026-05-06
134 abacusai/bigyi-15b 53.23 Imported 2026-05-06
135 01-ai/Yi-6B 52.96 Imported 2026-05-06
136 Qwen/Qwen1.5-MoE-A2.7B-Chat 52.79 Imported 2026-05-06
137 MoaData/Myrrh_solar_10.7b_3.0 52.74 Imported 2026-05-06
138 FlagAlpha/Llama3-Chinese-8B-Instruct 52.56 Imported 2026-05-06
139 MTSAIR/multi_verse_model 52.32 Imported 2026-05-06
140 SeaLLMs/SeaLLM-7B-v2 52.31 Imported 2026-05-06
141 UnicomLLM/Unichat-llama3-Chinese-8B 51.64 Imported 2026-05-06
142 Qwen/Qwen2-1.5B-Instruct 51.53 Imported 2026-05-06
143 Qwen/Qwen1.5-4B-Chat 51.41 Imported 2026-05-06
144 chujiezheng/Starling-LM-7B-alpha-ExPO 51.28 Imported 2026-05-06
145 Qwen/Qwen1.5-4B 50.78 Imported 2026-05-06
146 UCLA-AGI/Mistral7B-PairRM-SPPO-Iter2 50.73 Imported 2026-05-06
147 GritLM/GritLM-7B-KTO 50.42 Imported 2026-05-06
148 UCLA-AGI/Mistral7B-PairRM-SPPO-Iter3 50.33 Imported 2026-05-06
149 beowolx/CodeNinja-1.0-OpenChat-7B 49.98 Imported 2026-05-06
150 cognitivecomputations/dolphin-2.9.1-llama-3-8b 49.57 Imported 2026-05-06
151 THUDM/chatglm3-6b 49.42 Imported 2026-05-06
152 Qwen/Qwen2-1.5B 49.23 Imported 2026-05-06
153 AIJUUD/juud-Mistral-7B-dpo 49.15 Imported 2026-05-06
154 MaziyarPanahi/Mistral-7B-Instruct-v0.2 49.02 Imported 2026-05-06
155 01-ai/Yi-Coder-9B-Chat 48.90 Imported 2026-05-06
156 abacusai/bigstral-12b-32k 48.81 Imported 2026-05-06
157 chujiezheng/Mistral7B-PairRM-SPPO-ExPO 48.64 Imported 2026-05-06
158 01-ai/Yi-Coder-9B 48.38 Imported 2026-05-06
159 cognitivecomputations/dolphin-2.6-mixtral-8x7b 45.69 Imported 2026-05-06
160 aws-prototyping/MegaBeam-Mistral-7B-512k 45.16 Imported 2026-05-06
161 amazon/MegaBeam-Mistral-7B-300k 44.74 Imported 2026-05-06
162 SenseLLM/ReflectionCoder-DS-33B 44.59 Imported 2026-05-06
163 Azure99/blossom-v5.1-9b 44.46 Imported 2026-05-06
164 Qwen/Qwen1.5-1.8B-Chat 43.65 Imported 2026-05-06
165 SenseLLM/ReflectionCoder-CL-34B 43.10 Imported 2026-05-06
166 Infinirc/Infinirc-Llama3-8B-2G-Release-v1.0 42.94 Imported 2026-05-06
167 UnicomLLM/Unichat-llama3-Chinese-8B-28K 42.45 Imported 2026-05-06
168 Qwen/Qwen1.5-1.8B 41.70 Imported 2026-05-06
169 Qwen/Qwen2-0.5B-Instruct 40.04 Imported 2026-05-06
170 OpenBuddy/openbuddy-mistral-22b-v21.1-32k 39.57 Imported 2026-05-06
171 Qwen/Qwen2-0.5B 38.58 Imported 2026-05-06
172 SeaLLMs/SeaLLM-7B-v2.5 38.55 Imported 2026-05-06
173 Qwen/Qwen1.5-0.5B-Chat 36.69 Imported 2026-05-06
174 Qwen/Qwen1.5-0.5B 35.78 Imported 2026-05-06
175 cognitivecomputations/dolphin-2.9.1-mixtral-1x22b 34.95 Imported 2026-05-06
176 RLHFlow/ArmoRM-Llama3-8B-v0.1 34.36 Imported 2026-05-06
177 Artples/L-MChat-Small 32.44 Imported 2026-05-06