NeoEvalPlusN

Public leaderboard for proprietary command-following, distractor-resistance, expectation-breaking, poem, and stylized-writing tests run mainly on open-source LLM variants.

202 rows
total_score primary metric
2026-05-06 sampled

Metadata

Metrics

Total Score, B-test, C-test, D-test, S-test, P-test

Latest Results

Rows are parsed from the public NeoEvalPlusN semicolon-delimited CSV. The synthetic Maximum row is excluded; total_score is the sum of the B, C, D, S, and P scores.
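The import step described above can be sketched roughly as follows. This is a minimal illustration, not the site's actual importer: the column names (`model`, `B`, `C`, `D`, `S`, `P`), the function name `load_rows`, and the sample values are all assumptions made up for the example, and the real CSV may use different headers.

```python
import csv
import io

# Hypothetical column names; the real CSV headers may differ.
SCORE_COLUMNS = ["B", "C", "D", "S", "P"]


def load_rows(text, name_column="model", skip_name="Maximum"):
    """Parse semicolon-delimited CSV text into ranked leaderboard rows."""
    reader = csv.DictReader(io.StringIO(text), delimiter=";")
    rows = []
    for record in reader:
        name = record[name_column].strip()
        if name == skip_name:
            # Drop the synthetic row that holds the maximum attainable scores.
            continue
        total = sum(float(record[col]) for col in SCORE_COLUMNS)
        rows.append({"subject": name, "total_score": total})
    # Highest total first; the sort is stable, so ties keep file order.
    rows.sort(key=lambda row: row["total_score"], reverse=True)
    return rows


# Invented sample data, used only to exercise the function.
sample = """model;B;C;D;S;P
Maximum;5;5;5;5;5
example/model-b;1;1;0.5;4;3.25
example/model-a;3;2;1;8;6
"""

for rank, row in enumerate(load_rows(sample), start=1):
    print(rank, row["subject"], row["total_score"])
```

Numbering the sorted rows with `enumerate` gives tied scores consecutive ranks, which is consistent with how ties appear in the table below.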

Rank Subject Total Score Model Match Provenance Sampled
1 TheDrummer/Behemoth-X-123B-v2 21 Imported 2026-05-06
2 TheDrummer/Behemoth-ReduX-123B-v1 20.25 Imported 2026-05-06
3 ChuckMcSneed/Premerge-XE-XE-123B 20 Imported 2026-05-06
4 ChuckMcSneed/Gembo-v1-70b 19.75 Imported 2026-05-06
5 NousResearch/Hermes-3-Llama-3.1-70B 19.75 Hermes 3 70B Instruct (nousresearch-hermes-3-llama-3.1-70b) Imported 2026-05-06
6 anthracite-org/magnum-v4-123b 19.50 Imported 2026-05-06
7 ChuckMcSneed/ArcaneEntanglement-model64-70b 19.25 Imported 2026-05-06
8 ChuckMcSneed/Gembo-v1.1-70b 19 Imported 2026-05-06
9 zai-org/GLM-4.5 (no thinking) 19 GLM 4.5 (z-ai-glm-4.5) Imported 2026-05-06
10 ChuckMcSneed/Premerge-XE-EX-123B(private) 18.75 Imported 2026-05-06
11 ChuckMcSneed/Premerge-EX-EX-123B 18.75 Imported 2026-05-06
12 crestf411/daybreak-miqu-1-70b-v1.0-hf 18.75 Imported 2026-05-06
13 NousResearch/Hermes-3-Llama-3.1-405B 18.75 Hermes 3 405B Instruct (nousresearch-hermes-3-llama-3.1-405b) Imported 2026-05-06
14 ChuckMcSneed/WinterGoliath-123b 18.50 Imported 2026-05-06
15 ChuckMcSneed/BenchmaxxxerPS-v1-123b 18.50 Imported 2026-05-06
16 crestf411/L3-70B-daybreak-abliterated-v0.4 18.50 Imported 2026-05-06
17 meta-llama/Meta-Llama-3.1-70B-Instruct 18.50 Imported 2026-05-06
18 mistralai/Mistral-Large-Instruct-2407 18.50 Imported 2026-05-06
19 deepseek-ai/DeepSeek-V3 18.50 Imported 2026-05-06
20 ChuckMcSneed/BenchmaxxxerMOE-v1-123b(private) 18.25 Imported 2026-05-06
21 cognitivecomputations/dolphin-2.9.2-qwen2-72b 18.25 Imported 2026-05-06
22 TheDrummer/Behemoth-123B-v2 18.25 Imported 2026-05-06
23 alpindale/goliath-120b 18 Goliath 120B (alpindale-goliath-120b) Imported 2026-05-06
24 Xwin-LM/Xwin-LM-70B-V0.1+euryale-lora 18 Imported 2026-05-06
25 tdrussell/Llama-3-70B-Instruct-Storywriter 18 Imported 2026-05-06
26 Sao10K/L3.1-70B-Euryale-v2.2 18 Imported 2026-05-06
27 zai-org/GLM-4.6 (no thinking) 18 GLM 4.6 (z-ai-glm-4.6) Imported 2026-05-06
28 ChuckMcSneed/Premerge-EX-XE-123B(private) 17.75 Imported 2026-05-06
29 FluffyKaeloky/Luminum-v0.1-123B 17.75 Imported 2026-05-06
30 TheDrummer/Behemoth-123B-v1 17.75 Imported 2026-05-06
31 TheDrummer/Behemoth-123B-v1.1 17.50 Imported 2026-05-06
32 TheDrummer/Behemoth-123B-v2.1 17.50 Imported 2026-05-06
33 TheDrummer/Behemoth-123B-v2.2 17.50 Imported 2026-05-06
34 TheDrummer/Behemoth-123B-v1.2 17.50 Imported 2026-05-06
35 crestf411/L3.1-70B-sunfall-v0.6.1 17.25 Imported 2026-05-06
36 cognitivecomputations/dolphin-2.9.1-dbrx 17 Imported 2026-05-06
37 mistralai/Mistral-Nemo-Instruct-2407 17 Imported 2026-05-06
38 migtissera/Tess-3-Llama-3.1-405B 17 Imported 2026-05-06
39 TheDrummer/Endurance-100B-v1 17 Imported 2026-05-06
40 jondurbin/spicyboros-70b-2.2 16.75 Imported 2026-05-06
41 meta-llama/Meta-Llama-3-70B-Instruct 16.75 Llama 3 70B Instruct (meta-llama-llama-3-70b-instruct) Imported 2026-05-06
42 alpindale/magnum-72b-v1 16.75 Imported 2026-05-06
43 SillyTilly/Meta-Llama-3.1-405B-Instruct 16.75 Imported 2026-05-06
44 nvidia/Llama-3.1-Nemotron-70B-Instruct-HF 16.75 Imported 2026-05-06
45 meta-llama/Llama-3.3-70B-Instruct 16.75 Llama 3.3 70B Instruct (meta-llama-llama-3.3-70b-instruct) Imported 2026-05-06
46 ChuckMcSneed/PMaxxxer-v1-70b 16.50 Imported 2026-05-06
47 ChuckMcSneed/BenchmaxxxerSP-v1-123b(private) 16.50 Imported 2026-05-06
48 sophosympatheia/Aurora-Nights-103B-v1.0 16.50 Imported 2026-05-06
49 openbmb/Eurux-8x22b-nca 16.50 Imported 2026-05-06
50 wolfram/miquliz-120b-v2.0 16.25 Imported 2026-05-06
51 TheDrummer/Endurance-100B-v1.1 16.25 Imported 2026-05-06
52 ChuckMcSneed/BenchmaxxxerSS-v1-123b(private) 16 Imported 2026-05-06
53 Sao10K/Euryale-1.3-L2-70B+xwin-lora 16 Imported 2026-05-06
54 NeverSleep/Lumimaid-v0.2-123B 16 Imported 2026-05-06
55 NeverSleep/Lumimaid-v0.2-70B 16 Imported 2026-05-06
56 moonshotai/Kimi-K2-Instruct-0905 16 MoonshotAI: Kimi K2 0905 (moonshotai-kimi-k2-0905) Imported 2026-05-06
57 anthracite-org/magnum-v2-123b 15.75 Imported 2026-05-06
58 CohereForAI/c4ai-command-a-03-2025 15.75 Imported 2026-05-06
59 TheDrummer/Fallen-Command-A-111B-v1.1 15.75 Imported 2026-05-06
60 zai-org/GLM-4.5-Air (no thinking) 15.75 GLM 4.5 Air (z-ai-glm-4.5-air) Imported 2026-05-06
61 xai-org/grok-2 15.75 Imported 2026-05-06
62 alpindale/WizardLM-2-8x22B 15.50 Imported 2026-05-06
63 mistralai/Mistral-Large-Instruct-2407 15.50 Imported 2026-05-06
64 moonshotai/Kimi-K2-Instruct 15.50 MoonshotAI: Kimi K2 0711 (moonshotai-kimi-k2) Imported 2026-05-06
65 lizpreciatior/lzlv_70b_fp16_hf 15.25 Imported 2026-05-06
66 sophosympatheia/Aurora-Nights-70B-v1.0 15.25 Imported 2026-05-06
67 miqu-123b(personal test merge) 15.25 Imported 2026-05-06
68 migtissera/Tess-3-Mistral-Large-2-123B 15.25 Imported 2026-05-06
69 CohereForAI/c4ai-command-r-plus-08-2024 15.25 Imported 2026-05-06
70 BeaverAI/Fallen-Command-A-111B-v1a-GGUF 15.25 Imported 2026-05-06
71 miqudev/miqu-1-70b 15 Imported 2026-05-06
72 openbmb/Eurux-8x22b-kto 15 Imported 2026-05-06
73 Sao10K/WinterGoddess-1.4x-70B-L2 14.75 Imported 2026-05-06
74 sophosympatheia/Midnight-Rose-70B-v2.0.3 14.75 Imported 2026-05-06
75 mistralai/Mistral-Large-Instruct-2411 14.75 Imported 2026-05-06
76 ChuckMcSneed/SMaxxxer-v1-70b 14.50 Imported 2026-05-06
77 SF-Foundation/EinBase-70B-v0.1-full 14.50 Imported 2026-05-06
78 CohereForAI/c4ai-command-r-plus 14.50 Imported 2026-05-06
79 TheDrummer/Lazarus-2407-100B 14.50 Imported 2026-05-06
80 bartowski/Lazarus-2407-100B-GGUF 14.50 Imported 2026-05-06
81 wolfram/miqu-1-120b 14.25 Imported 2026-05-06
82 billyjoe/quill-72b-instruct 14.25 Imported 2026-05-06
83 mistralai/Mistral-Large-3-675B-Instruct-2512 14.25 Imported 2026-05-06
84 mistralai/Mixtral-8x22B-Instruct-v0.1 14 Imported 2026-05-06
85 deepseek-ai/DeepSeek-V2-Chat-0628 14 Imported 2026-05-06
86 mistralai/Mistral-Small-Instruct-2409 14 Imported 2026-05-06
87 Xwin-LM/Xwin-LM-70B-V0.1 13.75 Imported 2026-05-06
88 tiiuae/falcon-180B-chat 13.75 Imported 2026-05-06
89 wolfram/miqu-1-103b 13.75 Imported 2026-05-06
90 meta-llama/Meta-Llama-3-8B-Instruct 13.75 Llama 3 8B Instruct (meta-llama-llama-3-8b-instruct) Imported 2026-05-06
91 ChuckMcSneed/WinterGoliath-123b-32k 13.50 Imported 2026-05-06
92 mistralai/Mistral-Small-24B-Instruct-2501 13.50 Mistral: Mistral Small 3 (mistralai-mistral-small-24b-instruct-2501) Imported 2026-05-06
93 deepseek-ai/DeepSeek-V3.1 (no thinking) 13.50 Imported 2026-05-06
94 nsfwthrowitaway69/Venus-120b-v1.0 13 Imported 2026-05-06
95 cognitivecomputations/MegaDolphin-120b 13 Imported 2026-05-06
96 CohereForAI/c4ai-command-r-v01 13 Imported 2026-05-06
97 jondurbin/airoboros-dpo-110b-3.3 13 Imported 2026-05-06
98 Undi95/Miqu-70B-Alpaca-DPO 12.75 Imported 2026-05-06
99 miqu-120b-attenuated(experimental merge) 12.75 Imported 2026-05-06
100 jukofyork/Dark-Miqu-70B 12.75 Imported 2026-05-06
101 mistralai/Mixtral-8x7B-Instruct-v0.1 12.50 Mistral: Mixtral 8x7B Instruct (mistralai-mixtral-8x7b-instruct) Imported 2026-05-06
102 anthracite-org/magnum-v2-72b 12.50 Imported 2026-05-06
103 anthracite-org/magnum-v3-34b 12.50 Imported 2026-05-06
104 Nexusflow/Athene-V2-Chat 12.50 Imported 2026-05-06
105 NousResearch/Nous-Hermes-Llama2-70b 12.25 Imported 2026-05-06
106 grimulkan/lzlv-longLORA-70b-rope8-32k-fp16 12.25 Imported 2026-05-06
107 sophosympatheia/Midnight-Miqu-70B-v1.5 12.25 Imported 2026-05-06
108 meta-llama/Meta-Llama-3.1-8B-Instruct 12.25 Imported 2026-05-06
109 WizardLM/WizardLM-70B-V1.0 12 Imported 2026-05-06
110 ShinojiResearch/Senku-70B-Full 12 Imported 2026-05-06
111 Qwen/Qwen1.5-110B-Chat 12 Imported 2026-05-06
112 cognitivecomputations/dolphin-2.9.1-qwen-110b 12 Imported 2026-05-06
113 Mikael110/llama-2-70b-guanaco-qlora 11.75 Imported 2026-05-06
114 TheDrummer/Fallen-Command-A-111B-v1 11.75 Imported 2026-05-06
115 nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 (no thinking) 11.75 Imported 2026-05-06
116 THUDM/GLM-4-32B-0414 11.75 Imported 2026-05-06
117 Doctor-Shotgun/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss 11.50 Imported 2026-05-06
118 CausalLM/34b-beta 11.50 Imported 2026-05-06
119 jondurbin/airoboros-110b-3.3 11.50 Imported 2026-05-06
120 Qwen/Qwen2.5-72B-Instruct 11.50 Qwen2.5 72B Instruct (qwen-qwen-2.5-72b-instruct) Imported 2026-05-06
121 mistralai/Mistral-Small-3.1-24B-Instruct-2503 11.50 Mistral: Mistral Small 3.1 24B (mistralai-mistral-small-3.1-24b-instruct) Imported 2026-05-06
122 grimulkan/Goliath-longLORA-120b-rope8-32k-fp16 11.25 Imported 2026-05-06
123 DiscoResearch/DiscoLM-70b 11 Imported 2026-05-06
124 cognitivecomputations/dolphin-2.2-70b 11 Imported 2026-05-06
125 databricks/dbrx-instruct 11 Imported 2026-05-06
126 jarradh/llama2_70b_chat_uncensored 10.75 Imported 2026-05-06
127 CohereForAI/c4ai-command-r-08-2024 10.75 Imported 2026-05-06
128 Qwen/Qwen1.5-72B-Chat 10.50 Imported 2026-05-06
129 dnhkng/RYS-XLarge 10.50 Imported 2026-05-06
130 ChuckMcSneed/DoubleGold-v0.1-123b-32k 10.25 Imported 2026-05-06
131 meta-llama/Llama-4-Scout-17B-16E-Instruct 10.25 Llama 4 Scout (meta-llama-llama-4-scout) Imported 2026-05-06
132 Sao10K/Euryale-1.3-L2-70B 10 Imported 2026-05-06
133 google/gemma-2-27b-it 10 Gemma 2 27B (google-gemma-2-27b-it) Imported 2026-05-06
134 google/gemma-3-27b-it 10 Gemma 3 27B (google-gemma-3-27b-it) Imported 2026-05-06
135 Qwen/Qwen3-235B-A22B (no thinking) 10 Qwen3 235B A22B (qwen-qwen3-235b-a22b) Imported 2026-05-06
136 XWIN-32k-experimental(equivalent to grimulkan/Xwin-longLORA-70b-rope8-32k-fp16?) 9.75 Imported 2026-05-06
137 deepnight-research/saily_100b 9.75 Imported 2026-05-06
138 senseable/WestLake-7B-v2 9.75 Imported 2026-05-06
139 google/gemma-2-9b-it 9.75 Imported 2026-05-06
140 Qwen/Qwen-72B-Chat 9.50 Imported 2026-05-06
141 Doctor-Shotgun/mythospice-70b 9.50 Imported 2026-05-06
142 deepseek-ai/deepseek-llm-67b-chat 9.50 Imported 2026-05-06
143 Qwen/Qwen3-32B (no thinking) 9.50 Qwen3 32B (qwen-qwen3-32b) Imported 2026-05-06
144 tiiuae/Falcon-H1-34B-Instruct 9.50 Imported 2026-05-06
145 Qwen/Qwen3-235B-A22B (no thinking) 9.25 Qwen3 235B A22B (qwen-qwen3-235b-a22b) Imported 2026-05-06
146 NousResearch/Nous-Capybara-34B 9 Imported 2026-05-06
147 ChatGPT/12-2023 9 Imported 2026-05-06
148 KaeriJenti/kaori-70b-v1 9 Imported 2026-05-06
149 sophosympatheia/Midnight-Rose-70B-v1.0 9 Imported 2026-05-06
150 MayaPH/GodziLLa2-70B 9 Imported 2026-05-06
151 augtoma/qCammel-70-x 8.75 Imported 2026-05-06
152 ChuckMcSneed/DoubleGold-v0.5-123b-32k 8.75 Imported 2026-05-06
153 TeeZee/Kyllene-34B-v1.1 8.75 Imported 2026-05-06
154 MaziyarPanahi/WizardLM-Math-70B-v0.1 8.75 Imported 2026-05-06
155 migtissera/Tess-70B-v1.6 8.50 Imported 2026-05-06
156 upstage/SOLAR-0-70b-16bit 8.25 Imported 2026-05-06
157 ChuckMcSneed/WinterGoddess-1.4x-70b-32k 8.25 Imported 2026-05-06
158 NousResearch/Nous-Hermes-2-Llama-2-70B 8 Imported 2026-05-06
159 budecosystem/genz-70b 8 Imported 2026-05-06
160 goliath-120b-attenuated(experimental merge) 8 Imported 2026-05-06
161 inclusionAI/Ling-1T 8 Imported 2026-05-06
162 garage-bAInd/Platypus2-70B 7.50 Imported 2026-05-06
163 deepnight-research/Saily_220B 7.50 Imported 2026-05-06
164 TomGrc/FusionNet_7Bx2_MoE_v0.1 7.50 Imported 2026-05-06
165 Yukang/LongAlpaca-70B 7.25 Imported 2026-05-06
166 ChuckMcSneed/Dicephal-123B+longlora 7.25 Imported 2026-05-06
167 Xwin-LM/Xwin-LM-70B-V0.1+longlora 7.25 Imported 2026-05-06
168 ICBU-NPU/FashionGPT-70B-V1.1 7.25 Imported 2026-05-06
169 ValiantLabs/ShiningValiant 7.25 Imported 2026-05-06
170 NeverSleep/MiquMaid-v2-70B-DPO 7.25 Imported 2026-05-06
171 allenai/tulu-2-dpo-70b 7 Imported 2026-05-06
172 migtissera/SynthIA-70B-v1.5 6.75 Imported 2026-05-06
173 NousResearch/Nous-Puffin-70B 6.75 Imported 2026-05-06
174 grimulkan/aurelian-v0.5-70b-rope8-32K-fp16 6.75 Imported 2026-05-06
175 berkeley-nest/Starling-LM-7B-alpha 6.75 Imported 2026-05-06
176 NeverSleep/Lumimaid-v0.2-12B 6.75 Imported 2026-05-06
177 ibivibiv/strix-rufipes-70b 6.25 Imported 2026-05-06
178 cloudyu/Mixtral_34Bx2_MoE_60B 6.25 Imported 2026-05-06
179 Xwin-LM/Xwin-LM-70B-V0.1+chat-longlora 6 Imported 2026-05-06
180 Nexusflow/Starling-LM-7B-beta 6 Imported 2026-05-06
181 upstage/SOLAR-10.7B-Instruct-v1.0 5.75 Imported 2026-05-06
182 abacusai/Smaug-34B-v0.1 5.75 Imported 2026-05-06
183 meta-llama/Meta-Llama-3-70B 5.75 Imported 2026-05-06
184 fblgit/una-cybertron-7b-v3-OMA 5.50 Imported 2026-05-06
185 ChuckMcSneed/Dicephal-123B 5.50 Imported 2026-05-06
186 elinas/chronos-70b-v2 5.25 Imported 2026-05-06
187 Brillibits/Instruct_Llama70B_Dolly15k 5.25 Imported 2026-05-06
188 Chat-Error/fiction.live-Kimiko-V2-70B 4.75 Imported 2026-05-06
189 mistral-community/Mixtral-8x22B-v0.1 4.75 Imported 2026-05-06
190 llama3-123b-selfmerge 4.75 Imported 2026-05-06
191 Xwin-LM/Xwin-LM-70B-V0.1+LongAlpaca-lora 4.25 Imported 2026-05-06
192 grimulkan/aurelian-alpha0.1-70b-rope8-32K-fp16 4.25 Imported 2026-05-06
193 databricks/dbrx-base 4.25 Imported 2026-05-06
194 fblgit/una-xaberius-34b-v1beta 4 Imported 2026-05-06
195 Yukang/LongAlpaca-70B-lora 4 Imported 2026-05-06
196 meta-llama/Llama-2-70b-hf 3.75 Imported 2026-05-06
197 alpindale/miquella-120b(old) 3.50 Imported 2026-05-06
198 epfl-llm/meditron-70b 3.25 Imported 2026-05-06
199 Yukang/Llama-2-70b-longlora-32k 3.25 Imported 2026-05-06
200 abacusai/Smaug-72B-v0.1(llama chat format) 3 Imported 2026-05-06
201 abacusai/Smaug-72B-v0.1(Alpaca format) 2.75 Imported 2026-05-06
202 Qwen/Qwen-72B 2.50 Imported 2026-05-06