OpenBookQA | BenchmarkList

Rank	Subject	Score	Model Match	Provenance	Sampled
1	Phi-3.5-MoE-instruct	0.90	—	Self-reported	2026-05-06
2	Phi-3.5-mini-instruct	0.79	—	Self-reported	2026-05-06
2	Phi 4 Mini	0.79	—	Self-reported	2026-05-06
4	Mistral NeMo Instruct	0.61	Mistral: Mistral Nemo (mistralai-mistral-nemo)	Self-reported	2026-05-06
5	Hermes 3 70B	0.49	—	Self-reported	2026-05-06