PhiBench | BenchmarkList

Metadata

Score, Normalized Score

Rank	Subject	Score	Model Match	Provenance	Sampled
1	Phi 4 Reasoning Plus	0.74	—	Self-reported	2026-05-06
2	Phi 4 Reasoning	0.71	—	Self-reported	2026-05-06
3	Phi 4	0.56	Phi 4 microsoft-phi-4	Self-reported	2026-05-06