BRIDGE Medical Leaderboard

Clinical practice text understanding leaderboard for medical LLMs, covering summarization, dialogue, clinical evidence, and EHR-oriented tasks across multiple prompting settings.

321rows
average_performanceprimary metric
2026-05-27sampled

Metadata

Metrics

Average Performance, ADE-Identification, BrainMRI-AIS, Brateca-Hospitalization, Brateca-Mortality, Cantemist-Coding, CARES-Area, CARES-ICD10 Chapter, CARES ICD10 Block, CARES-ICD10 Subblock, C-EMRS, ClinicalNotes-UPMC, PPTS, CLIP, DialMed, EHRQA-Primary department, EHRQA-Sub department, GOUT-CC-Consensus, JP-STS, MEDIQA 2019-RQE, MedNLI, MedSTS, MTS, MEDIQA 2023-sum-A, RuMedDaNet, CBLUE-CDN, CHIP-CTC, IMCS-V2-DAC, RuMedNLI, CLISTER, IFMIR-Incident type, MIMIC-IV CDM, MIMIC-III Outcome.LoS, MIMIC-III Outcome.Mortality, MIMIC-IV DiReCT.Dis, MIMIC-IV DiReCT.PDD, ADE-Extraction, ADE-Drug dosage, BARR2, Cantemis-NER, Cantemis-Norm, CHIP-CDEE, CodiEsp-ICD-10-CM, CodiEsp-ICD-10-PCS, CLINpt-NER, DiSMed-NER, MIE, Ex4CDS, n2c2 2006-De-identification, Medication extraction, n2c2 2010-Concept, n2c2 2010-Assertion, n2c2 2010-Relation, n2c2 2014-De-identification, IMCS-V2-NER, meddocan, MTS-Temporal, n2c2 2018-ADE&medication, NorSynthClinical-NER, NorSynthClinical-RE, NUBES, CHIP-MDCFNPC, IMCS-V2-SR, n2c2 2014-Diabetes, n2c2 2014-CAD, n2c2 2014-Hyperlipidemia, n2c2 2014-Hypertension, n2c2 2014-Medication, CAS-label, RuDReC-NER, NorSynthClinical-PHI, RuCCoN, BRONCO150-NER&Status, CARDIO-DE, GraSSCo PHI, IFMIR-NER, IFMIR - NER&factuality, iCorpus, cMedQA, EHRQA-QA, MEDIQA 2023-chat-A, MEDIQA 2023-sum-B, MedDG, IMCS-V2-MRG, CAS-evidence, icliniq-10k, HealthCareMagic-100k, MIMIC-IV BHC

Latest Results

Rows parsed from the BRIDGE Medical public Hugging Face leaderboard JSON files. Each model and prompting setting is retained as a separate configuration row.

Rank Subject Average Performance Model Match Provenance Sampled
1 gemini-1.5-pro-002 (Few-Shot) 55.51 Imported 2026-05-27
2 gemma-4-31B-it (Few-Shot) 54.88 Gemma 4 31B
google-gemma-4-31b-it
Imported 2026-05-27
3 gemini-2.5-flash (Few-Shot) 53.36 Gemini 2.5 Flash
google-gemini-2.5-flash
Imported 2026-05-27
4 gemini-2.0-flash-001 (Few-Shot) 53.33 Gemini 2.0 Flash
google-gemini-2.0-flash
Imported 2026-05-27
5 gemma-4-26B-A4B-it (Few-Shot) 53.22 Gemma 4 26B A4B
google-gemma-4-26b-a4b-it
Imported 2026-05-27
6 gpt-4o-0806 (Few-Shot) 52.59 GPT-4o
openai-gpt-4o
Imported 2026-05-27
7 medgemma-27b-it (Few-Shot) 51.97 Imported 2026-05-27
8 medgemma-27b-text-it (Few-Shot) 51.73 Imported 2026-05-27
9 DeepSeek-R1 (Few-Shot) 51.38 R1
deepseek-r1
Imported 2026-05-27
10 Qwen2.5-72B-Instruct (Few-Shot) 50.99 Qwen2.5 72B Instruct
qwen-qwen-2.5-72b-instruct
Imported 2026-05-27
11 Qwen3-Next-80B-A3B-Thinking (Few-Shot) 50.9 Qwen3 Next 80B A3B Thinking
qwen-qwen3-next-80b-a3b-thinking
Imported 2026-05-27
12 Mistral-Large-Instruct-2411 (Few-Shot) 50.68 Imported 2026-05-27
13 Athene-V2-Chat (Few-Shot) 50.68 Imported 2026-05-27
14 Qwen3-Next-80B-A3B-Instruct (Few-Shot) 50.56 Qwen3 Next 80B A3B Instruct
qwen-qwen3-next-80b-a3b-instruct
Imported 2026-05-27
15 Llama-3.1-70B-Instruct (Few-Shot) 50.52 Llama 3.1 70B Instruct
meta-llama-llama-3.1-70b-instruct
Imported 2026-05-27
16 Qwen3-30B-A3B-Thinking-2507 (Few-Shot) 50.26 Qwen3 30B A3B Thinking 2507
qwen-qwen3-30b-a3b-thinking-2507
Imported 2026-05-27
17 HuatuoGPT-o1-72B (Few-Shot) 50.21 Imported 2026-05-27
18 Qwen2.5-32B-Instruct (Few-Shot) 49.54 Imported 2026-05-27
19 Llama-3.3-70B-Instruct (Few-Shot) 49.49 Llama 3.3 70B Instruct
meta-llama-llama-3.3-70b-instruct
Imported 2026-05-27
20 gemma-3-27b-it (Few-Shot) 49.45 Gemma 3 27B
google-gemma-3-27b-it
Imported 2026-05-27
21 Magistral-Small-2506 (Few-Shot) 49.15 Imported 2026-05-27
22 Qwen3-32B-Non-Thinking (Few-Shot) 48.94 Imported 2026-05-27
23 gemma-4-E4B-it (Few-Shot) 48.79 Imported 2026-05-27
24 Qwen3-235B-A22B-Non-Thinking (Few-Shot) 48.71 Qwen3 235B A22B
qwen-qwen3-235b-a22b
Imported 2026-05-27
25 Qwen3-30B-A3B-Instruct-2507 (Few-Shot) 48.62 Qwen3 30B A3B Instruct 2507
qwen-qwen3-30b-a3b-instruct-2507
Imported 2026-05-27
26 QwQ-32B-Preview (Few-Shot) 48.43 Imported 2026-05-27
27 Baichuan-M1-14B-Instruct (Few-Shot) 48.3 Imported 2026-05-27
28 Hulu-Med-32B (Few-Shot) 48.3 Imported 2026-05-27
29 QWQ-32B (Few-Shot) 48.19 Imported 2026-05-27
30 Biomni-R0-32B-Preview-Non-Thinking (Few-Shot) 48.13 Imported 2026-05-27
31 Qwen3-235B-A22B-Thinking (Few-Shot) 48.11 Qwen3 235B A22B Thinking 2507
qwen-qwen3-235b-a22b-thinking-2507
Imported 2026-05-27
32 gemma-3-12b-it (Few-Shot) 47.73 Gemma 3 12B
google-gemma-3-12b-it
Imported 2026-05-27
33 Qwen3-30B-A3B-Non-Thinking (Few-Shot) 47.4 Imported 2026-05-27
34 Qwen3-32B-Thinking (Few-Shot) 47.38 Imported 2026-05-27
35 Qwen3-30B-A3B-Thinking (Few-Shot) 47 Imported 2026-05-27
36 Qwen3-4B-Thinking-2507 (Few-Shot) 46.97 Imported 2026-05-27
37 Phi-4 (Few-Shot) 46.8 Phi 4
microsoft-phi-4
Imported 2026-05-27
38 Qwen3-14B-Thinking (Few-Shot) 46.79 Imported 2026-05-27
39 gemma-4-31B-it (Zero-Shot) 46.74 Gemma 4 31B
google-gemma-4-31b-it
Imported 2026-05-27
40 Qwen3-14B-Non-Thinking (Few-Shot) 46.64 Imported 2026-05-27
41 DeepSeek-R1-Distill-Llama-70B (Few-Shot) 46.17 R1 Distill Llama 70B
deepseek-deepseek-r1-distill-llama-70b
Imported 2026-05-27
42 gemma-4-31B-it (CoT) 45.88 Gemma 4 31B
google-gemma-4-31b-it
Imported 2026-05-27
43 Hulu-Med-14B (Few-Shot) 45.87 Imported 2026-05-27
44 gemma-2-27b-it (Few-Shot) 45.79 Gemma 2 27B
google-gemma-2-27b-it
Imported 2026-05-27
45 Qwen3-8B-Non-Thinking (Few-Shot) 45.52 Imported 2026-05-27
46 Baichuan-M2-32B (Few-Shot) 45.21 Imported 2026-05-27
47 Qwen3-8B-Thinking (Few-Shot) 45.19 Imported 2026-05-27
48 gemma-4-26B-A4B-it (CoT) 45.17 Gemma 4 26B A4B
google-gemma-4-26b-a4b-it
Imported 2026-05-27
49 gemini-2.5-flash (Zero-Shot) 44.84 Gemini 2.5 Flash
google-gemini-2.5-flash
Imported 2026-05-27
50 Qwen3-4B-Instruct-2507 (Few-Shot) 44.83 Imported 2026-05-27
51 Llama3-OpenBioLLM-70B (Few-Shot) 44.51 Imported 2026-05-27
52 gemma-4-26B-A4B-it (Zero-Shot) 44.51 Gemma 4 26B A4B
google-gemma-4-26b-a4b-it
Imported 2026-05-27
53 Llama-3-70B-UltraMedical (Few-Shot) 44.43 Imported 2026-05-27
54 DeepSeek-R1-Distill-Qwen-32B (Few-Shot) 44.33 R1 Distill Qwen 32B
deepseek-deepseek-r1-distill-qwen-32b
Imported 2026-05-27
55 DeepSeek-R1 (Zero-Shot) 44.25 R1
deepseek-r1
Imported 2026-05-27
56 gpt-4o-0806 (Zero-Shot) 44.2 GPT-4o
openai-gpt-4o
Imported 2026-05-27
57 Biomni-R0-32B-Preview-Thinking (Few-Shot) 44.1 Imported 2026-05-27
58 Qwen3-Next-80B-A3B-Thinking (Zero-Shot) 43.86 Qwen3 Next 80B A3B Thinking
qwen-qwen3-next-80b-a3b-thinking
Imported 2026-05-27
59 gemini-1.5-pro-002 (Zero-Shot) 43.85 Imported 2026-05-27
60 gpt-35-turbo-0125 (Few-Shot) 43.61 GPT-3.5 Turbo
openai-gpt-3.5-turbo
Imported 2026-05-27
61 gemma-2-9b-it (Few-Shot) 43.54 Imported 2026-05-27
62 Llama-3.1-8B-Instruct (Few-Shot) 43.54 Llama 3.1 8B Instruct
meta-llama-llama-3.1-8b-instruct
Imported 2026-05-27
63 QwenLong-L1-32B (Few-Shot) 43.43 Imported 2026-05-27
64 gemini-2.5-flash (CoT) 43.29 Gemini 2.5 Flash
google-gemini-2.5-flash
Imported 2026-05-27
65 HuatuoGPT-o1-70B (Few-Shot) 43.11 Imported 2026-05-27
66 gemini-2.0-flash-001 (Zero-Shot) 43.03 Gemini 2.0 Flash
google-gemini-2.0-flash
Imported 2026-05-27
67 DeepSeek-R1-0528-Qwen3-8B (Few-Shot) 42.94 Imported 2026-05-27
68 Qwen3-Next-80B-A3B-Thinking (CoT) 42.9 Qwen3 Next 80B A3B Thinking
qwen-qwen3-next-80b-a3b-thinking
Imported 2026-05-27
69 Llama-3.1-Nemotron-70B-Instruct-HF (Few-Shot) 42.79 Imported 2026-05-27
70 Qwen3-4B-Non-Thinking (Few-Shot) 42.77 Imported 2026-05-27
71 K2-Think (Few-Shot) 42.68 Imported 2026-05-27
72 Qwen3-4B-Thinking (Few-Shot) 42.68 Imported 2026-05-27
73 gemma-4-E2B-it (Few-Shot) 42.31 Imported 2026-05-27
74 Mistral-Large-Instruct-2411 (Zero-Shot) 42.28 Imported 2026-05-27
75 DeepSeek-R1 (CoT) 42.1 R1
deepseek-r1
Imported 2026-05-27
76 gemini-2.0-flash-001 (CoT) 41.98 Gemini 2.0 Flash
google-gemini-2.0-flash
Imported 2026-05-27
77 Qwen3-30B-A3B-Thinking-2507 (Zero-Shot) 41.82 Qwen3 30B A3B Thinking 2507
qwen-qwen3-30b-a3b-thinking-2507
Imported 2026-05-27
78 Athene-V2-Chat (Zero-Shot) 41.69 Imported 2026-05-27
79 Qwen3-235B-A22B-Thinking (Zero-Shot) 41.63 Qwen3 235B A22B Thinking 2507
qwen-qwen3-235b-a22b-thinking-2507
Imported 2026-05-27
80 Qwen2.5-72B-Instruct (Zero-Shot) 41.62 Qwen2.5 72B Instruct
qwen-qwen-2.5-72b-instruct
Imported 2026-05-27
81 Qwen2.5-7B-Instruct (Few-Shot) 41.6 Qwen2.5 7B Instruct
qwen-qwen-2.5-7b-instruct
Imported 2026-05-27
82 Qwen3-30B-A3B-Thinking-2507 (CoT) 41.43 Qwen3 30B A3B Thinking 2507
qwen-qwen3-30b-a3b-thinking-2507
Imported 2026-05-27
83 DeepSeek-R1-Distill-Qwen-14B (Few-Shot) 41.4 Imported 2026-05-27
84 Qwen3-32B-Thinking (Zero-Shot) 41.04 Imported 2026-05-27
85 HuatuoGPT-o1-72B (Zero-Shot) 41.01 Imported 2026-05-27
86 Yi-1.5-34B-Chat-16K (Few-Shot) 40.97 Imported 2026-05-27
87 Qwen3-30B-A3B-Thinking (Zero-Shot) 40.93 Imported 2026-05-27
88 Mistral-Small-3.1-24B-Instruct-2503 (Few-Shot) 40.92 Mistral: Mistral Small 3.1 24B
mistralai-mistral-small-3.1-24b-instruct
Imported 2026-05-27
89 medgemma-27b-it (Zero-Shot) 40.8 Imported 2026-05-27
90 gpt-4o-0806 (CoT) 40.66 GPT-4o
openai-gpt-4o
Imported 2026-05-27
91 Llama-4-Scout-17B-16E-Instruct (Few-Shot) 40.64 Llama 4 Scout
meta-llama-llama-4-scout
Imported 2026-05-27
92 Hulu-Med-7B (Few-Shot) 40.6 Imported 2026-05-27
93 gemini-1.5-pro-002 (CoT) 40.53 Imported 2026-05-27
94 Qwen3-Next-80B-A3B-Instruct (CoT) 40.5 Qwen3 Next 80B A3B Instruct
qwen-qwen3-next-80b-a3b-instruct
Imported 2026-05-27
95 medgemma-27b-text-it (Zero-Shot) 40.47 Imported 2026-05-27
96 HuatuoGPT-o1-8B (Few-Shot) 40.43 Imported 2026-05-27
97 HuatuoGPT-o1-7B (Few-Shot) 40.36 Imported 2026-05-27
98 Qwen3-14B-Thinking (Zero-Shot) 40.17 Imported 2026-05-27
99 Qwen3-235B-A22B-Thinking (CoT) 40.14 Qwen3 235B A22B Thinking 2507
qwen-qwen3-235b-a22b-thinking-2507
Imported 2026-05-27
100 Qwen3-4B-Thinking-2507 (Zero-Shot) 40.04 Imported 2026-05-27
101 Qwen3-8B-Thinking (Zero-Shot) 39.98 Imported 2026-05-27
102 gemma-3-27b-it (Zero-Shot) 39.9 Gemma 3 27B
google-gemma-3-27b-it
Imported 2026-05-27
103 Qwen2.5-32B-Instruct (Zero-Shot) 39.89 Imported 2026-05-27
104 Llama-3.3-70B-Instruct (Zero-Shot) 39.86 Llama 3.3 70B Instruct
meta-llama-llama-3.3-70b-instruct
Imported 2026-05-27
105 Qwen3-Next-80B-A3B-Instruct (Zero-Shot) 39.81 Qwen3 Next 80B A3B Instruct
qwen-qwen3-next-80b-a3b-instruct
Imported 2026-05-27
106 DeepSeek-R1-Distill-Llama-70B (Zero-Shot) 39.79 R1 Distill Llama 70B
deepseek-deepseek-r1-distill-llama-70b
Imported 2026-05-27
107 DeepSeek-R1-Distill-Qwen-32B (Zero-Shot) 39.75 R1 Distill Qwen 32B
deepseek-deepseek-r1-distill-qwen-32b
Imported 2026-05-27
108 Mistral-Small-3.1-24B-Instruct-2503 (Zero-Shot) 39.73 Mistral: Mistral Small 3.1 24B
mistralai-mistral-small-3.1-24b-instruct
Imported 2026-05-27
109 Mistral-Small-24B-Instruct-2501 (Few-Shot) 39.66 Mistral: Mistral Small 3
mistralai-mistral-small-24b-instruct-2501
Imported 2026-05-27
110 Ministral-8B-Instruct-2410 (Few-Shot) 39.62 Imported 2026-05-27
111 QWQ-32B (Zero-Shot) 39.37 Imported 2026-05-27
112 Qwen3-30B-A3B-Thinking (CoT) 39.35 Imported 2026-05-27
113 Athene-V2-Chat (CoT) 39.34 Imported 2026-05-27
114 Qwen3-32B-Non-Thinking (Zero-Shot) 39.28 Imported 2026-05-27
115 QwenLong-L1-32B (Zero-Shot) 39.25 Imported 2026-05-27
116 Qwen3-235B-A22B-Non-Thinking (Zero-Shot) 39.21 Qwen3 235B A22B
qwen-qwen3-235b-a22b
Imported 2026-05-27
117 gemma-4-E4B-it (Zero-Shot) 39.19 Imported 2026-05-27
118 Biomni-R0-32B-Preview-Non-Thinking (Zero-Shot) 39.18 Imported 2026-05-27
119 Llama-3.1-70B-Instruct (Zero-Shot) 39.09 Llama 3.1 70B Instruct
meta-llama-llama-3.1-70b-instruct
Imported 2026-05-27
120 gpt-oss-120b (Few-Shot) 39.04 gpt-oss-120b
openai-gpt-oss-120b
Imported 2026-05-27
121 DeepSeek-R1-Distill-Llama-70B (CoT) 38.95 R1 Distill Llama 70B
deepseek-deepseek-r1-distill-llama-70b
Imported 2026-05-27
122 Mistral-Large-Instruct-2411 (CoT) 38.9 Imported 2026-05-27
123 Qwen2.5-72B-Instruct (CoT) 38.86 Qwen2.5 72B Instruct
qwen-qwen-2.5-72b-instruct
Imported 2026-05-27
124 Qwen3-4B-Thinking-2507 (CoT) 38.84 Imported 2026-05-27
125 gemma-3-4b-it (Few-Shot) 38.82 Gemma 3 4B
google-gemma-3-4b-it
Imported 2026-05-27
126 DeepSeek-R1-Distill-Qwen-32B (CoT) 38.72 R1 Distill Qwen 32B
deepseek-deepseek-r1-distill-qwen-32b
Imported 2026-05-27
127 Qwen3-4B-Thinking (Zero-Shot) 38.5 Imported 2026-05-27
128 Baichuan-M2-32B (CoT) 38.3 Imported 2026-05-27
129 Hulu-Med-32B (Zero-Shot) 38.29 Imported 2026-05-27
130 gemma-2-27b-it (Zero-Shot) 38.22 Gemma 2 27B
google-gemma-2-27b-it
Imported 2026-05-27
131 medgemma-27b-it (CoT) 38.2 Imported 2026-05-27
132 HuatuoGPT-o1-72B (CoT) 38.15 Imported 2026-05-27
133 gemma-4-E4B-it (CoT) 38.06 Imported 2026-05-27
134 Baichuan-M2-32B (Zero-Shot) 38.02 Imported 2026-05-27
135 Qwen3-235B-A22B-Non-Thinking (CoT) 38 Qwen3 235B A22B
qwen-qwen3-235b-a22b
Imported 2026-05-27
136 medgemma-27b-text-it (CoT) 37.83 Imported 2026-05-27
137 K2-Think (Zero-Shot) 37.81 Imported 2026-05-27
138 Qwen2.5-32B-Instruct (CoT) 37.66 Imported 2026-05-27
139 Magistral-Small-2506 (Zero-Shot) 37.56 Imported 2026-05-27
140 gemma-3-27b-it (CoT) 37.55 Gemma 3 27B
google-gemma-3-27b-it
Imported 2026-05-27
141 Mistral-Small-24B-Instruct-2501 (Zero-Shot) 37.53 Mistral: Mistral Small 3
mistralai-mistral-small-24b-instruct-2501
Imported 2026-05-27
142 Qwen3-30B-A3B-Instruct-2507 (CoT) 37.5 Qwen3 30B A3B Instruct 2507
qwen-qwen3-30b-a3b-instruct-2507
Imported 2026-05-27
143 medgemma-4b-it (Few-Shot) 37.49 Imported 2026-05-27
144 Yi-1.5-9B-Chat-16K (Few-Shot) 37.37 Imported 2026-05-27
145 gemma-3-12b-it (Zero-Shot) 37.32 Gemma 3 12B
google-gemma-3-12b-it
Imported 2026-05-27
146 gpt-oss-120b (Zero-Shot) 37.24 gpt-oss-120b
openai-gpt-oss-120b
Imported 2026-05-27
147 Qwen2.5-3B-Instruct (Few-Shot) 37.18 Imported 2026-05-27
148 Qwen3-14B-Thinking (CoT) 37.07 Imported 2026-05-27
149 Qwen3-8B-Thinking (CoT) 37.06 Imported 2026-05-27
150 QWQ-32B (CoT) 37.03 Imported 2026-05-27
151 Qwen3-4B-Thinking (CoT) 36.98 Imported 2026-05-27
152 DeepSeek-R1-0528-Qwen3-8B (Zero-Shot) 36.94 Imported 2026-05-27
153 Qwen3-30B-A3B-Instruct-2507 (Zero-Shot) 36.87 Qwen3 30B A3B Instruct 2507
qwen-qwen3-30b-a3b-instruct-2507
Imported 2026-05-27
154 Qwen3-30B-A3B-Non-Thinking (CoT) 36.86 Imported 2026-05-27
155 Llama-3.3-70B-Instruct (CoT) 36.83 Llama 3.3 70B Instruct
meta-llama-llama-3.3-70b-instruct
Imported 2026-05-27
156 Qwen3-14B-Non-Thinking (Zero-Shot) 36.8 Imported 2026-05-27
157 QwenLong-L1-32B (CoT) 36.71 Imported 2026-05-27
158 AntAngelMed (Zero-Shot) 36.65 Imported 2026-05-27
159 Phi-3.5-MoE-instruct (Few-Shot) 36.56 Imported 2026-05-27
160 DeepSeek-R1-0528-Qwen3-8B (CoT) 36.28 Imported 2026-05-27
161 Mistral-Small-3.1-24B-Instruct-2503 (CoT) 36.23 Mistral: Mistral Small 3.1 24B
mistralai-mistral-small-3.1-24b-instruct
Imported 2026-05-27
162 Qwen3-4B-Instruct-2507 (CoT) 36.23 Imported 2026-05-27
163 Qwen3-30B-A3B-Non-Thinking (Zero-Shot) 36.15 Imported 2026-05-27
164 Phi-4 (Zero-Shot) 36.13 Phi 4
microsoft-phi-4
Imported 2026-05-27
165 Baichuan-M1-14B-Instruct (Zero-Shot) 36.08 Imported 2026-05-27
166 Mistral-Small-Instruct-2409 (Few-Shot) 35.98 Imported 2026-05-27
167 Hulu-Med-14B (Zero-Shot) 35.45 Imported 2026-05-27
168 gemma-3-12b-it (CoT) 35.37 Gemma 3 12B
google-gemma-3-12b-it
Imported 2026-05-27
169 K2-Think (CoT) 35.35 Imported 2026-05-27
170 Biomni-R0-32B-Preview-Thinking (Zero-Shot) 35.34 Imported 2026-05-27
171 AntAngelMed (Few-Shot) 35.32 Imported 2026-05-27
172 gpt-35-turbo-0125 (Zero-Shot) 35.3 GPT-3.5 Turbo
openai-gpt-3.5-turbo
Imported 2026-05-27
173 Mistral-Small-Instruct-2409 (Zero-Shot) 35.19 Imported 2026-05-27
174 Llama-4-Scout-17B-16E-Instruct (Zero-Shot) 35.12 Llama 4 Scout
meta-llama-llama-4-scout
Imported 2026-05-27
175 Llama-3.1-70B-Instruct (CoT) 35.1 Llama 3.1 70B Instruct
meta-llama-llama-3.1-70b-instruct
Imported 2026-05-27
176 gemma-2-9b-it (Zero-Shot) 35.07 Imported 2026-05-27
177 Llama-3.2-3B-Instruct (Few-Shot) 34.81 Llama 3.2 3B Instruct
meta-llama-llama-3.2-3b-instruct
Imported 2026-05-27
178 DeepSeek-R1-Distill-Qwen-14B (CoT) 34.79 Imported 2026-05-27
179 Qwen3-1.7B-Non-Thinking (Few-Shot) 34.76 Imported 2026-05-27
180 Qwen3-14B-Non-Thinking (CoT) 34.59 Imported 2026-05-27
181 Biomni-R0-32B-Preview-Non-Thinking (CoT) 34.59 Imported 2026-05-27
182 Qwen3-32B-Thinking (CoT) 34.46 Imported 2026-05-27
183 Baichuan-M1-14B-Instruct (CoT) 34.36 Imported 2026-05-27
184 DeepSeek-R1-Distill-Qwen-14B (Zero-Shot) 34.28 Imported 2026-05-27
185 Hulu-Med-32B (CoT) 34.25 Imported 2026-05-27
186 gemma-2-27b-it (CoT) 34.22 Gemma 2 27B
google-gemma-2-27b-it
Imported 2026-05-27
187 Magistral-Small-2506 (CoT) 34.16 Imported 2026-05-27
188 MMed-Llama-3-8B (Few-Shot) 34.14 Imported 2026-05-27
189 Qwen3-4B-Instruct-2507 (Zero-Shot) 34.06 Imported 2026-05-27
190 HuatuoGPT-o1-70B (Zero-Shot) 33.9 Imported 2026-05-27
191 Qwen3-8B-Non-Thinking (Zero-Shot) 33.86 Imported 2026-05-27
192 Llama-3-70B-UltraMedical (Zero-Shot) 33.4 Imported 2026-05-27
193 Qwen3-8B-Non-Thinking (CoT) 33.31 Imported 2026-05-27
194 Qwen3-4B-Non-Thinking (Zero-Shot) 33.29 Imported 2026-05-27
195 Qwen3-4B-Non-Thinking (CoT) 33.2 Imported 2026-05-27
196 Llama3-OpenBioLLM-8B (Few-Shot) 33.12 Imported 2026-05-27
197 Llama3-OpenBioLLM-70B (Zero-Shot) 33.01 Imported 2026-05-27
198 MeLLaMA-70B-chat (Few-Shot) 32.9 Imported 2026-05-27
199 gemma-4-E2B-it (CoT) 32.87 Imported 2026-05-27
200 Qwen3-1.7B-Thinking (Few-Shot) 32.87 Imported 2026-05-27
201 DeepSeek-R1-Distill-Llama-8B (Few-Shot) 32.86 Imported 2026-05-27
202 Qwen3-32B-Non-Thinking (CoT) 32.8 Imported 2026-05-27
203 Llama-3.1-Nemotron-70B-Instruct-HF (Zero-Shot) 32.75 Imported 2026-05-27
204 gemma-4-E2B-it (Zero-Shot) 32.74 Imported 2026-05-27
205 Phi-4 (CoT) 32.59 Phi 4
microsoft-phi-4
Imported 2026-05-27
206 Biomni-R0-32B-Preview-Thinking (CoT) 32.54 Imported 2026-05-27
207 HuatuoGPT-o1-70B (CoT) 32.35 Imported 2026-05-27
208 MeLLaMA-70B-chat (Zero-Shot) 32.26 Imported 2026-05-27
209 Yi-1.5-34B-Chat-16K (Zero-Shot) 32.12 Imported 2026-05-27
210 gpt-oss-120b (CoT) 32.11 gpt-oss-120b
openai-gpt-oss-120b
Imported 2026-05-27
211 meditron-70b (Few-Shot) 32.09 Imported 2026-05-27
212 QwQ-32B-Preview (Zero-Shot) 31.74 Imported 2026-05-27
213 Qwen2.5-1.5B-Instruct (Few-Shot) 31.66 Imported 2026-05-27
214 gpt-35-turbo-0125 (CoT) 31.63 GPT-3.5 Turbo
openai-gpt-3.5-turbo
Imported 2026-05-27
215 Hulu-Med-14B (CoT) 31.61 Imported 2026-05-27
216 Mistral-Small-24B-Instruct-2501 (CoT) 31.59 Mistral: Mistral Small 3
mistralai-mistral-small-24b-instruct-2501
Imported 2026-05-27
217 Phi-3.5-mini-instruct (Few-Shot) 31.33 Imported 2026-05-27
218 Qwen2.5-7B-Instruct (Zero-Shot) 31.32 Qwen2.5 7B Instruct
qwen-qwen-2.5-7b-instruct
Imported 2026-05-27
219 Mistral-Small-Instruct-2409 (CoT) 31.17 Imported 2026-05-27
220 MedReason-8B (Few-Shot) 31.12 Imported 2026-05-27
221 Llama-3.1-8B-UltraMedical (Few-Shot) 30.96 Imported 2026-05-27
222 AntAngelMed (CoT) 30.77 Imported 2026-05-27
223 Hulu-Med-7B (Zero-Shot) 30.71 Imported 2026-05-27
224 Ministral-8B-Instruct-2410 (Zero-Shot) 30.37 Imported 2026-05-27
225 Qwen2.5-7B-Instruct (CoT) 30.25 Qwen2.5 7B Instruct
qwen-qwen-2.5-7b-instruct
Imported 2026-05-27
226 gemma-2-9b-it (CoT) 29.94 Imported 2026-05-27
227 Phi-4-mini-instruct (Few-Shot) 29.93 Imported 2026-05-27
228 HuatuoGPT-o1-7B (Zero-Shot) 29.59 Imported 2026-05-27
229 Yi-1.5-34B-Chat-16K (CoT) 29.57 Imported 2026-05-27
230 Phi-3.5-MoE-instruct (Zero-Shot) 29.54 Imported 2026-05-27
231 Llama-3-70B-UltraMedical (CoT) 29.44 Imported 2026-05-27
232 medgemma-4b-it (Zero-Shot) 29.42 Imported 2026-05-27
233 Llama-3.1-8B-Instruct (CoT) 29.4 Llama 3.1 8B Instruct
meta-llama-llama-3.1-8b-instruct
Imported 2026-05-27
234 Llama-4-Scout-17B-16E-Instruct (CoT) 29.38 Llama 4 Scout
meta-llama-llama-4-scout
Imported 2026-05-27
235 MeLLaMA-70B-chat (CoT) 29.25 Imported 2026-05-27
236 gpt-oss-20b (Zero-Shot) 29.05 gpt-oss-20b
openai-gpt-oss-20b
Imported 2026-05-27
237 Llama-3.1-8B-Instruct (Zero-Shot) 28.98 Llama 3.1 8B Instruct
meta-llama-llama-3.1-8b-instruct
Imported 2026-05-27
238 Yi-1.5-9B-Chat-16K (Zero-Shot) 28.81 Imported 2026-05-27
239 Llama3-OpenBioLLM-70B (CoT) 28.78 Imported 2026-05-27
240 gemma-3-4b-it (Zero-Shot) 28.56 Gemma 3 4B
google-gemma-3-4b-it
Imported 2026-05-27
241 medgemma-4b-it (CoT) 28.5 Imported 2026-05-27
242 DeepSeek-R1-Distill-Llama-8B (Zero-Shot) 28.48 Imported 2026-05-27
243 gemma-3-4b-it (CoT) 28.19 Gemma 3 4B
google-gemma-3-4b-it
Imported 2026-05-27
244 Qwen3-1.7B-Thinking (CoT) 27.71 Imported 2026-05-27
245 Hulu-Med-7B (CoT) 27.6 Imported 2026-05-27
246 DeepSeek-R1-Distill-Llama-8B (CoT) 27.34 Imported 2026-05-27
247 Llama-2-70b-chat (Few-Shot) 27.14 Imported 2026-05-27
248 MeLLaMA-13B-chat (Few-Shot) 27.01 Imported 2026-05-27
249 HuatuoGPT-o1-7B (CoT) 26.71 Imported 2026-05-27
250 Qwen2.5-3B-Instruct (Zero-Shot) 26.59 Imported 2026-05-27
251 Qwen3-0.6B-Non-Thinking (Few-Shot) 26.31 Imported 2026-05-27
252 Qwen3-1.7B-Thinking (Zero-Shot) 26.28 Imported 2026-05-27
253 gemma-3-1b-it (Few-Shot) 25.92 Imported 2026-05-27
254 Ministral-8B-Instruct-2410 (CoT) 25.91 Imported 2026-05-27
255 HuatuoGPT-o1-8B (Zero-Shot) 25.86 Imported 2026-05-27
256 Qwen2.5-3B-Instruct (CoT) 25.44 Imported 2026-05-27
257 Phi-3.5-mini-instruct (Zero-Shot) 25.41 Imported 2026-05-27
258 Yi-1.5-9B-Chat-16K (CoT) 25.4 Imported 2026-05-27
259 Phi-3.5-MoE-instruct (CoT) 25.27 Imported 2026-05-27
260 DeepSeek-R1-Distill-Qwen-7B (Zero-Shot) 25.27 Imported 2026-05-27
261 Llama-2-70b-chat (Zero-Shot) 25.15 Imported 2026-05-27
262 gpt-oss-20b (Few-Shot) 25.14 gpt-oss-20b
openai-gpt-oss-20b
Imported 2026-05-27
263 Qwen3-0.6B-Thinking (Few-Shot) 24.87 Imported 2026-05-27
264 gpt-oss-20b (CoT) 24.86 gpt-oss-20b
openai-gpt-oss-20b
Imported 2026-05-27
265 BioMistral-7B (Few-Shot) 24.66 Imported 2026-05-27
266 Phi-4-mini-instruct (Zero-Shot) 24.54 Imported 2026-05-27
267 Llama-3.2-1B-Instruct (Few-Shot) 24.43 Llama 3.2 1B Instruct
meta-llama-llama-3.2-1b-instruct
Imported 2026-05-27
268 Llama-3.1-Nemotron-70B-Instruct-HF (CoT) 24.09 Imported 2026-05-27
269 Phi-3.5-mini-instruct (CoT) 23.91 Imported 2026-05-27
270 DeepSeek-R1-Distill-Qwen-7B (CoT) 23.87 Imported 2026-05-27
271 Llama-2-13b-chat (Few-Shot) 23.55 Imported 2026-05-27
272 QwQ-32B-Preview (CoT) 23.31 Imported 2026-05-27
273 DeepSeek-R1-Distill-Qwen-7B (Few-Shot) 23.07 Imported 2026-05-27
274 Llama-3.2-3B-Instruct (Zero-Shot) 22.9 Llama 3.2 3B Instruct
meta-llama-llama-3.2-3b-instruct
Imported 2026-05-27
275 Qwen3-1.7B-Non-Thinking (CoT) 22.76 Imported 2026-05-27
276 Phi-4-mini-instruct (CoT) 22.5 Imported 2026-05-27
277 HuatuoGPT-o1-8B (CoT) 22.48 Imported 2026-05-27
278 Llama-2-7b-chat (Few-Shot) 22.33 Imported 2026-05-27
279 Qwen2.5-1.5B-Instruct (Zero-Shot) 22.16 Imported 2026-05-27
280 Qwen3-1.7B-Non-Thinking (Zero-Shot) 21.95 Imported 2026-05-27
281 Llama-3.2-3B-Instruct (CoT) 21.6 Llama 3.2 3B Instruct
meta-llama-llama-3.2-3b-instruct
Imported 2026-05-27
282 Phi-4-mini-reasoning (Zero-Shot) 21.31 Imported 2026-05-27
283 Llama-2-13b-chat (Zero-Shot) 20.91 Imported 2026-05-27
284 MeLLaMA-13B-chat (Zero-Shot) 20.76 Imported 2026-05-27
285 BioMistral-7B (Zero-Shot) 20.43 Imported 2026-05-27
286 Qwen3-0.6B-Thinking (Zero-Shot) 20.38 Imported 2026-05-27
287 MMed-Llama-3-8B (Zero-Shot) 20.37 Imported 2026-05-27
288 MeLLaMA-13B-chat (CoT) 20.26 Imported 2026-05-27
289 Llama-3.1-8B-UltraMedical (Zero-Shot) 20.16 Imported 2026-05-27
290 Phi-4-mini-reasoning (CoT) 19.79 Imported 2026-05-27
291 OpenThinker3-7B (Zero-Shot) 19.79 Imported 2026-05-27
292 Qwen2.5-1.5B-Instruct (CoT) 19.47 Imported 2026-05-27
293 Llama-2-70b-chat (CoT) 19.02 Imported 2026-05-27
294 Qwen3-0.6B-Thinking (CoT) 18.95 Imported 2026-05-27
295 OpenThinker3-7B (CoT) 18.37 Imported 2026-05-27
296 Llama-3.1-8B-UltraMedical (CoT) 18.34 Imported 2026-05-27
297 MedReason-8B (CoT) 18.29 Imported 2026-05-27
298 MedReason-8B (Zero-Shot) 18.17 Imported 2026-05-27
299 meditron-7b (Few-Shot) 17.04 Imported 2026-05-27
300 Qwen3-0.6B-Non-Thinking (CoT) 16.59 Imported 2026-05-27
301 Llama-2-7b-chat (Zero-Shot) 16.47 Imported 2026-05-27
302 Llama-2-13b-chat (CoT) 16.24 Imported 2026-05-27
303 MMed-Llama-3-8B (CoT) 16.17 Imported 2026-05-27
304 gemma-3-1b-it (Zero-Shot) 15.73 Imported 2026-05-27
305 meditron-70b (Zero-Shot) 15.68 Imported 2026-05-27
306 Phi-4-mini-reasoning (Few-Shot) 15.51 Imported 2026-05-27
307 Qwen3-0.6B-Non-Thinking (Zero-Shot) 15.21 Imported 2026-05-27
308 DeepSeek-R1-Distill-Qwen-1.5B (Few-Shot) 14.94 Imported 2026-05-27
309 DeepSeek-R1-Distill-Qwen-1.5B (Zero-Shot) 14.26 Imported 2026-05-27
310 Llama3-OpenBioLLM-8B (Zero-Shot) 14.2 Imported 2026-05-27
311 Llama-2-7b-chat (CoT) 13.66 Imported 2026-05-27
312 gemma-3-1b-it (CoT) 13.53 Imported 2026-05-27
313 DeepSeek-R1-Distill-Qwen-1.5B (CoT) 13.42 Imported 2026-05-27
314 Llama3-OpenBioLLM-8B (CoT) 13.29 Imported 2026-05-27
315 meditron-70b (CoT) 13.17 Imported 2026-05-27
316 OpenThinker3-7B (Few-Shot) 13.01 Imported 2026-05-27
317 Llama-3.2-1B-Instruct (Zero-Shot) 12.72 Llama 3.2 1B Instruct
meta-llama-llama-3.2-1b-instruct
Imported 2026-05-27
318 Llama-3.2-1B-Instruct (CoT) 11.86 Llama 3.2 1B Instruct
meta-llama-llama-3.2-1b-instruct
Imported 2026-05-27
319 BioMistral-7B (CoT) 10.84 Imported 2026-05-27
320 meditron-7b (CoT) 9.52 Imported 2026-05-27
321 meditron-7b (Zero-Shot) 9.52 Imported 2026-05-27