BRIDGE Medical Leaderboard
Clinical practice text understanding leaderboard for medical LLMs, covering summarization, dialogue, clinical evidence, and EHR-oriented tasks across multiple prompting settings.
Metadata
Metrics
Average Performance, ADE-Identification, BrainMRI-AIS, Brateca-Hospitalization, Brateca-Mortality, Cantemist-Coding, CARES-Area, CARES-ICD10 Chapter, CARES ICD10 Block, CARES-ICD10 Subblock, C-EMRS, ClinicalNotes-UPMC, PPTS, CLIP, DialMed, EHRQA-Primary department, EHRQA-Sub department, GOUT-CC-Consensus, JP-STS, MEDIQA 2019-RQE, MedNLI, MedSTS, MTS, MEDIQA 2023-sum-A, RuMedDaNet, CBLUE-CDN, CHIP-CTC, IMCS-V2-DAC, RuMedNLI, CLISTER, IFMIR-Incident type, MIMIC-IV CDM, MIMIC-III Outcome.LoS, MIMIC-III Outcome.Mortality, MIMIC-IV DiReCT.Dis, MIMIC-IV DiReCT.PDD, ADE-Extraction, ADE-Drug dosage, BARR2, Cantemis-NER, Cantemis-Norm, CHIP-CDEE, CodiEsp-ICD-10-CM, CodiEsp-ICD-10-PCS, CLINpt-NER, DiSMed-NER, MIE, Ex4CDS, n2c2 2006-De-identification, Medication extraction, n2c2 2010-Concept, n2c2 2010-Assertion, n2c2 2010-Relation, n2c2 2014-De-identification, IMCS-V2-NER, meddocan, MTS-Temporal, n2c2 2018-ADE&medication, NorSynthClinical-NER, NorSynthClinical-RE, NUBES, CHIP-MDCFNPC, IMCS-V2-SR, n2c2 2014-Diabetes, n2c2 2014-CAD, n2c2 2014-Hyperlipidemia, n2c2 2014-Hypertension, n2c2 2014-Medication, CAS-label, RuDReC-NER, NorSynthClinical-PHI, RuCCoN, BRONCO150-NER&Status, CARDIO-DE, GraSSCo PHI, IFMIR-NER, IFMIR - NER&factuality, iCorpus, cMedQA, EHRQA-QA, MEDIQA 2023-chat-A, MEDIQA 2023-sum-B, MedDG, IMCS-V2-MRG, CAS-evidence, icliniq-10k, HealthCareMagic-100k, MIMIC-IV BHC
| Rank | Subject | Average Performance | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | gemini-1.5-pro-002 (Few-Shot) | 55.51 | — | Imported | 2026-05-27 |
| 2 | gemma-4-31B-it (Few-Shot) | 54.88 | Gemma 4 31B google-gemma-4-31b-it | Imported | 2026-05-27 |
| 3 | gemini-2.5-flash (Few-Shot) | 53.36 | Gemini 2.5 Flash google-gemini-2.5-flash | Imported | 2026-05-27 |
| 4 | gemini-2.0-flash-001 (Few-Shot) | 53.33 | Gemini 2.0 Flash google-gemini-2.0-flash | Imported | 2026-05-27 |
| 5 | gemma-4-26B-A4B-it (Few-Shot) | 53.22 | Gemma 4 26B A4B google-gemma-4-26b-a4b-it | Imported | 2026-05-27 |
| 6 | gpt-4o-0806 (Few-Shot) | 52.59 | GPT-4o openai-gpt-4o | Imported | 2026-05-27 |
| 7 | medgemma-27b-it (Few-Shot) | 51.97 | — | Imported | 2026-05-27 |
| 8 | medgemma-27b-text-it (Few-Shot) | 51.73 | — | Imported | 2026-05-27 |
| 9 | DeepSeek-R1 (Few-Shot) | 51.38 | R1 deepseek-r1 | Imported | 2026-05-27 |
| 10 | Qwen2.5-72B-Instruct (Few-Shot) | 50.99 | Qwen2.5 72B Instruct qwen-qwen-2.5-72b-instruct | Imported | 2026-05-27 |
| 11 | Qwen3-Next-80B-A3B-Thinking (Few-Shot) | 50.9 | Qwen3 Next 80B A3B Thinking qwen-qwen3-next-80b-a3b-thinking | Imported | 2026-05-27 |
| 12 | Mistral-Large-Instruct-2411 (Few-Shot) | 50.68 | — | Imported | 2026-05-27 |
| 13 | Athene-V2-Chat (Few-Shot) | 50.68 | — | Imported | 2026-05-27 |
| 14 | Qwen3-Next-80B-A3B-Instruct (Few-Shot) | 50.56 | Qwen3 Next 80B A3B Instruct qwen-qwen3-next-80b-a3b-instruct | Imported | 2026-05-27 |
| 15 | Llama-3.1-70B-Instruct (Few-Shot) | 50.52 | Llama 3.1 70B Instruct meta-llama-llama-3.1-70b-instruct | Imported | 2026-05-27 |
| 16 | Qwen3-30B-A3B-Thinking-2507 (Few-Shot) | 50.26 | Qwen3 30B A3B Thinking 2507 qwen-qwen3-30b-a3b-thinking-2507 | Imported | 2026-05-27 |
| 17 | HuatuoGPT-o1-72B (Few-Shot) | 50.21 | — | Imported | 2026-05-27 |
| 18 | Qwen2.5-32B-Instruct (Few-Shot) | 49.54 | — | Imported | 2026-05-27 |
| 19 | Llama-3.3-70B-Instruct (Few-Shot) | 49.49 | Llama 3.3 70B Instruct meta-llama-llama-3.3-70b-instruct | Imported | 2026-05-27 |
| 20 | gemma-3-27b-it (Few-Shot) | 49.45 | Gemma 3 27B google-gemma-3-27b-it | Imported | 2026-05-27 |
| 21 | Magistral-Small-2506 (Few-Shot) | 49.15 | — | Imported | 2026-05-27 |
| 22 | Qwen3-32B-Non-Thinking (Few-Shot) | 48.94 | — | Imported | 2026-05-27 |
| 23 | gemma-4-E4B-it (Few-Shot) | 48.79 | — | Imported | 2026-05-27 |
| 24 | Qwen3-235B-A22B-Non-Thinking (Few-Shot) | 48.71 | Qwen3 235B A22B qwen-qwen3-235b-a22b | Imported | 2026-05-27 |
| 25 | Qwen3-30B-A3B-Instruct-2507 (Few-Shot) | 48.62 | Qwen3 30B A3B Instruct 2507 qwen-qwen3-30b-a3b-instruct-2507 | Imported | 2026-05-27 |
| 26 | QwQ-32B-Preview (Few-Shot) | 48.43 | — | Imported | 2026-05-27 |
| 27 | Baichuan-M1-14B-Instruct (Few-Shot) | 48.3 | — | Imported | 2026-05-27 |
| 28 | Hulu-Med-32B (Few-Shot) | 48.3 | — | Imported | 2026-05-27 |
| 29 | QWQ-32B (Few-Shot) | 48.19 | — | Imported | 2026-05-27 |
| 30 | Biomni-R0-32B-Preview-Non-Thinking (Few-Shot) | 48.13 | — | Imported | 2026-05-27 |
| 31 | Qwen3-235B-A22B-Thinking (Few-Shot) | 48.11 | Qwen3 235B A22B Thinking 2507 qwen-qwen3-235b-a22b-thinking-2507 | Imported | 2026-05-27 |
| 32 | gemma-3-12b-it (Few-Shot) | 47.73 | Gemma 3 12B google-gemma-3-12b-it | Imported | 2026-05-27 |
| 33 | Qwen3-30B-A3B-Non-Thinking (Few-Shot) | 47.4 | — | Imported | 2026-05-27 |
| 34 | Qwen3-32B-Thinking (Few-Shot) | 47.38 | — | Imported | 2026-05-27 |
| 35 | Qwen3-30B-A3B-Thinking (Few-Shot) | 47 | — | Imported | 2026-05-27 |
| 36 | Qwen3-4B-Thinking-2507 (Few-Shot) | 46.97 | — | Imported | 2026-05-27 |
| 37 | Phi-4 (Few-Shot) | 46.8 | Phi 4 microsoft-phi-4 | Imported | 2026-05-27 |
| 38 | Qwen3-14B-Thinking (Few-Shot) | 46.79 | — | Imported | 2026-05-27 |
| 39 | gemma-4-31B-it (Zero-Shot) | 46.74 | Gemma 4 31B google-gemma-4-31b-it | Imported | 2026-05-27 |
| 40 | Qwen3-14B-Non-Thinking (Few-Shot) | 46.64 | — | Imported | 2026-05-27 |
| 41 | DeepSeek-R1-Distill-Llama-70B (Few-Shot) | 46.17 | R1 Distill Llama 70B deepseek-deepseek-r1-distill-llama-70b | Imported | 2026-05-27 |
| 42 | gemma-4-31B-it (CoT) | 45.88 | Gemma 4 31B google-gemma-4-31b-it | Imported | 2026-05-27 |
| 43 | Hulu-Med-14B (Few-Shot) | 45.87 | — | Imported | 2026-05-27 |
| 44 | gemma-2-27b-it (Few-Shot) | 45.79 | Gemma 2 27B google-gemma-2-27b-it | Imported | 2026-05-27 |
| 45 | Qwen3-8B-Non-Thinking (Few-Shot) | 45.52 | — | Imported | 2026-05-27 |
| 46 | Baichuan-M2-32B (Few-Shot) | 45.21 | — | Imported | 2026-05-27 |
| 47 | Qwen3-8B-Thinking (Few-Shot) | 45.19 | — | Imported | 2026-05-27 |
| 48 | gemma-4-26B-A4B-it (CoT) | 45.17 | Gemma 4 26B A4B google-gemma-4-26b-a4b-it | Imported | 2026-05-27 |
| 49 | gemini-2.5-flash (Zero-Shot) | 44.84 | Gemini 2.5 Flash google-gemini-2.5-flash | Imported | 2026-05-27 |
| 50 | Qwen3-4B-Instruct-2507 (Few-Shot) | 44.83 | — | Imported | 2026-05-27 |
| 51 | Llama3-OpenBioLLM-70B (Few-Shot) | 44.51 | — | Imported | 2026-05-27 |
| 52 | gemma-4-26B-A4B-it (Zero-Shot) | 44.51 | Gemma 4 26B A4B google-gemma-4-26b-a4b-it | Imported | 2026-05-27 |
| 53 | Llama-3-70B-UltraMedical (Few-Shot) | 44.43 | — | Imported | 2026-05-27 |
| 54 | DeepSeek-R1-Distill-Qwen-32B (Few-Shot) | 44.33 | R1 Distill Qwen 32B deepseek-deepseek-r1-distill-qwen-32b | Imported | 2026-05-27 |
| 55 | DeepSeek-R1 (Zero-Shot) | 44.25 | R1 deepseek-r1 | Imported | 2026-05-27 |
| 56 | gpt-4o-0806 (Zero-Shot) | 44.2 | GPT-4o openai-gpt-4o | Imported | 2026-05-27 |
| 57 | Biomni-R0-32B-Preview-Thinking (Few-Shot) | 44.1 | — | Imported | 2026-05-27 |
| 58 | Qwen3-Next-80B-A3B-Thinking (Zero-Shot) | 43.86 | Qwen3 Next 80B A3B Thinking qwen-qwen3-next-80b-a3b-thinking | Imported | 2026-05-27 |
| 59 | gemini-1.5-pro-002 (Zero-Shot) | 43.85 | — | Imported | 2026-05-27 |
| 60 | gpt-35-turbo-0125 (Few-Shot) | 43.61 | GPT-3.5 Turbo openai-gpt-3.5-turbo | Imported | 2026-05-27 |
| 61 | gemma-2-9b-it (Few-Shot) | 43.54 | — | Imported | 2026-05-27 |
| 62 | Llama-3.1-8B-Instruct (Few-Shot) | 43.54 | Llama 3.1 8B Instruct meta-llama-llama-3.1-8b-instruct | Imported | 2026-05-27 |
| 63 | QwenLong-L1-32B (Few-Shot) | 43.43 | — | Imported | 2026-05-27 |
| 64 | gemini-2.5-flash (CoT) | 43.29 | Gemini 2.5 Flash google-gemini-2.5-flash | Imported | 2026-05-27 |
| 65 | HuatuoGPT-o1-70B (Few-Shot) | 43.11 | — | Imported | 2026-05-27 |
| 66 | gemini-2.0-flash-001 (Zero-Shot) | 43.03 | Gemini 2.0 Flash google-gemini-2.0-flash | Imported | 2026-05-27 |
| 67 | DeepSeek-R1-0528-Qwen3-8B (Few-Shot) | 42.94 | — | Imported | 2026-05-27 |
| 68 | Qwen3-Next-80B-A3B-Thinking (CoT) | 42.9 | Qwen3 Next 80B A3B Thinking qwen-qwen3-next-80b-a3b-thinking | Imported | 2026-05-27 |
| 69 | Llama-3.1-Nemotron-70B-Instruct-HF (Few-Shot) | 42.79 | — | Imported | 2026-05-27 |
| 70 | Qwen3-4B-Non-Thinking (Few-Shot) | 42.77 | — | Imported | 2026-05-27 |
| 71 | K2-Think (Few-Shot) | 42.68 | — | Imported | 2026-05-27 |
| 72 | Qwen3-4B-Thinking (Few-Shot) | 42.68 | — | Imported | 2026-05-27 |
| 73 | gemma-4-E2B-it (Few-Shot) | 42.31 | — | Imported | 2026-05-27 |
| 74 | Mistral-Large-Instruct-2411 (Zero-Shot) | 42.28 | — | Imported | 2026-05-27 |
| 75 | DeepSeek-R1 (CoT) | 42.1 | R1 deepseek-r1 | Imported | 2026-05-27 |
| 76 | gemini-2.0-flash-001 (CoT) | 41.98 | Gemini 2.0 Flash google-gemini-2.0-flash | Imported | 2026-05-27 |
| 77 | Qwen3-30B-A3B-Thinking-2507 (Zero-Shot) | 41.82 | Qwen3 30B A3B Thinking 2507 qwen-qwen3-30b-a3b-thinking-2507 | Imported | 2026-05-27 |
| 78 | Athene-V2-Chat (Zero-Shot) | 41.69 | — | Imported | 2026-05-27 |
| 79 | Qwen3-235B-A22B-Thinking (Zero-Shot) | 41.63 | Qwen3 235B A22B Thinking 2507 qwen-qwen3-235b-a22b-thinking-2507 | Imported | 2026-05-27 |
| 80 | Qwen2.5-72B-Instruct (Zero-Shot) | 41.62 | Qwen2.5 72B Instruct qwen-qwen-2.5-72b-instruct | Imported | 2026-05-27 |
| 81 | Qwen2.5-7B-Instruct (Few-Shot) | 41.6 | Qwen2.5 7B Instruct qwen-qwen-2.5-7b-instruct | Imported | 2026-05-27 |
| 82 | Qwen3-30B-A3B-Thinking-2507 (CoT) | 41.43 | Qwen3 30B A3B Thinking 2507 qwen-qwen3-30b-a3b-thinking-2507 | Imported | 2026-05-27 |
| 83 | DeepSeek-R1-Distill-Qwen-14B (Few-Shot) | 41.4 | — | Imported | 2026-05-27 |
| 84 | Qwen3-32B-Thinking (Zero-Shot) | 41.04 | — | Imported | 2026-05-27 |
| 85 | HuatuoGPT-o1-72B (Zero-Shot) | 41.01 | — | Imported | 2026-05-27 |
| 86 | Yi-1.5-34B-Chat-16K (Few-Shot) | 40.97 | — | Imported | 2026-05-27 |
| 87 | Qwen3-30B-A3B-Thinking (Zero-Shot) | 40.93 | — | Imported | 2026-05-27 |
| 88 | Mistral-Small-3.1-24B-Instruct-2503 (Few-Shot) | 40.92 | Mistral: Mistral Small 3.1 24B mistralai-mistral-small-3.1-24b-instruct | Imported | 2026-05-27 |
| 89 | medgemma-27b-it (Zero-Shot) | 40.8 | — | Imported | 2026-05-27 |
| 90 | gpt-4o-0806 (CoT) | 40.66 | GPT-4o openai-gpt-4o | Imported | 2026-05-27 |
| 91 | Llama-4-Scout-17B-16E-Instruct (Few-Shot) | 40.64 | Llama 4 Scout meta-llama-llama-4-scout | Imported | 2026-05-27 |
| 92 | Hulu-Med-7B (Few-Shot) | 40.6 | — | Imported | 2026-05-27 |
| 93 | gemini-1.5-pro-002 (CoT) | 40.53 | — | Imported | 2026-05-27 |
| 94 | Qwen3-Next-80B-A3B-Instruct (CoT) | 40.5 | Qwen3 Next 80B A3B Instruct qwen-qwen3-next-80b-a3b-instruct | Imported | 2026-05-27 |
| 95 | medgemma-27b-text-it (Zero-Shot) | 40.47 | — | Imported | 2026-05-27 |
| 96 | HuatuoGPT-o1-8B (Few-Shot) | 40.43 | — | Imported | 2026-05-27 |
| 97 | HuatuoGPT-o1-7B (Few-Shot) | 40.36 | — | Imported | 2026-05-27 |
| 98 | Qwen3-14B-Thinking (Zero-Shot) | 40.17 | — | Imported | 2026-05-27 |
| 99 | Qwen3-235B-A22B-Thinking (CoT) | 40.14 | Qwen3 235B A22B Thinking 2507 qwen-qwen3-235b-a22b-thinking-2507 | Imported | 2026-05-27 |
| 100 | Qwen3-4B-Thinking-2507 (Zero-Shot) | 40.04 | — | Imported | 2026-05-27 |
| 101 | Qwen3-8B-Thinking (Zero-Shot) | 39.98 | — | Imported | 2026-05-27 |
| 102 | gemma-3-27b-it (Zero-Shot) | 39.9 | Gemma 3 27B google-gemma-3-27b-it | Imported | 2026-05-27 |
| 103 | Qwen2.5-32B-Instruct (Zero-Shot) | 39.89 | — | Imported | 2026-05-27 |
| 104 | Llama-3.3-70B-Instruct (Zero-Shot) | 39.86 | Llama 3.3 70B Instruct meta-llama-llama-3.3-70b-instruct | Imported | 2026-05-27 |
| 105 | Qwen3-Next-80B-A3B-Instruct (Zero-Shot) | 39.81 | Qwen3 Next 80B A3B Instruct qwen-qwen3-next-80b-a3b-instruct | Imported | 2026-05-27 |
| 106 | DeepSeek-R1-Distill-Llama-70B (Zero-Shot) | 39.79 | R1 Distill Llama 70B deepseek-deepseek-r1-distill-llama-70b | Imported | 2026-05-27 |
| 107 | DeepSeek-R1-Distill-Qwen-32B (Zero-Shot) | 39.75 | R1 Distill Qwen 32B deepseek-deepseek-r1-distill-qwen-32b | Imported | 2026-05-27 |
| 108 | Mistral-Small-3.1-24B-Instruct-2503 (Zero-Shot) | 39.73 | Mistral: Mistral Small 3.1 24B mistralai-mistral-small-3.1-24b-instruct | Imported | 2026-05-27 |
| 109 | Mistral-Small-24B-Instruct-2501 (Few-Shot) | 39.66 | Mistral: Mistral Small 3 mistralai-mistral-small-24b-instruct-2501 | Imported | 2026-05-27 |
| 110 | Ministral-8B-Instruct-2410 (Few-Shot) | 39.62 | — | Imported | 2026-05-27 |
| 111 | QWQ-32B (Zero-Shot) | 39.37 | — | Imported | 2026-05-27 |
| 112 | Qwen3-30B-A3B-Thinking (CoT) | 39.35 | — | Imported | 2026-05-27 |
| 113 | Athene-V2-Chat (CoT) | 39.34 | — | Imported | 2026-05-27 |
| 114 | Qwen3-32B-Non-Thinking (Zero-Shot) | 39.28 | — | Imported | 2026-05-27 |
| 115 | QwenLong-L1-32B (Zero-Shot) | 39.25 | — | Imported | 2026-05-27 |
| 116 | Qwen3-235B-A22B-Non-Thinking (Zero-Shot) | 39.21 | Qwen3 235B A22B qwen-qwen3-235b-a22b | Imported | 2026-05-27 |
| 117 | gemma-4-E4B-it (Zero-Shot) | 39.19 | — | Imported | 2026-05-27 |
| 118 | Biomni-R0-32B-Preview-Non-Thinking (Zero-Shot) | 39.18 | — | Imported | 2026-05-27 |
| 119 | Llama-3.1-70B-Instruct (Zero-Shot) | 39.09 | Llama 3.1 70B Instruct meta-llama-llama-3.1-70b-instruct | Imported | 2026-05-27 |
| 120 | gpt-oss-120b (Few-Shot) | 39.04 | gpt-oss-120b openai-gpt-oss-120b | Imported | 2026-05-27 |
| 121 | DeepSeek-R1-Distill-Llama-70B (CoT) | 38.95 | R1 Distill Llama 70B deepseek-deepseek-r1-distill-llama-70b | Imported | 2026-05-27 |
| 122 | Mistral-Large-Instruct-2411 (CoT) | 38.9 | — | Imported | 2026-05-27 |
| 123 | Qwen2.5-72B-Instruct (CoT) | 38.86 | Qwen2.5 72B Instruct qwen-qwen-2.5-72b-instruct | Imported | 2026-05-27 |
| 124 | Qwen3-4B-Thinking-2507 (CoT) | 38.84 | — | Imported | 2026-05-27 |
| 125 | gemma-3-4b-it (Few-Shot) | 38.82 | Gemma 3 4B google-gemma-3-4b-it | Imported | 2026-05-27 |
| 126 | DeepSeek-R1-Distill-Qwen-32B (CoT) | 38.72 | R1 Distill Qwen 32B deepseek-deepseek-r1-distill-qwen-32b | Imported | 2026-05-27 |
| 127 | Qwen3-4B-Thinking (Zero-Shot) | 38.5 | — | Imported | 2026-05-27 |
| 128 | Baichuan-M2-32B (CoT) | 38.3 | — | Imported | 2026-05-27 |
| 129 | Hulu-Med-32B (Zero-Shot) | 38.29 | — | Imported | 2026-05-27 |
| 130 | gemma-2-27b-it (Zero-Shot) | 38.22 | Gemma 2 27B google-gemma-2-27b-it | Imported | 2026-05-27 |
| 131 | medgemma-27b-it (CoT) | 38.2 | — | Imported | 2026-05-27 |
| 132 | HuatuoGPT-o1-72B (CoT) | 38.15 | — | Imported | 2026-05-27 |
| 133 | gemma-4-E4B-it (CoT) | 38.06 | — | Imported | 2026-05-27 |
| 134 | Baichuan-M2-32B (Zero-Shot) | 38.02 | — | Imported | 2026-05-27 |
| 135 | Qwen3-235B-A22B-Non-Thinking (CoT) | 38 | Qwen3 235B A22B qwen-qwen3-235b-a22b | Imported | 2026-05-27 |
| 136 | medgemma-27b-text-it (CoT) | 37.83 | — | Imported | 2026-05-27 |
| 137 | K2-Think (Zero-Shot) | 37.81 | — | Imported | 2026-05-27 |
| 138 | Qwen2.5-32B-Instruct (CoT) | 37.66 | — | Imported | 2026-05-27 |
| 139 | Magistral-Small-2506 (Zero-Shot) | 37.56 | — | Imported | 2026-05-27 |
| 140 | gemma-3-27b-it (CoT) | 37.55 | Gemma 3 27B google-gemma-3-27b-it | Imported | 2026-05-27 |
| 141 | Mistral-Small-24B-Instruct-2501 (Zero-Shot) | 37.53 | Mistral: Mistral Small 3 mistralai-mistral-small-24b-instruct-2501 | Imported | 2026-05-27 |
| 142 | Qwen3-30B-A3B-Instruct-2507 (CoT) | 37.5 | Qwen3 30B A3B Instruct 2507 qwen-qwen3-30b-a3b-instruct-2507 | Imported | 2026-05-27 |
| 143 | medgemma-4b-it (Few-Shot) | 37.49 | — | Imported | 2026-05-27 |
| 144 | Yi-1.5-9B-Chat-16K (Few-Shot) | 37.37 | — | Imported | 2026-05-27 |
| 145 | gemma-3-12b-it (Zero-Shot) | 37.32 | Gemma 3 12B google-gemma-3-12b-it | Imported | 2026-05-27 |
| 146 | gpt-oss-120b (Zero-Shot) | 37.24 | gpt-oss-120b openai-gpt-oss-120b | Imported | 2026-05-27 |
| 147 | Qwen2.5-3B-Instruct (Few-Shot) | 37.18 | — | Imported | 2026-05-27 |
| 148 | Qwen3-14B-Thinking (CoT) | 37.07 | — | Imported | 2026-05-27 |
| 149 | Qwen3-8B-Thinking (CoT) | 37.06 | — | Imported | 2026-05-27 |
| 150 | QWQ-32B (CoT) | 37.03 | — | Imported | 2026-05-27 |
| 151 | Qwen3-4B-Thinking (CoT) | 36.98 | — | Imported | 2026-05-27 |
| 152 | DeepSeek-R1-0528-Qwen3-8B (Zero-Shot) | 36.94 | — | Imported | 2026-05-27 |
| 153 | Qwen3-30B-A3B-Instruct-2507 (Zero-Shot) | 36.87 | Qwen3 30B A3B Instruct 2507 qwen-qwen3-30b-a3b-instruct-2507 | Imported | 2026-05-27 |
| 154 | Qwen3-30B-A3B-Non-Thinking (CoT) | 36.86 | — | Imported | 2026-05-27 |
| 155 | Llama-3.3-70B-Instruct (CoT) | 36.83 | Llama 3.3 70B Instruct meta-llama-llama-3.3-70b-instruct | Imported | 2026-05-27 |
| 156 | Qwen3-14B-Non-Thinking (Zero-Shot) | 36.8 | — | Imported | 2026-05-27 |
| 157 | QwenLong-L1-32B (CoT) | 36.71 | — | Imported | 2026-05-27 |
| 158 | AntAngelMed (Zero-Shot) | 36.65 | — | Imported | 2026-05-27 |
| 159 | Phi-3.5-MoE-instruct (Few-Shot) | 36.56 | — | Imported | 2026-05-27 |
| 160 | DeepSeek-R1-0528-Qwen3-8B (CoT) | 36.28 | — | Imported | 2026-05-27 |
| 161 | Mistral-Small-3.1-24B-Instruct-2503 (CoT) | 36.23 | Mistral: Mistral Small 3.1 24B mistralai-mistral-small-3.1-24b-instruct | Imported | 2026-05-27 |
| 162 | Qwen3-4B-Instruct-2507 (CoT) | 36.23 | — | Imported | 2026-05-27 |
| 163 | Qwen3-30B-A3B-Non-Thinking (Zero-Shot) | 36.15 | — | Imported | 2026-05-27 |
| 164 | Phi-4 (Zero-Shot) | 36.13 | Phi 4 microsoft-phi-4 | Imported | 2026-05-27 |
| 165 | Baichuan-M1-14B-Instruct (Zero-Shot) | 36.08 | — | Imported | 2026-05-27 |
| 166 | Mistral-Small-Instruct-2409 (Few-Shot) | 35.98 | — | Imported | 2026-05-27 |
| 167 | Hulu-Med-14B (Zero-Shot) | 35.45 | — | Imported | 2026-05-27 |
| 168 | gemma-3-12b-it (CoT) | 35.37 | Gemma 3 12B google-gemma-3-12b-it | Imported | 2026-05-27 |
| 169 | K2-Think (CoT) | 35.35 | — | Imported | 2026-05-27 |
| 170 | Biomni-R0-32B-Preview-Thinking (Zero-Shot) | 35.34 | — | Imported | 2026-05-27 |
| 171 | AntAngelMed (Few-Shot) | 35.32 | — | Imported | 2026-05-27 |
| 172 | gpt-35-turbo-0125 (Zero-Shot) | 35.3 | GPT-3.5 Turbo openai-gpt-3.5-turbo | Imported | 2026-05-27 |
| 173 | Mistral-Small-Instruct-2409 (Zero-Shot) | 35.19 | — | Imported | 2026-05-27 |
| 174 | Llama-4-Scout-17B-16E-Instruct (Zero-Shot) | 35.12 | Llama 4 Scout meta-llama-llama-4-scout | Imported | 2026-05-27 |
| 175 | Llama-3.1-70B-Instruct (CoT) | 35.1 | Llama 3.1 70B Instruct meta-llama-llama-3.1-70b-instruct | Imported | 2026-05-27 |
| 176 | gemma-2-9b-it (Zero-Shot) | 35.07 | — | Imported | 2026-05-27 |
| 177 | Llama-3.2-3B-Instruct (Few-Shot) | 34.81 | Llama 3.2 3B Instruct meta-llama-llama-3.2-3b-instruct | Imported | 2026-05-27 |
| 178 | DeepSeek-R1-Distill-Qwen-14B (CoT) | 34.79 | — | Imported | 2026-05-27 |
| 179 | Qwen3-1.7B-Non-Thinking (Few-Shot) | 34.76 | — | Imported | 2026-05-27 |
| 180 | Qwen3-14B-Non-Thinking (CoT) | 34.59 | — | Imported | 2026-05-27 |
| 181 | Biomni-R0-32B-Preview-Non-Thinking (CoT) | 34.59 | — | Imported | 2026-05-27 |
| 182 | Qwen3-32B-Thinking (CoT) | 34.46 | — | Imported | 2026-05-27 |
| 183 | Baichuan-M1-14B-Instruct (CoT) | 34.36 | — | Imported | 2026-05-27 |
| 184 | DeepSeek-R1-Distill-Qwen-14B (Zero-Shot) | 34.28 | — | Imported | 2026-05-27 |
| 185 | Hulu-Med-32B (CoT) | 34.25 | — | Imported | 2026-05-27 |
| 186 | gemma-2-27b-it (CoT) | 34.22 | Gemma 2 27B google-gemma-2-27b-it | Imported | 2026-05-27 |
| 187 | Magistral-Small-2506 (CoT) | 34.16 | — | Imported | 2026-05-27 |
| 188 | MMed-Llama-3-8B (Few-Shot) | 34.14 | — | Imported | 2026-05-27 |
| 189 | Qwen3-4B-Instruct-2507 (Zero-Shot) | 34.06 | — | Imported | 2026-05-27 |
| 190 | HuatuoGPT-o1-70B (Zero-Shot) | 33.9 | — | Imported | 2026-05-27 |
| 191 | Qwen3-8B-Non-Thinking (Zero-Shot) | 33.86 | — | Imported | 2026-05-27 |
| 192 | Llama-3-70B-UltraMedical (Zero-Shot) | 33.4 | — | Imported | 2026-05-27 |
| 193 | Qwen3-8B-Non-Thinking (CoT) | 33.31 | — | Imported | 2026-05-27 |
| 194 | Qwen3-4B-Non-Thinking (Zero-Shot) | 33.29 | — | Imported | 2026-05-27 |
| 195 | Qwen3-4B-Non-Thinking (CoT) | 33.2 | — | Imported | 2026-05-27 |
| 196 | Llama3-OpenBioLLM-8B (Few-Shot) | 33.12 | — | Imported | 2026-05-27 |
| 197 | Llama3-OpenBioLLM-70B (Zero-Shot) | 33.01 | — | Imported | 2026-05-27 |
| 198 | MeLLaMA-70B-chat (Few-Shot) | 32.9 | — | Imported | 2026-05-27 |
| 199 | gemma-4-E2B-it (CoT) | 32.87 | — | Imported | 2026-05-27 |
| 200 | Qwen3-1.7B-Thinking (Few-Shot) | 32.87 | — | Imported | 2026-05-27 |
| 201 | DeepSeek-R1-Distill-Llama-8B (Few-Shot) | 32.86 | — | Imported | 2026-05-27 |
| 202 | Qwen3-32B-Non-Thinking (CoT) | 32.8 | — | Imported | 2026-05-27 |
| 203 | Llama-3.1-Nemotron-70B-Instruct-HF (Zero-Shot) | 32.75 | — | Imported | 2026-05-27 |
| 204 | gemma-4-E2B-it (Zero-Shot) | 32.74 | — | Imported | 2026-05-27 |
| 205 | Phi-4 (CoT) | 32.59 | Phi 4 microsoft-phi-4 | Imported | 2026-05-27 |
| 206 | Biomni-R0-32B-Preview-Thinking (CoT) | 32.54 | — | Imported | 2026-05-27 |
| 207 | HuatuoGPT-o1-70B (CoT) | 32.35 | — | Imported | 2026-05-27 |
| 208 | MeLLaMA-70B-chat (Zero-Shot) | 32.26 | — | Imported | 2026-05-27 |
| 209 | Yi-1.5-34B-Chat-16K (Zero-Shot) | 32.12 | — | Imported | 2026-05-27 |
| 210 | gpt-oss-120b (CoT) | 32.11 | gpt-oss-120b openai-gpt-oss-120b | Imported | 2026-05-27 |
| 211 | meditron-70b (Few-Shot) | 32.09 | — | Imported | 2026-05-27 |
| 212 | QwQ-32B-Preview (Zero-Shot) | 31.74 | — | Imported | 2026-05-27 |
| 213 | Qwen2.5-1.5B-Instruct (Few-Shot) | 31.66 | — | Imported | 2026-05-27 |
| 214 | gpt-35-turbo-0125 (CoT) | 31.63 | GPT-3.5 Turbo openai-gpt-3.5-turbo | Imported | 2026-05-27 |
| 215 | Hulu-Med-14B (CoT) | 31.61 | — | Imported | 2026-05-27 |
| 216 | Mistral-Small-24B-Instruct-2501 (CoT) | 31.59 | Mistral: Mistral Small 3 mistralai-mistral-small-24b-instruct-2501 | Imported | 2026-05-27 |
| 217 | Phi-3.5-mini-instruct (Few-Shot) | 31.33 | — | Imported | 2026-05-27 |
| 218 | Qwen2.5-7B-Instruct (Zero-Shot) | 31.32 | Qwen2.5 7B Instruct qwen-qwen-2.5-7b-instruct | Imported | 2026-05-27 |
| 219 | Mistral-Small-Instruct-2409 (CoT) | 31.17 | — | Imported | 2026-05-27 |
| 220 | MedReason-8B (Few-Shot) | 31.12 | — | Imported | 2026-05-27 |
| 221 | Llama-3.1-8B-UltraMedical (Few-Shot) | 30.96 | — | Imported | 2026-05-27 |
| 222 | AntAngelMed (CoT) | 30.77 | — | Imported | 2026-05-27 |
| 223 | Hulu-Med-7B (Zero-Shot) | 30.71 | — | Imported | 2026-05-27 |
| 224 | Ministral-8B-Instruct-2410 (Zero-Shot) | 30.37 | — | Imported | 2026-05-27 |
| 225 | Qwen2.5-7B-Instruct (CoT) | 30.25 | Qwen2.5 7B Instruct qwen-qwen-2.5-7b-instruct | Imported | 2026-05-27 |
| 226 | gemma-2-9b-it (CoT) | 29.94 | — | Imported | 2026-05-27 |
| 227 | Phi-4-mini-instruct (Few-Shot) | 29.93 | — | Imported | 2026-05-27 |
| 228 | HuatuoGPT-o1-7B (Zero-Shot) | 29.59 | — | Imported | 2026-05-27 |
| 229 | Yi-1.5-34B-Chat-16K (CoT) | 29.57 | — | Imported | 2026-05-27 |
| 230 | Phi-3.5-MoE-instruct (Zero-Shot) | 29.54 | — | Imported | 2026-05-27 |
| 231 | Llama-3-70B-UltraMedical (CoT) | 29.44 | — | Imported | 2026-05-27 |
| 232 | medgemma-4b-it (Zero-Shot) | 29.42 | — | Imported | 2026-05-27 |
| 233 | Llama-3.1-8B-Instruct (CoT) | 29.4 | Llama 3.1 8B Instruct meta-llama-llama-3.1-8b-instruct | Imported | 2026-05-27 |
| 234 | Llama-4-Scout-17B-16E-Instruct (CoT) | 29.38 | Llama 4 Scout meta-llama-llama-4-scout | Imported | 2026-05-27 |
| 235 | MeLLaMA-70B-chat (CoT) | 29.25 | — | Imported | 2026-05-27 |
| 236 | gpt-oss-20b (Zero-Shot) | 29.05 | gpt-oss-20b openai-gpt-oss-20b | Imported | 2026-05-27 |
| 237 | Llama-3.1-8B-Instruct (Zero-Shot) | 28.98 | Llama 3.1 8B Instruct meta-llama-llama-3.1-8b-instruct | Imported | 2026-05-27 |
| 238 | Yi-1.5-9B-Chat-16K (Zero-Shot) | 28.81 | — | Imported | 2026-05-27 |
| 239 | Llama3-OpenBioLLM-70B (CoT) | 28.78 | — | Imported | 2026-05-27 |
| 240 | gemma-3-4b-it (Zero-Shot) | 28.56 | Gemma 3 4B google-gemma-3-4b-it | Imported | 2026-05-27 |
| 241 | medgemma-4b-it (CoT) | 28.5 | — | Imported | 2026-05-27 |
| 242 | DeepSeek-R1-Distill-Llama-8B (Zero-Shot) | 28.48 | — | Imported | 2026-05-27 |
| 243 | gemma-3-4b-it (CoT) | 28.19 | Gemma 3 4B google-gemma-3-4b-it | Imported | 2026-05-27 |
| 244 | Qwen3-1.7B-Thinking (CoT) | 27.71 | — | Imported | 2026-05-27 |
| 245 | Hulu-Med-7B (CoT) | 27.6 | — | Imported | 2026-05-27 |
| 246 | DeepSeek-R1-Distill-Llama-8B (CoT) | 27.34 | — | Imported | 2026-05-27 |
| 247 | Llama-2-70b-chat (Few-Shot) | 27.14 | — | Imported | 2026-05-27 |
| 248 | MeLLaMA-13B-chat (Few-Shot) | 27.01 | — | Imported | 2026-05-27 |
| 249 | HuatuoGPT-o1-7B (CoT) | 26.71 | — | Imported | 2026-05-27 |
| 250 | Qwen2.5-3B-Instruct (Zero-Shot) | 26.59 | — | Imported | 2026-05-27 |
| 251 | Qwen3-0.6B-Non-Thinking (Few-Shot) | 26.31 | — | Imported | 2026-05-27 |
| 252 | Qwen3-1.7B-Thinking (Zero-Shot) | 26.28 | — | Imported | 2026-05-27 |
| 253 | gemma-3-1b-it (Few-Shot) | 25.92 | — | Imported | 2026-05-27 |
| 254 | Ministral-8B-Instruct-2410 (CoT) | 25.91 | — | Imported | 2026-05-27 |
| 255 | HuatuoGPT-o1-8B (Zero-Shot) | 25.86 | — | Imported | 2026-05-27 |
| 256 | Qwen2.5-3B-Instruct (CoT) | 25.44 | — | Imported | 2026-05-27 |
| 257 | Phi-3.5-mini-instruct (Zero-Shot) | 25.41 | — | Imported | 2026-05-27 |
| 258 | Yi-1.5-9B-Chat-16K (CoT) | 25.4 | — | Imported | 2026-05-27 |
| 259 | Phi-3.5-MoE-instruct (CoT) | 25.27 | — | Imported | 2026-05-27 |
| 260 | DeepSeek-R1-Distill-Qwen-7B (Zero-Shot) | 25.27 | — | Imported | 2026-05-27 |
| 261 | Llama-2-70b-chat (Zero-Shot) | 25.15 | — | Imported | 2026-05-27 |
| 262 | gpt-oss-20b (Few-Shot) | 25.14 | gpt-oss-20b openai-gpt-oss-20b | Imported | 2026-05-27 |
| 263 | Qwen3-0.6B-Thinking (Few-Shot) | 24.87 | — | Imported | 2026-05-27 |
| 264 | gpt-oss-20b (CoT) | 24.86 | gpt-oss-20b openai-gpt-oss-20b | Imported | 2026-05-27 |
| 265 | BioMistral-7B (Few-Shot) | 24.66 | — | Imported | 2026-05-27 |
| 266 | Phi-4-mini-instruct (Zero-Shot) | 24.54 | — | Imported | 2026-05-27 |
| 267 | Llama-3.2-1B-Instruct (Few-Shot) | 24.43 | Llama 3.2 1B Instruct meta-llama-llama-3.2-1b-instruct | Imported | 2026-05-27 |
| 268 | Llama-3.1-Nemotron-70B-Instruct-HF (CoT) | 24.09 | — | Imported | 2026-05-27 |
| 269 | Phi-3.5-mini-instruct (CoT) | 23.91 | — | Imported | 2026-05-27 |
| 270 | DeepSeek-R1-Distill-Qwen-7B (CoT) | 23.87 | — | Imported | 2026-05-27 |
| 271 | Llama-2-13b-chat (Few-Shot) | 23.55 | — | Imported | 2026-05-27 |
| 272 | QwQ-32B-Preview (CoT) | 23.31 | — | Imported | 2026-05-27 |
| 273 | DeepSeek-R1-Distill-Qwen-7B (Few-Shot) | 23.07 | — | Imported | 2026-05-27 |
| 274 | Llama-3.2-3B-Instruct (Zero-Shot) | 22.9 | Llama 3.2 3B Instruct meta-llama-llama-3.2-3b-instruct | Imported | 2026-05-27 |
| 275 | Qwen3-1.7B-Non-Thinking (CoT) | 22.76 | — | Imported | 2026-05-27 |
| 276 | Phi-4-mini-instruct (CoT) | 22.5 | — | Imported | 2026-05-27 |
| 277 | HuatuoGPT-o1-8B (CoT) | 22.48 | — | Imported | 2026-05-27 |
| 278 | Llama-2-7b-chat (Few-Shot) | 22.33 | — | Imported | 2026-05-27 |
| 279 | Qwen2.5-1.5B-Instruct (Zero-Shot) | 22.16 | — | Imported | 2026-05-27 |
| 280 | Qwen3-1.7B-Non-Thinking (Zero-Shot) | 21.95 | — | Imported | 2026-05-27 |
| 281 | Llama-3.2-3B-Instruct (CoT) | 21.6 | Llama 3.2 3B Instruct meta-llama-llama-3.2-3b-instruct | Imported | 2026-05-27 |
| 282 | Phi-4-mini-reasoning (Zero-Shot) | 21.31 | — | Imported | 2026-05-27 |
| 283 | Llama-2-13b-chat (Zero-Shot) | 20.91 | — | Imported | 2026-05-27 |
| 284 | MeLLaMA-13B-chat (Zero-Shot) | 20.76 | — | Imported | 2026-05-27 |
| 285 | BioMistral-7B (Zero-Shot) | 20.43 | — | Imported | 2026-05-27 |
| 286 | Qwen3-0.6B-Thinking (Zero-Shot) | 20.38 | — | Imported | 2026-05-27 |
| 287 | MMed-Llama-3-8B (Zero-Shot) | 20.37 | — | Imported | 2026-05-27 |
| 288 | MeLLaMA-13B-chat (CoT) | 20.26 | — | Imported | 2026-05-27 |
| 289 | Llama-3.1-8B-UltraMedical (Zero-Shot) | 20.16 | — | Imported | 2026-05-27 |
| 290 | Phi-4-mini-reasoning (CoT) | 19.79 | — | Imported | 2026-05-27 |
| 291 | OpenThinker3-7B (Zero-Shot) | 19.79 | — | Imported | 2026-05-27 |
| 292 | Qwen2.5-1.5B-Instruct (CoT) | 19.47 | — | Imported | 2026-05-27 |
| 293 | Llama-2-70b-chat (CoT) | 19.02 | — | Imported | 2026-05-27 |
| 294 | Qwen3-0.6B-Thinking (CoT) | 18.95 | — | Imported | 2026-05-27 |
| 295 | OpenThinker3-7B (CoT) | 18.37 | — | Imported | 2026-05-27 |
| 296 | Llama-3.1-8B-UltraMedical (CoT) | 18.34 | — | Imported | 2026-05-27 |
| 297 | MedReason-8B (CoT) | 18.29 | — | Imported | 2026-05-27 |
| 298 | MedReason-8B (Zero-Shot) | 18.17 | — | Imported | 2026-05-27 |
| 299 | meditron-7b (Few-Shot) | 17.04 | — | Imported | 2026-05-27 |
| 300 | Qwen3-0.6B-Non-Thinking (CoT) | 16.59 | — | Imported | 2026-05-27 |
| 301 | Llama-2-7b-chat (Zero-Shot) | 16.47 | — | Imported | 2026-05-27 |
| 302 | Llama-2-13b-chat (CoT) | 16.24 | — | Imported | 2026-05-27 |
| 303 | MMed-Llama-3-8B (CoT) | 16.17 | — | Imported | 2026-05-27 |
| 304 | gemma-3-1b-it (Zero-Shot) | 15.73 | — | Imported | 2026-05-27 |
| 305 | meditron-70b (Zero-Shot) | 15.68 | — | Imported | 2026-05-27 |
| 306 | Phi-4-mini-reasoning (Few-Shot) | 15.51 | — | Imported | 2026-05-27 |
| 307 | Qwen3-0.6B-Non-Thinking (Zero-Shot) | 15.21 | — | Imported | 2026-05-27 |
| 308 | DeepSeek-R1-Distill-Qwen-1.5B (Few-Shot) | 14.94 | — | Imported | 2026-05-27 |
| 309 | DeepSeek-R1-Distill-Qwen-1.5B (Zero-Shot) | 14.26 | — | Imported | 2026-05-27 |
| 310 | Llama3-OpenBioLLM-8B (Zero-Shot) | 14.2 | — | Imported | 2026-05-27 |
| 311 | Llama-2-7b-chat (CoT) | 13.66 | — | Imported | 2026-05-27 |
| 312 | gemma-3-1b-it (CoT) | 13.53 | — | Imported | 2026-05-27 |
| 313 | DeepSeek-R1-Distill-Qwen-1.5B (CoT) | 13.42 | — | Imported | 2026-05-27 |
| 314 | Llama3-OpenBioLLM-8B (CoT) | 13.29 | — | Imported | 2026-05-27 |
| 315 | meditron-70b (CoT) | 13.17 | — | Imported | 2026-05-27 |
| 316 | OpenThinker3-7B (Few-Shot) | 13.01 | — | Imported | 2026-05-27 |
| 317 | Llama-3.2-1B-Instruct (Zero-Shot) | 12.72 | Llama 3.2 1B Instruct meta-llama-llama-3.2-1b-instruct | Imported | 2026-05-27 |
| 318 | Llama-3.2-1B-Instruct (CoT) | 11.86 | Llama 3.2 1B Instruct meta-llama-llama-3.2-1b-instruct | Imported | 2026-05-27 |
| 319 | BioMistral-7B (CoT) | 10.84 | — | Imported | 2026-05-27 |
| 320 | meditron-7b (CoT) | 9.52 | — | Imported | 2026-05-27 |
| 321 | meditron-7b (Zero-Shot) | 9.52 | — | Imported | 2026-05-27 |
No matching rows.