Open Medical-LLM Leaderboard
Open Life Science AI leaderboard evaluating LLMs on medical QA and medical MMLU tasks, including PubMedQA, MedQA, MedMCQA, and six medical MMLU subjects.
Rows: 185
Primary metric: medical_average
Sampled: 2026-05-06
Metadata
Metrics
Medical average, Task count, PubMedQA, MedQA 4 options, MedMCQA, MMLU Anatomy, MMLU Clinical Knowledge, MMLU College Biology, MMLU College Medicine, MMLU Medical Genetics, MMLU Professional Medicine
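The page does not state how the medical average is derived from the per-task metrics listed above. As a minimal sketch, assuming it is an unweighted mean over whichever of the nine task scores are available (with "Task count" being the number of tasks included), the aggregation might look like the following. The function name `medical_average` and the sample scores are hypothetical and are not taken from the leaderboard.

```python
from statistics import mean
from typing import Optional

# Task names as listed in the Metrics line of this page.
TASKS = [
    "PubMedQA", "MedQA 4 options", "MedMCQA",
    "MMLU Anatomy", "MMLU Clinical Knowledge", "MMLU College Biology",
    "MMLU College Medicine", "MMLU Medical Genetics", "MMLU Professional Medicine",
]


def medical_average(scores: dict[str, Optional[float]]) -> tuple[float, int]:
    """Return (average, task_count) over the tasks that have a score.

    Assumption: an unweighted mean over available tasks. The leaderboard
    itself may aggregate differently.
    """
    available = [scores[t] for t in TASKS if scores.get(t) is not None]
    if not available:
        raise ValueError("no task scores available")
    return round(mean(available), 2), len(available)


# Hypothetical scores for illustration only (not real leaderboard data).
example = {"PubMedQA": 70.0, "MedQA 4 options": 60.0, "MedMCQA": 50.0}
average, task_count = medical_average(example)
print(average, task_count)
```

Returning the task count alongside the average makes it visible when two entries were averaged over different numbers of tasks, which matters when comparing rows.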
| Rank | Subject | Medical average | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | ProbeMedicalYonseiMAILab/medllama3-v20 | 90.01 | — | Imported | 2026-05-06 |
| 2 | aaditya/OpenBioLLMLlama-70B | 86.06 | — | Imported | 2026-05-06 |
| 3 | Med-PaLM 2 (5 Shots) | 84.09 | — | Imported | 2026-05-06 |
| 4 | GPT-4 | 82.97 | GPT-4 openai-gpt-4 | Imported | 2026-05-06 |
| 5 | skumar9/Llama-medx_v3.2 | 75.42 | — | Imported | 2026-05-06 |
| 6 | Flan-PaLM | 74.70 | — | Imported | 2026-05-06 |
| 7 | Jayant9928/orpo_med_v3 | 73.94 | — | Imported | 2026-05-06 |
| 8 | skumar9/Llama-medx_v3.1 | 73.94 | — | Imported | 2026-05-06 |
| 9 | johnsnowlabs/JSL-MedLlama-3-8B-v2.0 | 73.85 | — | Imported | 2026-05-06 |
| 10 | skumar9/Llama-medx_v3 | 73.83 | — | Imported | 2026-05-06 |
| 11 | Jayant9928/orpo_med_v2 | 73.65 | — | Imported | 2026-05-06 |
| 12 | abhinand/Llama-3-OpenBioMed-8B-slerp-v0.3 | 73.56 | — | Imported | 2026-05-06 |
| 13 | lighteternal/Llama3-merge-biomed-8b | 73.55 | — | Imported | 2026-05-06 |
| 14 | abhinand/Llama-3-Galen-8B-32k-v1 | 72.99 | — | Imported | 2026-05-06 |
| 15 | ChenWeiLi/Med-ChimeraLlama-3-8B_SHERP | 72.71 | — | Imported | 2026-05-06 |
| 16 | aaditya/OpenBioLLM-Llama3-8B | 72.50 | — | Imported | 2026-05-06 |
| 17 | Jayant9928/orpo_med_v0 | 72.43 | — | Imported | 2026-05-06 |
| 18 | johnsnowlabs/JSL-MedLlama-3-8B-v1.0 | 72.40 | — | Imported | 2026-05-06 |
| 19 | shanchen/llama3-8B-slerp-med-chinese | 72.34 | — | Imported | 2026-05-06 |
| 20 | Jayant9928/orpo_v2 | 72.29 | — | Imported | 2026-05-06 |
| 21 | winninghealth/WiNGPT2-Llama-3-8B-Base | 72.10 | — | Imported | 2026-05-06 |
| 22 | aaditya/Llama3-OpenBioLLM-8B | 71.73 | — | Imported | 2026-05-06 |
| 23 | ChenWeiLi/Med-ChimeraLlama-3_1k_5_epoch | 71.59 | — | Imported | 2026-05-06 |
| 24 | probemedicalandyonseimailab/medllama3-v5 | 71.49 | — | Imported | 2026-05-06 |
| 25 | adinath/ollama_v9 | 71.46 | — | Imported | 2026-05-06 |
| 26 | timberrific/open-bio-med-merge | 71.33 | — | Imported | 2026-05-06 |
| 27 | probemedicalandyonseimailab/medllama3-v5.1 | 71.19 | — | Imported | 2026-05-06 |
| 28 | ChenWeiLi/MedLlama-3-8B_DARE_v1.0 | 71.06 | — | Imported | 2026-05-06 |
| 29 | probemedicalandyonseimailab/medllama3-v6 | 71.06 | — | Imported | 2026-05-06 |
| 30 | mlabonne/Daredevil-8B-abliterated-dpomix | 70.99 | — | Imported | 2026-05-06 |
| 31 | mlabonne/Daredevil-8B-abliterated | 70.86 | — | Imported | 2026-05-06 |
| 32 | ChenWeiLi/Med-ChimeraLlama-3_10k | 70.80 | — | Imported | 2026-05-06 |
| 33 | Gemini-1.0 | 70.79 | — | Imported | 2026-05-06 |
| 34 | shanchen/llama3-8B-slerp-biomed-chat-chinese | 70.78 | — | Imported | 2026-05-06 |
| 35 | ChenWeiLi/Med-ChimeraLlama-3_1k_10_epoch | 70.74 | — | Imported | 2026-05-06 |
| 36 | shanchen/llama3-8B-slerp-med-chinese2 | 70.71 | — | Imported | 2026-05-06 |
| 37 | winninghealth/WiNGPT2-Llama-3-8B-Chat | 70.57 | — | Imported | 2026-05-06 |
| 38 | HPAI-BSC/Llama3-Aloe-8B-Alpha | 70.46 | — | Imported | 2026-05-06 |
| 39 | probemedicalandyonseimailab/medllama3-v4 | 70.44 | — | Imported | 2026-05-06 |
| 40 | Kukedlc/NeuralLLaMa-3-8b-DT-v0.1 | 70.43 | — | Imported | 2026-05-06 |
| 41 | Kukedlc/NeuralLLaMa-3-8b-ORPO-v0.3 | 70.40 | — | Imported | 2026-05-06 |
| 42 | abhinand/Llama-3-OpenBioMed-8B-dare-ties-v1.0 | 70.28 | — | Imported | 2026-05-06 |
| 43 | ChenWeiLi/Med-ChimeraLlama-3_1k_20_epoch | 70.24 | — | Imported | 2026-05-06 |
| 44 | adinath/ollama_v6 | 70.23 | — | Imported | 2026-05-06 |
| 45 | uygarkurt/llama-3-merged-linear | 70.13 | — | Imported | 2026-05-06 |
| 46 | DeepMount00/Llama-3-8b-Ita | 70.11 | — | Imported | 2026-05-06 |
| 47 | ProbeMedicalYonseiMAILab/medllama3-v16 | 70.10 | — | Imported | 2026-05-06 |
| 48 | mlabonne/ChimeraLlama-3-8B-v3 | 70.05 | — | Imported | 2026-05-06 |
| 49 | johnsnowlabs/JSL-Med-Sft-Llama-3-8B | 70.02 | — | Imported | 2026-05-06 |
| 50 | Danielbrdz/Barcenas-Llama3-8b-ORPO | 69.93 | — | Imported | 2026-05-06 |
| 51 | skumar9/Llama-medx_v0 | 69.91 | — | Imported | 2026-05-06 |
| 52 | meta-llama/Meta-Llama-3-8B | 69.90 | — | Imported | 2026-05-06 |
| 53 | shanchen/llama3-slerp-med | 69.88 | — | Imported | 2026-05-06 |
| 54 | adinath/ollama-3-8B | 69.83 | — | Imported | 2026-05-06 |
| 55 | VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct | 69.76 | — | Imported | 2026-05-06 |
| 56 | adinath/ollama_v5 | 69.75 | — | Imported | 2026-05-06 |
| 57 | probemedicalandyonseimailab/medllama3-v10 | 69.72 | — | Imported | 2026-05-06 |
| 58 | jondurbin/bagel-8b-v1.0 | 69.66 | — | Imported | 2026-05-06 |
| 59 | mlabonne/Llama-3-8B-Instruct-abliterated-dpomix | 69.65 | — | Imported | 2026-05-06 |
| 60 | IBI-CAAI/MELT-llama-2-7b-chat-v0.1 | 69.46 | — | Imported | 2026-05-06 |
| 61 | lightblue/suzume-llama-3-8B-multilingual | 69.30 | — | Imported | 2026-05-06 |
| 62 | johnsnowlabs/JSL-MedMNX-7B-v2.0 | 69.30 | — | Imported | 2026-05-06 |
| 63 | adinath/lft_8b | 69.30 | — | Imported | 2026-05-06 |
| 64 | qnguyen3/Master-Yi-9B | 69.07 | — | Imported | 2026-05-06 |
| 65 | ruslanmv/ai-medical-model-32bit | 69.00 | — | Imported | 2026-05-06 |
| 66 | meta-llama/Meta-Llama-3-8B-Instruct | 68.99 | Llama 3 8B Instruct meta-llama-llama-3-8b-instruct | Imported | 2026-05-06 |
| 67 | abhishekchohan/Yi-9B-Forest-DPO-v1.0 | 68.93 | — | Imported | 2026-05-06 |
| 68 | probemedicalandyonseimailab/medllama3-v11 | 68.93 | — | Imported | 2026-05-06 |
| 69 | adinath/ollama_v7 | 68.90 | — | Imported | 2026-05-06 |
| 70 | 01-ai/Yi-1.5-9B | 68.89 | — | Imported | 2026-05-06 |
| 71 | failspy/Meta-Llama-3-8B-Instruct-abliterated-v3 | 68.84 | — | Imported | 2026-05-06 |
| 72 | Jayant9928/tnayajv2.0 | 68.62 | — | Imported | 2026-05-06 |
| 73 | skumar9/Llama-medx_v2 | 68.51 | — | Imported | 2026-05-06 |
| 74 | cognitivecomputations/Llama-3-8B-Instruct-abliterated-v2 | 68.06 | — | Imported | 2026-05-06 |
| 75 | 01-ai/Yi-1.5-9B-32K | 67.73 | — | Imported | 2026-05-06 |
| 76 | GPT-3.5 Turbo 1106 | 67.69 | GPT-3.5 Turbo openai-gpt-3.5-turbo | Imported | 2026-05-06 |
| 77 | gradientai/Llama-3-8B-Instruct-262k | 67.67 | — | Imported | 2026-05-06 |
| 78 | abacusai/Liberated-Qwen1.5-14B | 67.66 | — | Imported | 2026-05-06 |
| 79 | SrikanthChellappa/Collaiborator-MEDLLM-Llama-3-8B | 67.52 | — | Imported | 2026-05-06 |
| 80 | SrikanthChellappa/Collaiborator-MEDLLM-Llama-3-8B-v2-5 | 67.47 | — | Imported | 2026-05-06 |
| 81 | SrikanthChellappa/Collaiborator-MEDLLM-Llama-3-8B-v2-6 | 67.47 | — | Imported | 2026-05-06 |
| 82 | johnsnowlabs/JSL-MedMNX-7B | 67.45 | — | Imported | 2026-05-06 |
| 83 | SrikanthChellappa/Collaiborator-MEDLLM-Llama-3-8B-v2-1 | 67.42 | — | Imported | 2026-05-06 |
| 84 | Locutusque/Llama-3-Orca-1.0-8B | 67.17 | — | Imported | 2026-05-06 |
| 85 | johnsnowlabs/JSL-MedMNX-7B-SFT | 67.11 | — | Imported | 2026-05-06 |
| 86 | SrikanthChellappa/Collaiborator-MEDLLM-Llama-3-8B-v2-4 | 67.08 | — | Imported | 2026-05-06 |
| 87 | ruslanmv/Medical-Llama3-8B | 66.95 | — | Imported | 2026-05-06 |
| 88 | abacusai/Llama-3-Smaug-8B | 66.88 | — | Imported | 2026-05-06 |
| 89 | SrikanthChellappa/Collaiborator-MEDLLM-Llama-3-8B-v2-3 | 66.73 | — | Imported | 2026-05-06 |
| 90 | SrikanthChellappa/Collaiborator-MEDLLM-Llama-3-8B-v1 | 66.67 | — | Imported | 2026-05-06 |
| 91 | collaiborateorg/Collaiborator-MEDLLM-Llama-3-8B-v1 | 66.26 | — | Imported | 2026-05-06 |
| 92 | Nexusflow/Starling-LM-7B-beta | 66.25 | — | Imported | 2026-05-06 |
| 93 | NotAiLOL/Yi-1.5-dolphin-9B | 65.60 | — | Imported | 2026-05-06 |
| 94 | shanchen/llama3-8B-slerp-med-262k | 65.56 | — | Imported | 2026-05-06 |
| 95 | refine-ai/Power-Llama-3-7B-Instruct | 65.45 | — | Imported | 2026-05-06 |
| 96 | adinath/lft_8b_v2 | 65.26 | — | Imported | 2026-05-06 |
| 97 | Artples/L-MChat-7b | 65.12 | — | Imported | 2026-05-06 |
| 98 | Jayant9928/tnayaj | 65.10 | — | Imported | 2026-05-06 |
| 99 | vicgalle/CarbonBeagle-11B-truthy | 64.96 | — | Imported | 2026-05-06 |
| 100 | BioMistral/BioMistral-MedMNX | 64.60 | — | Imported | 2026-05-06 |
| 101 | shadowml/BeagSake-7B | 64.54 | — | Imported | 2026-05-06 |
| 102 | epfl-llm/meditron-70b | 64.49 | — | Imported | 2026-05-06 |
| 103 | unsloth/gemma-7b | 64.18 | — | Imported | 2026-05-06 |
| 104 | SeaLLMs/SeaLLM-7B-v2.5 | 64.10 | — | Imported | 2026-05-06 |
| 105 | kekmodel/StopCarbon-10.7B-v5 | 63.99 | — | Imported | 2026-05-06 |
| 106 | invalid-coder/Sakura-SOLAR-Instruct-CarbonVillain-en-10.7B-v2-slerp | 63.97 | — | Imported | 2026-05-06 |
| 107 | upstage/SOLAR-10.7B-Instruct-v1.0 | 63.94 | — | Imported | 2026-05-06 |
| 108 | MoaData/Myrrh_solar_10.7b_3.0 | 63.91 | — | Imported | 2026-05-06 |
| 109 | jeonsworld/CarbonVillain-en-10.7B-v4 | 63.85 | — | Imported | 2026-05-06 |
| 110 | Ppoyaa/Lumina-3.5 | 63.72 | — | Imported | 2026-05-06 |
| 111 | NotAiLOL/Med-Yi-1.5-9B | 63.71 | — | Imported | 2026-05-06 |
| 112 | BioMistral/BioMistral-DARE-NS | 63.69 | — | Imported | 2026-05-06 |
| 113 | google/gemma-7b | 63.62 | — | Imported | 2026-05-06 |
| 114 | eldogbbhed/Peagle-9b | 63.51 | — | Imported | 2026-05-06 |
| 115 | mlabonne/NeuralMonarch-7B | 63.23 | — | Imported | 2026-05-06 |
| 116 | mlabonne/AlphaMonarch-7B | 63.18 | — | Imported | 2026-05-06 |
| 117 | lemon-mint/gemma-ko-7b-instruct-v0.62 | 63.14 | — | Imported | 2026-05-06 |
| 118 | Locutusque/Hercules-3.1-Mistral-7B | 63.09 | — | Imported | 2026-05-06 |
| 119 | mistralai/Mistral-7B-v0.1 | 62.85 | — | Imported | 2026-05-06 |
| 120 | NousResearch/Hermes-2-Pro-Mistral-7B | 62.55 | — | Imported | 2026-05-06 |
| 121 | NousResearch/Nous-Hermes-2-Mistral-7B-DPO | 62.30 | — | Imported | 2026-05-06 |
| 122 | cognitivecomputations/dolphin-2.9.1-llama-3-8b | 61.62 | — | Imported | 2026-05-06 |
| 123 | VAGOsolutions/SauerkrautLM-Gemma-7b | 61.58 | — | Imported | 2026-05-06 |
| 124 | BioMistral/BioMistral-7B-Zephyr-Beta-SLERP | 61.52 | — | Imported | 2026-05-06 |
| 125 | HuggingFaceH4/zephyr-7b-beta | 61.33 | — | Imported | 2026-05-06 |
| 126 | TIGER-Lab/MAmmoTH2-8B-Plus | 61.29 | — | Imported | 2026-05-06 |
| 127 | johnsnowlabs/BioLing-7B-Dare | 61.19 | — | Imported | 2026-05-06 |
| 128 | FreedomIntelligence/Apollo-6B | 60.84 | — | Imported | 2026-05-06 |
| 129 | Kabster/BioMistral-Zephyr-Beta-SLERP | 60.65 | — | Imported | 2026-05-06 |
| 130 | Kabster/Bio-Mistralv2-Squared | 60.48 | — | Imported | 2026-05-06 |
| 131 | skfrost19/BioMistralMerged | 60.32 | — | Imported | 2026-05-06 |
| 132 | Qwen/Qwen1.5-7B | 60.06 | — | Imported | 2026-05-06 |
| 133 | FreedomIntelligence/Apollo-7B | 60.00 | — | Imported | 2026-05-06 |
| 134 | ik28/MedMistral-instruct | 59.67 | — | Imported | 2026-05-06 |
| 135 | BioMistral/BioMistral-7B-SLERP | 59.58 | — | Imported | 2026-05-06 |
| 136 | BioMistral/BioMistral-7B-DARE | 59.45 | — | Imported | 2026-05-06 |
| 137 | Qwen/Qwen1.5-7B-Chat | 58.57 | — | Imported | 2026-05-06 |
| 138 | OpenModels4all/gemma-1.1-7b-it | 58.37 | — | Imported | 2026-05-06 |
| 139 | google/gemma-1.1-7b-it | 58.07 | — | Imported | 2026-05-06 |
| 140 | medalpaca/medalpaca-7b | 58.03 | — | Imported | 2026-05-06 |
| 141 | lemon-mint/gemma-7b-openhermes-v0.80 | 57.54 | — | Imported | 2026-05-06 |
| 142 | BioMistral/BioMistral-7B-BnB.8 | 56.91 | — | Imported | 2026-05-06 |
| 143 | BioMistral/BioMistral-7B | 56.36 | — | Imported | 2026-05-06 |
| 144 | BioMistral/BioMistral-7B-TIES | 56.34 | — | Imported | 2026-05-06 |
| 145 | mistralai/Mistral-7B-Instruct-v0.1 | 56.18 | Mistral: Mistral 7B Instruct v0.1 mistralai-mistral-7b-instruct-v0.1 | Imported | 2026-05-06 |
| 146 | OEvortex/MediKAI | 54.51 | — | Imported | 2026-05-06 |
| 147 | johnsnowlabs/JSL-MedPhi2-2.7B | 54.36 | — | Imported | 2026-05-06 |
| 148 | unsloth/gemma-7b-it | 51.72 | — | Imported | 2026-05-06 |
| 149 | Writer/palmyra-med-20b | 51.32 | — | Imported | 2026-05-06 |
| 150 | lmsys/vicuna-7b-v1.5 | 50.68 | — | Imported | 2026-05-06 |
| 151 | CohereForAI/aya-23-8B | 50.21 | — | Imported | 2026-05-06 |
| 152 | AdaptLLM/medicine-chat | 50.06 | — | Imported | 2026-05-06 |
| 153 | ghost-x/ghost-7b-alpha | 49.03 | — | Imported | 2026-05-06 |
| 154 | LeoLM/leo-mistral-hessianai-7b | 48.32 | — | Imported | 2026-05-06 |
| 155 | varox34/Bio-Saul-Dolphin-Beagle-Breadcrumbs | 46.73 | — | Imported | 2026-05-06 |
| 156 | FreedomIntelligence/Apollo-2B | 43.05 | — | Imported | 2026-05-06 |
| 157 | microsoft/phi-1_5 | 40.13 | — | Imported | 2026-05-06 |
| 158 | OEvortex/EMO-2B | 39.48 | — | Imported | 2026-05-06 |
| 159 | FreedomIntelligence/Apollo-0.5B | 39.13 | — | Imported | 2026-05-06 |
| 160 | IBI-CAAI/MELT-TinyLlama-1.1B-Chat-v1.0 | 37.44 | — | Imported | 2026-05-06 |
| 161 | unsloth/gemma-2b | 34.91 | — | Imported | 2026-05-06 |
| 162 | google/gemma-2b | 34.32 | — | Imported | 2026-05-06 |
| 163 | stabilityai/stablelm-2-1_6b | 33.77 | — | Imported | 2026-05-06 |
| 164 | google/recurrentgemma-2b | 32.94 | — | Imported | 2026-05-06 |
| 165 | EleutherAI/pythia-2.8b | 31.24 | — | Imported | 2026-05-06 |
| 166 | openai-community/gpt2-xl | 30.54 | — | Imported | 2026-05-06 |
| 167 | medicalai/ClinicalGPT-base-zh | 30.36 | — | Imported | 2026-05-06 |
| 168 | EleutherAI/pythia-2.8b-deduped | 30.24 | — | Imported | 2026-05-06 |
| 169 | tiiuae/falcon-7b-instruct | 29.96 | — | Imported | 2026-05-06 |
| 170 | probemedicalandyonseimailab/Llama-8B-1807 | 29.87 | — | Imported | 2026-05-06 |
| 171 | tiiuae/falcon-7b | 29.85 | — | Imported | 2026-05-06 |
| 172 | EleutherAI/gpt-neo-2.7B | 29.62 | — | Imported | 2026-05-06 |
| 173 | EleutherAI/pythia-1.4b | 29.57 | — | Imported | 2026-05-06 |
| 174 | TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T | 29.41 | — | Imported | 2026-05-06 |
| 175 | EleutherAI/pythia-1b | 28.97 | — | Imported | 2026-05-06 |
| 176 | facebook/opt-2.7b | 28.19 | — | Imported | 2026-05-06 |
| 177 | EleutherAI/pythia-1.4b-deduped | 28.08 | — | Imported | 2026-05-06 |
| 178 | EleutherAI/pythia-1b-deduped | 28.02 | — | Imported | 2026-05-06 |
| 179 | BEE-spoke-data/mega-ar-126m-4k | 27.80 | — | Imported | 2026-05-06 |
| 180 | manupande21/GPT2_PMC | 27.67 | — | Imported | 2026-05-06 |
| 181 | health360/Healix-1.1B-V1-Chat-dDPO | 27.46 | — | Imported | 2026-05-06 |
| 182 | pszemraj/mega-ar-525m-v0.07-ultraTBfw | 27.39 | — | Imported | 2026-05-06 |
| 183 | openai-community/gpt2 | 26.97 | — | Imported | 2026-05-06 |
| 184 | pszemraj/mega-ar-350m-v0.13 | 26.72 | — | Imported | 2026-05-06 |
| 185 | HiTZ/Medical-mT5-large | 25.41 | — | Imported | 2026-05-06 |