R2MED
R2MED is a reasoning-driven medical retrieval benchmark covering three tasks: Q&A reference retrieval, clinical evidence retrieval, and clinical case retrieval. Scores are reported as nDCG@10.
Rows: 69
Primary metric: average_ndcg_at_10
Sampled: 2026-05-27
Metrics: Avg. nDCG@10, Biology, Bioinformatics, Medical Science, MedXpert-QA, MedQA-Diag, PMC-Treat, PMC-Clin, IIYi-Clin
| Rank | Subject | Avg. nDCG@10 | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | BGE-Reasoner-Embed-0928 | 43.18 | — | Imported | 2026-05-27 |
| 2 | E5-mistral-7b-instruct + ReasonRank(32B) | 42.85 | — | Imported | 2026-05-27 |
| 3 | o3-mini + NV-Embed-v2 | 41.35 | — | Imported | 2026-05-27 |
| 4 | o3-mini + BM25 | 41.01 | — | Imported | 2026-05-27 |
| 5 | HuatuoGPT-o1-70B + NV-Embed-v2 | 39.56 | — | Imported | 2026-05-27 |
| 6 | GPT4o + NV-Embed-v2 | 39.37 | — | Imported | 2026-05-27 |
| 7 | o3-mini + Text-embedding-3-large | 39.09 | — | Imported | 2026-05-27 |
| 8 | DeepSeek-R1-Distill-Llama-70B + NV-Embed-v2 | 38.52 | — | Imported | 2026-05-27 |
| 9 | HuatuoGPT-o1-70B + Text-embedding-3-large | 38.24 | — | Imported | 2026-05-27 |
| 10 | Search-O1 (QwQ-32b) + NV-Embed-v2 | 38.22 | — | Imported | 2026-05-27 |
| 11 | Search-O1 (Qwen3-32b) + NV-Embed-v2 | 38.00 | — | Imported | 2026-05-27 |
| 12 | Search-O1 (Qwen3-32b) + Text-embedding-3-large | 37.87 | — | Imported | 2026-05-27 |
| 13 | GPT4o + Text-embedding-3-large | 37.79 | — | Imported | 2026-05-27 |
| 14 | GPT4o + BM25 | 37.70 | — | Imported | 2026-05-27 |
| 15 | DeepSeek-R1-Distill-Llama-70B + Text-embedding-3-large | 37.36 | — | Imported | 2026-05-27 |
| 16 | Search-O1 (QwQ-32b) + Text-embedding-3-large | 37.30 | — | Imported | 2026-05-27 |
| 17 | Llama3.1-70B-Ins + NV-Embed-v2 | 36.82 | — | Imported | 2026-05-27 |
| 18 | QwQ-32B + NV-Embed-v2 | 36.82 | — | Imported | 2026-05-27 |
| 19 | QwQ-32B + Text-embedding-3-large | 36.51 | — | Imported | 2026-05-27 |
| 20 | DeepSeek-R1-Distill-Qwen-32B + NV-Embed-v2 | 36.08 | — | Imported | 2026-05-27 |
| 21 | Llama3.1-70B-Ins + Text-embedding-3-large | 35.76 | — | Imported | 2026-05-27 |
| 22 | Search-O1 (QwQ-32b) + BM25 | 35.57 | — | Imported | 2026-05-27 |
| 23 | Qwen2.5-32B-Ins + NV-Embed-v2 | 35.34 | — | Imported | 2026-05-27 |
| 24 | Search-R1 (Qwen2.5-7b-it-em-ppo) + NV-Embed-v2 | 35.11 | — | Imported | 2026-05-27 |
| 25 | QwQ-32B + BM25 | 35.03 | — | Imported | 2026-05-27 |
| 26 | DeepSeek-R1-Distill-Qwen-32B + Text-embedding-3-large | 34.68 | — | Imported | 2026-05-27 |
| 27 | Qwen2.5-32B-Ins + Text-embedding-3-large | 34.25 | — | Imported | 2026-05-27 |
| 28 | Search-O1 (Qwen3-32b) + BM25 | 33.84 | — | Imported | 2026-05-27 |
| 29 | HuatuoGPT-o1-70B + BM25 | 33.43 | — | Imported | 2026-05-27 |
| 30 | DeepSeek-R1-Distill-Llama-70B + BM25 | 33.29 | — | Imported | 2026-05-27 |
| 31 | Search-R1 (Qwen2.5-7b-it-em-ppo) + Text-embedding-3-large | 32.89 | — | Imported | 2026-05-27 |
| 32 | Qwen2.5-7B-Ins + NV-Embed-v2 | 32.69 | — | Imported | 2026-05-27 |
| 33 | Llama3.1-70B-Ins + BM25 | 32.40 | — | Imported | 2026-05-27 |
| 34 | Search-R1 (Qwen2.5-3b-it-em-ppo) + NV-Embed-v2 | 31.74 | — | Imported | 2026-05-27 |
| 35 | Qwen2.5-7B-Ins + Text-embedding-3-large | 31.53 | — | Imported | 2026-05-27 |
| 36 | NV-Embed-v2 | 31.43 | — | Imported | 2026-05-27 |
| 37 | o3-mini + BGE-Large-en-v1.5 | 31.29 | — | Imported | 2026-05-27 |
| 38 | GritLM-7B | 31.12 | — | Imported | 2026-05-27 |
| 39 | Qwen2.5-32B-Ins + BM25 | 31.12 | — | Imported | 2026-05-27 |
| 40 | SFR-Embedding-Mistral | 30.65 | — | Imported | 2026-05-27 |
| 41 | Search-R1 (Qwen2.5-3b-it-em-ppo) + Text-embedding-3-large | 30.19 | — | Imported | 2026-05-27 |
| 42 | BMRetriever-7B | 30.18 | — | Imported | 2026-05-27 |
| 43 | DeepSeek-R1-Distill-Qwen-32B + BM25 | 29.05 | — | Imported | 2026-05-27 |
| 44 | HuatuoGPT-o1-70B + BGE-Large-en-v1.5 | 28.98 | — | Imported | 2026-05-27 |
| 45 | Search-O1 (QwQ-32b) + BGE-Large-en-v1.5 | 28.66 | — | Imported | 2026-05-27 |
| 46 | GPT4o + BGE-Large-en-v1.5 | 28.63 | — | Imported | 2026-05-27 |
| 47 | Text-embedding-3-large | 28.57 | — | Imported | 2026-05-27 |
| 48 | DeepSeek-R1-Distill-Llama-70B + BGE-Large-en-v1.5 | 28.41 | — | Imported | 2026-05-27 |
| 49 | Search-O1 (Qwen3-32b) + BGE-Large-en-v1.5 | 28.28 | — | Imported | 2026-05-27 |
| 50 | Llama3.1-70B-Ins + BGE-Large-en-v1.5 | 28.18 | — | Imported | 2026-05-27 |
| 51 | Voyage-3 | 27.34 | — | Imported | 2026-05-27 |
| 52 | QwQ-32B + BGE-Large-en-v1.5 | 26.59 | — | Imported | 2026-05-27 |
| 53 | DeepSeek-R1-Distill-Qwen-32B + BGE-Large-en-v1.5 | 26.40 | — | Imported | 2026-05-27 |
| 54 | Qwen2.5-7B-Ins + BM25 | 26.38 | — | Imported | 2026-05-27 |
| 55 | Qwen2.5-32B-Ins + BGE-Large-en-v1.5 | 26.19 | — | Imported | 2026-05-27 |
| 56 | E5-mistral-7b-instruct | 24.92 | — | Imported | 2026-05-27 |
| 57 | Search-R1 (Qwen2.5-7b-it-em-ppo) + BGE-Large-en-v1.5 | 24.78 | — | Imported | 2026-05-27 |
| 58 | BMRetriever-2B | 24.69 | — | Imported | 2026-05-27 |
| 59 | Search-R1 (Qwen2.5-7b-it-em-ppo) + BM25 | 24.56 | — | Imported | 2026-05-27 |
| 60 | Qwen2.5-7B-Ins + BGE-Large-en-v1.5 | 24.00 | — | Imported | 2026-05-27 |
| 61 | Search-R1 (Qwen2.5-3b-it-em-ppo) + BGE-Large-en-v1.5 | 22.14 | — | Imported | 2026-05-27 |
| 62 | Search-R1 (Qwen2.5-3b-it-em-ppo) + BM25 | 20.03 | — | Imported | 2026-05-27 |
| 63 | InstructOR-XL | 18.13 | — | Imported | 2026-05-27 |
| 64 | BMRetriever-410M | 18.10 | — | Imported | 2026-05-27 |
| 65 | BGE-Large-en-v1.5 | 17.02 | — | Imported | 2026-05-27 |
| 66 | InstructOR-L | 16.21 | — | Imported | 2026-05-27 |
| 67 | BM25 | 15.13 | — | Imported | 2026-05-27 |
| 68 | Contriever | 11.76 | — | Imported | 2026-05-27 |
| 69 | MedCPT | 9.02 | — | Imported | 2026-05-27 |
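
For readers unfamiliar with the metric, here is a minimal sketch of how an nDCG@10 score and a cross-dataset average like the one in the table could be computed. It is an illustration, not the benchmark's official evaluation code. The function names (`dcg_at_k`, `ndcg_at_k`, `average_ndcg`) and the toy inputs are hypothetical. Two assumptions are made: the average is an unweighted mean over the per-dataset scores, and scores are scaled to 0-100 to match the table.

```python
import math
from typing import Dict, List


def dcg_at_k(gains: List[float], k: int = 10) -> float:
    """Discounted cumulative gain over the first k gains (rank 1 has discount log2(2))."""
    return sum(g / math.log2(i + 2) for i, g in enumerate(gains[:k]))


def ndcg_at_k(ranked_ids: List[str], relevance: Dict[str, float], k: int = 10) -> float:
    """nDCG@k for one query.

    ranked_ids: document ids in the order the retriever returned them.
    relevance:  graded relevance per document id; missing ids count as 0.
    """
    gains = [relevance.get(doc_id, 0.0) for doc_id in ranked_ids]
    ideal = sorted(relevance.values(), reverse=True)  # best possible ordering
    idcg = dcg_at_k(ideal, k)
    return dcg_at_k(gains, k) / idcg if idcg > 0 else 0.0


def average_ndcg(per_dataset: Dict[str, float]) -> float:
    """Unweighted mean of per-dataset scores (assumed aggregation)."""
    return sum(per_dataset.values()) / len(per_dataset)


if __name__ == "__main__":
    # Toy query: the single relevant document is returned at rank 2.
    score = ndcg_at_k(["d7", "d3", "d9"], {"d3": 1.0})
    print(round(100 * score, 2))  # scaled to 0-100, as in the table

    # Toy per-dataset scores, not taken from the leaderboard.
    print(average_ndcg({"dataset_a": 40.0, "dataset_b": 30.0}))
```

In practice the per-query scores would first be averaged within each dataset and the dataset means then averaged. Libraries such as `pytrec_eval` are commonly used for the per-query step.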