BioASQ

BioASQ: Evaluates clinical, biomedical, medical-exam, coding, or healthcare-document reasoning.

318rows
mean_exact_answer_scoreprimary metric
2026-05-27sampled

Metadata

Metrics

Mean exact-answer headline score, Yes/No Accuracy, F1 Yes, F1 No, Yes/No Macro F1, Factoid Strict Accuracy, Factoid Lenient Accuracy, Factoid MRR, List Mean Precision, List Recall, List F-Measure

Latest Results

Rows are parsed from the public BioASQ 14b Phase B exact-answer result tables for test batches 1-4. Scores are per system submission and preserve the source test batch in metadata.

Rank Subject Mean exact-answer headline score Model Match Provenance Sampled
1 DMIS_MES_TEST_1 0.7415 Imported 2026-05-27
2 DMIS_MES_TEST_2 0.7415 Imported 2026-05-27
3 DMIS_MES_TEST_3 0.7415 Imported 2026-05-27
4 DMIS_MES_TEST_4 0.7415 Imported 2026-05-27
5 DMIS_MES_TEST_5 0.7415 Imported 2026-05-27
6 Bio26NIA 0.7075 Imported 2026-05-27
7 CSA-IISR 5st 0.690033 Imported 2026-05-27
8 CSA-IISR 1st 0.684967 Imported 2026-05-27
9 CSA-IISR 4th 0.678967 Imported 2026-05-27
10 CSA-IISR 2nd 0.674867 Imported 2026-05-27
11 CSA-IISR 3rd 0.674867 Imported 2026-05-27
12 CSA-IISR 2nd 0.6735 Imported 2026-05-27
13 CSA-IISR 1st 0.672067 Imported 2026-05-27
14 MedQA-3 0.671633 Imported 2026-05-27
15 ku_dmis_3 0.670367 Imported 2026-05-27
16 SATO 0.668267 Imported 2026-05-27
17 ku_dmis_5 0.664567 Imported 2026-05-27
18 dictycite-max-rew-sl 0.6587 Imported 2026-05-27
19 Another 0.653967 Imported 2026-05-27
20 MedQA-1 0.6523 Imported 2026-05-27
21 dmiip2024_4 0.648667 Imported 2026-05-27
22 bioinfo-3 0.647367 Imported 2026-05-27
23 MedQA-5 0.646433 Imported 2026-05-27
24 ku_dmis_2 0.644367 Imported 2026-05-27
25 dictycite-snippet 0.639867 Imported 2026-05-27
26 CSA-IISR 3rd 0.638933 Imported 2026-05-27
27 RMC_2 0.638267 Imported 2026-05-27
28 Another 0.636233 Imported 2026-05-27
29 multi-stage rank&llm 0.6359 Imported 2026-05-27
30 dictycite-baseline 0.635567 Imported 2026-05-27
31 dmiip2024_2 0.634467 Imported 2026-05-27
32 dictycite-max-rew-sl 0.631733 Imported 2026-05-27
33 dmiip2024_3 0.6301 Imported 2026-05-27
34 lean_rag_ft 0.629933 Imported 2026-05-27
35 IR_J-1 0.629633 Imported 2026-05-27
36 IR_J-4 0.628167 Imported 2026-05-27
37 CSA-IISR 4th 0.627733 Imported 2026-05-27
38 EP-2 0.622967 Imported 2026-05-27
39 dmiip2024_1 0.618033 Imported 2026-05-27
40 ku_dmis_4 0.618 Imported 2026-05-27
41 ku_dmis_5 0.618 Imported 2026-05-27
42 ku_dmis_3 0.617733 Imported 2026-05-27
43 EP-4 0.6168 Imported 2026-05-27
44 DMIS_MES_TEST_1 0.613933 Imported 2026-05-27
45 DMIS_MES_TEST_2 0.613933 Imported 2026-05-27
46 DMIS_MES_TEST_3 0.613933 Imported 2026-05-27
47 DMIS_MES_TEST_4 0.613933 Imported 2026-05-27
48 DMIS_MES_TEST_5 0.613933 Imported 2026-05-27
49 EHM-9 0.613867 Imported 2026-05-27
50 ku_dmis 0.613333 Imported 2026-05-27
51 dmiip2024_1 0.612867 Imported 2026-05-27
52 Fleming-2 0.611833 Imported 2026-05-27
53 Dif-C 0.6106 Imported 2026-05-27
54 IR_J-3 0.609633 Imported 2026-05-27
55 EP-3 0.609567 Imported 2026-05-27
56 EP-1 0.609467 Imported 2026-05-27
57 Bio26NIA 0.609 Imported 2026-05-27
58 ku_dmis_2 0.6086 Imported 2026-05-27
59 dictycite-baseline 0.607633 Imported 2026-05-27
60 dictycite-snippet 0.607633 Imported 2026-05-27
61 IR_J-1 0.6041 Imported 2026-05-27
62 multi-stage rank&ll 0.6025 Imported 2026-05-27
63 ku_dmis_4 0.601267 Imported 2026-05-27
64 IR_J-5 0.6003 Imported 2026-05-27
65 EP-2 0.5987 Imported 2026-05-27
66 MedQA-1 0.598567 Imported 2026-05-27
67 MedQA-2 0.5962 Imported 2026-05-27
68 dmiip2024_4 0.5957 Imported 2026-05-27
69 bioinfo-4 0.595133 Imported 2026-05-27
70 IR_J-5 0.594733 Imported 2026-05-27
71 UR-IW-1 0.5934 Imported 2026-05-27
72 MedQA-4 0.592333 Imported 2026-05-27
73 UR-IW-4 0.5922 Imported 2026-05-27
74 1 system 0.591467 Imported 2026-05-27
75 Fleming-1 0.591 Imported 2026-05-27
76 lean_rag 0.589367 Imported 2026-05-27
77 Fleming-1 0.5883 Imported 2026-05-27
78 CS 1st submit 0.588167 Imported 2026-05-27
79 CSA-IISR 1st 0.588167 Imported 2026-05-27
80 IR_Y-4 0.5881 Imported 2026-05-27
81 UR-IW-5 0.587633 Imported 2026-05-27
82 IR_J-3 0.587033 Imported 2026-05-27
83 bioinfo-2 0.5868 Imported 2026-05-27
84 CSA-IISR 5st 0.586567 Imported 2026-05-27
85 UR-IW-2 0.586567 Imported 2026-05-27
86 multi-stage rank&llm 0.586033 Imported 2026-05-27
87 MedQA-2 0.584733 Imported 2026-05-27
88 ku_dmis 0.584567 Imported 2026-05-27
89 MedQA-3 0.584567 Imported 2026-05-27
90 MedQA-5 0.584567 Imported 2026-05-27
91 Fleming-4 0.584033 Imported 2026-05-27
92 CSA-IISR 3rd 0.583933 Imported 2026-05-27
93 CSA-IISR 2nd 0.583533 Imported 2026-05-27
94 MedQA-1 0.582167 Imported 2026-05-27
95 EP-5 0.582067 Imported 2026-05-27
96 lean_rag_ft_sparse 0.5819 Imported 2026-05-27
97 MedQA-4 0.581833 Imported 2026-05-27
98 MedQA-5 0.581833 Imported 2026-05-27
99 CSA-IISR 1st 0.581 Imported 2026-05-27
100 IR_J-2 0.580667 Imported 2026-05-27
101 dmiip2024_2 0.579967 Imported 2026-05-27
102 ku_dmis 0.579267 Imported 2026-05-27
103 bioinfo-1 0.579067 Imported 2026-05-27
104 IR_Y-5 0.578533 Imported 2026-05-27
105 CSA-IISR 4th 0.5777 Imported 2026-05-27
106 CSA-IISR 3rd 0.5776 Imported 2026-05-27
107 RMC_2 0.577567 Imported 2026-05-27
108 IR_Y-1 0.577533 Imported 2026-05-27
109 MedQA-3 0.577133 Imported 2026-05-27
110 dictycite-max-rew-sl 0.577033 Imported 2026-05-27
111 dictycite-baseline 0.5767 Imported 2026-05-27
112 dmiip2024_4 0.5764 Imported 2026-05-27
113 EP-3 0.575233 Imported 2026-05-27
114 NewM 0.574433 Imported 2026-05-27
115 CSA-IISR 2nd 0.574067 Imported 2026-05-27
116 bioinfo-4 0.572533 Imported 2026-05-27
117 EP-1 0.572267 Imported 2026-05-27
118 dmiip2024 0.571867 Imported 2026-05-27
119 Bio26NIA 0.571767 Imported 2026-05-27
120 ku_dmis_4 0.5717 Imported 2026-05-27
121 ku_dmis_5 0.5717 Imported 2026-05-27
122 Dif-C 0.571467 Imported 2026-05-27
123 Another 0.570233 Imported 2026-05-27
124 dictycite-baseline 0.569333 Imported 2026-05-27
125 Gen-Doc 0.569133 Imported 2026-05-27
126 LLM Biomedical QA 0.568967 Imported 2026-05-27
127 LLM Biomedical QA 0.568167 Imported 2026-05-27
128 EP-2 0.567533 Imported 2026-05-27
129 EP-3 0.567533 Imported 2026-05-27
130 bioinfo-0 0.567233 Imported 2026-05-27
131 pancras_crag 0.566967 Imported 2026-05-27
132 pancras_naive 0.566967 Imported 2026-05-27
133 2 system 0.566033 Imported 2026-05-27
134 5 system 0.564967 Imported 2026-05-27
135 pancras_crag 0.563967 Imported 2026-05-27
136 pancras_naive 0.563967 Imported 2026-05-27
137 MedQA-2 0.563633 Imported 2026-05-27
138 EP-1 0.5632 Imported 2026-05-27
139 Fleming-3 0.5632 Imported 2026-05-27
140 Dif-C 0.562867 Imported 2026-05-27
141 IR_Y-3 0.5627 Imported 2026-05-27
142 Bio26NIA 0.561433 Imported 2026-05-27
143 IR_J-4 0.561067 Imported 2026-05-27
144 UR-IW-4 0.5606 Imported 2026-05-27
145 dmiip2024_2 0.560233 Imported 2026-05-27
146 MedQA-4 0.559733 Imported 2026-05-27
147 UR-IW-3 0.559733 Imported 2026-05-27
148 bioinfo-2 0.5593 Imported 2026-05-27
149 MedQA-4 0.558333 Imported 2026-05-27
150 dmiip2024_3 0.558233 Imported 2026-05-27
151 IR_J-2 0.558 Imported 2026-05-27
152 UR-IW-2 0.5571 Imported 2026-05-27
153 ku_dmis 0.557033 Imported 2026-05-27
154 UR-IW-3 0.556733 Imported 2026-05-27
155 UR-IW-4 0.556567 Imported 2026-05-27
156 llama for 14b b 0.55615 Imported 2026-05-27
157 config-2 0.555833 Imported 2026-05-27
158 IR_J-5 0.554667 Imported 2026-05-27
159 EP-4 0.554 Imported 2026-05-27
160 MedQA-3 0.553333 Imported 2026-05-27
161 ku_dmis_3 0.553033 Imported 2026-05-27
162 UR-IW-5 0.552767 Imported 2026-05-27
163 dmiip2024 0.551967 Imported 2026-05-27
164 IR_J-2 0.551633 Imported 2026-05-27
165 EP-5 0.551533 Imported 2026-05-27
166 EP-4 0.550733 Imported 2026-05-27
167 IR_J-4 0.549833 Imported 2026-05-27
168 CSA-IISR 4th 0.548933 Imported 2026-05-27
169 dmiip2024_3 0.5485 Imported 2026-05-27
170 UR-IW-1 0.5485 Imported 2026-05-27
171 dmiip2024_3 0.548433 Imported 2026-05-27
172 EP-3 0.5484 Imported 2026-05-27
173 UR-IW-2 0.548333 Imported 2026-05-27
174 bioinfo-3 0.547133 Imported 2026-05-27
175 bioinfo-0 0.545533 Imported 2026-05-27
176 MedQA-5 0.544933 Imported 2026-05-27
177 EP-5 0.5444 Imported 2026-05-27
178 bm25 + splade 0.543867 Imported 2026-05-27
179 pancras_crag 0.543633 Imported 2026-05-27
180 pancras_naive 0.543633 Imported 2026-05-27
181 dmiip2024_2 0.543567 Imported 2026-05-27
182 dmiip2024_1 0.543233 Imported 2026-05-27
183 ossllm 0.542833 Imported 2026-05-27
184 asmalltrialsystem 0.542633 Imported 2026-05-27
185 ossllm 0.542633 Imported 2026-05-27
186 UR-IW-1 0.5415 Imported 2026-05-27
187 lean_rag 0.5412 Imported 2026-05-27
188 pancras_crag 0.541133 Imported 2026-05-27
189 pancras_naive 0.541133 Imported 2026-05-27
190 EP-1 0.540633 Imported 2026-05-27
191 CSA-IISR 5st 0.540067 Imported 2026-05-27
192 config-1 0.538867 Imported 2026-05-27
193 DS@GT-BioASQ 0.538267 Imported 2026-05-27
194 IR_J-4 0.537933 Imported 2026-05-27
195 bioinfo-4 0.537767 Imported 2026-05-27
196 IR_J-3 0.5376 Imported 2026-05-27
197 lean_rag 0.536967 Imported 2026-05-27
198 FinalQwen 0.53565 Imported 2026-05-27
199 bioinfo-3 0.5353 Imported 2026-05-27
200 DMIS_MES_TEST_5 0.535167 Imported 2026-05-27
201 dmiip2024_1 0.5351 Imported 2026-05-27
202 bioinfo-1 0.534933 Imported 2026-05-27
203 EP-2 0.534733 Imported 2026-05-27
204 DMISTeam3 0.534333 Imported 2026-05-27
205 DMIS_MES_TEST_2 0.5343 Imported 2026-05-27
206 IR_J-5 0.5333 Imported 2026-05-27
207 Fleming-1 0.5323 Imported 2026-05-27
208 DMIS_MES_TEST_1 0.532267 Imported 2026-05-27
209 DMIS_MES_TEST_3 0.532267 Imported 2026-05-27
210 DMIS_MES_TEST_4 0.532267 Imported 2026-05-27
211 MedQA-1 0.5322 Imported 2026-05-27
212 UR-IW-3 0.5322 Imported 2026-05-27
213 13b_phase_a 0.530933 Imported 2026-05-27
214 Another 0.5307 Imported 2026-05-27
215 dmiip2024_4 0.530333 Imported 2026-05-27
216 dmiip2024 0.5302 Imported 2026-05-27
217 UR-IW-1 0.529433 Imported 2026-05-27
218 bioinfo-1 0.529367 Imported 2026-05-27
219 Organization name 0.528933 Imported 2026-05-27
220 ku_dmis_2 0.5273 Imported 2026-05-27
221 4 system 0.526233 Imported 2026-05-27
222 bioinfo-2 0.525233 Imported 2026-05-27
223 IR_J-3 0.524833 Imported 2026-05-27
224 Dif-C 0.5243 Imported 2026-05-27
225 Fleming-1 0.523267 Imported 2026-05-27
226 13b-1 0.5226 Imported 2026-05-27
227 UR-IW-4 0.5196 Imported 2026-05-27
228 Biomedical QA s3 0.518333 Imported 2026-05-27
229 bioinfo-0 0.517833 Imported 2026-05-27
230 RMC_2 0.517133 Imported 2026-05-27
231 Biomedical QA s.4 0.516167 Imported 2026-05-27
232 Biomedical QA system 0.514733 Imported 2026-05-27
233 lean_rag_ft_sparse 0.514533 Imported 2026-05-27
234 qwen 0.514167 Imported 2026-05-27
235 UR-IW-3 0.514 Imported 2026-05-27
236 Finalcorrected 0.513833 Imported 2026-05-27
237 dmiip2024 0.513533 Imported 2026-05-27
238 Biomedical QA system 0.5134 Imported 2026-05-27
239 IR_J-1 0.511767 Imported 2026-05-27
240 IR_J-1 0.511167 Imported 2026-05-27
241 UR-IW-5 0.507367 Imported 2026-05-27
242 mckpt1 0.505633 Imported 2026-05-27
243 IR_Y-5 0.5052 Imported 2026-05-27
244 IR_Y-5 0.503867 Imported 2026-05-27
245 Biomedical QA s3 0.503533 Imported 2026-05-27
246 IR_Y-3 0.5024 Imported 2026-05-27
247 WM Licensing Oracle 0.502233 Imported 2026-05-27
248 Biomedical QA s.4 0.500367 Imported 2026-05-27
249 RMC_2 0.499533 Imported 2026-05-27
250 MedQA-2 0.499 Imported 2026-05-27
251 IR_Y-2 0.497 Imported 2026-05-27
252 bioinfo-0 0.4969 Imported 2026-05-27
253 IR_Y-1 0.495767 Imported 2026-05-27
254 WM Licensing Oracle 0.4937 Imported 2026-05-27
255 IR_Y-4 0.492267 Imported 2026-05-27
256 EP-4 0.4914 Imported 2026-05-27
257 DMIS_MES_TEST_1 0.490767 Imported 2026-05-27
258 DMIS_MES_TEST_2 0.490767 Imported 2026-05-27
259 DMIS_MES_TEST_3 0.490767 Imported 2026-05-27
260 DMIS_MES_TEST_4 0.490767 Imported 2026-05-27
261 UR-IW-2 0.4892 Imported 2026-05-27
262 DMIS_MES_TEST_5 0.488 Imported 2026-05-27
263 h-nlp-autob-medcpt 0.487467 Imported 2026-05-27
264 bioinfo-1 0.4856 Imported 2026-05-27
265 IR_Y-2 0.484933 Imported 2026-05-27
266 porties-llama3-base 0.4848 Imported 2026-05-27
267 3 system 0.482833 Imported 2026-05-27
268 UR-IW-5 0.4827 Imported 2026-05-27
269 EP-5 0.481333 Imported 2026-05-27
270 h-nlp-autob-medcpt 0.475767 Imported 2026-05-27
271 health-nlp-4 0.47365 Imported 2026-05-27
272 "RMC_1" 0.472267 Imported 2026-05-27
273 IR_Y-4 0.470533 Imported 2026-05-27
274 health-nlp-3 0.4702 Imported 2026-05-27
275 ubuntu 0.469967 Imported 2026-05-27
276 DS@GT-BioASQ 0.4681 Imported 2026-05-27
277 DSGTBioasq 0.4681 Imported 2026-05-27
278 "RMC_1" 0.460867 Imported 2026-05-27
279 agentic graph 0.4578 Imported 2026-05-27
280 Biomedical QA s. v2 0.455133 Imported 2026-05-27
281 health-nlp-3 0.454533 Imported 2026-05-27
282 IR_Y-4 0.453833 Imported 2026-05-27
283 IR_Y-3 0.4524 Imported 2026-05-27
284 mckpt2 0.4514 Imported 2026-05-27
285 h-nlp-autob-medcpt 0.449267 Imported 2026-05-27
286 IR_Y-3 0.447733 Imported 2026-05-27
287 health-nlp-3 0.44515 Imported 2026-05-27
288 DS@GT-BioASQ 0.445133 Imported 2026-05-27
289 IR_Y-5 0.441267 Imported 2026-05-27
290 IR_Y-2 0.441133 Imported 2026-05-27
291 Organization name 0.441 Imported 2026-05-27
292 health-nlp-4 0.4402 Imported 2026-05-27
293 ubuntu 0.4396 Imported 2026-05-27
294 health-nlp-1 0.4362 Imported 2026-05-27
295 IR_J-2 0.4361 Imported 2026-05-27
296 Biomedical QA s. v2 0.427967 Imported 2026-05-27
297 IR_Y-1 0.4189 Imported 2026-05-27
298 DS@GT-BioASQ 0.418033 Imported 2026-05-27
299 agentic graph 0.4155 Imported 2026-05-27
300 health-nlp-2 0.404567 Imported 2026-05-27
301 "RMC_1" 0.4009 Imported 2026-05-27
302 health-nlp-1 0.398133 Imported 2026-05-27
303 health-nlp-4 0.397633 Imported 2026-05-27
304 health-nlp-4 0.397067 Imported 2026-05-27
305 health-nlp-2 0.3922 Imported 2026-05-27
306 "RMC_1" 0.391333 Imported 2026-05-27
307 Hybrid Retrieval 0.368433 Imported 2026-05-27
308 health-nlp-2 0.365033 Imported 2026-05-27
309 LLM Biomedical QA 0.3602 Imported 2026-05-27
310 health-nlp-1 0.353433 Imported 2026-05-27
311 health-nlp-2 0.340433 Imported 2026-05-27
312 asmalltrialsystem 0.32775 Imported 2026-05-27
313 Organization name 0.312033 Imported 2026-05-27
314 DSGT 0.298867 Imported 2026-05-27
315 IR_Y-1 0.294933 Imported 2026-05-27
316 health-nlp-3 0.2907 Imported 2026-05-27
317 IR_Y-2 0.2872 Imported 2026-05-27
318 health-nlp-1 0.2553 Imported 2026-05-27