MedicalBenchmark
AllenAI: Olmo 3 7B Think provider

Olmo 3 7B Think

Rank: #259 of 290 models (MIR 2025)

Net score: 62.66 pts
Accuracy: 48.0%
Correct / Incorrect: 96 / 100
Total Cost: $0.14

Overall Performance (vs. average)

| Metric | Olmo 3 7B Think | Average |
| --- | --- | --- |
| Accuracy | 48.0% | 75.9% |
| Net score | 62.66 pts | 138.99 pts |
| Correct | 96 | 152 |
| Incorrect | 100 | 38 |
| Total Cost | $0.14 | $3.59 |
| Average response time | 15.2s | 18.1s |
| Output Tokens | 610K | 443K |
| Reasoning Tokens | 544K | 320K |
| Average confidence | 96.9% | 94.7% |
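
The page does not say how the net score or the accuracy is computed. As a rough check, the sketch below assumes the MIR-style rule of one point per correct answer minus one third of a point per incorrect answer, and a denominator of 200 scored questions for accuracy. Both are assumptions rather than documented behavior, and the function names are illustrative.

```python
def net_score(correct: int, incorrect: int, penalty: float = 1 / 3) -> float:
    """Net score under an assumed penalty of 1/3 point per wrong answer."""
    return correct - incorrect * penalty


def accuracy_pct(correct: int, scored_questions: int) -> float:
    """Accuracy as a percentage of an assumed number of scored questions."""
    return 100.0 * correct / scored_questions


if __name__ == "__main__":
    # Figures reported above: 96 correct, 100 incorrect.
    print(f"net score: {net_score(96, 100):.2f} pts")
    print(f"accuracy:  {accuracy_pct(96, 200):.1f}%")
```

Under these assumptions the net score comes out at about 62.67, close to the 62.66 pts shown (the small gap would be explained if the site truncates rather than rounds), and the accuracy at 48.0%.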

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Allergology | 1 | 3 | 0 | 25.0% | 86.9% |
| Anesthesiology and Resuscitation | 1 | 5 | 0 | 16.7% | 81.3% |
| Cardiology | 7 | 15 | 0 | 31.8% | 77.4% |
| Dermatology | 6 | 7 | 0 | 46.2% | 62.8% |
| Endocrinology and Nutrition | 10 | 5 | 1 | 62.5% | 82.5% |
| ENT | 3 | 5 | 0 | 37.5% | 73.8% |
| Epidemiology | 4 | 3 | 0 | 57.1% | 67.1% |
| Gastroenterology | 8 | 13 | 0 | 38.1% | 72.9% |
| Genetics | 2 | 4 | 0 | 33.3% | 68.2% |
| Geriatrics | 5 | 6 | 0 | 45.5% | 71.2% |
| Gynecology and Obstetrics | 13 | 6 | 0 | 68.4% | 85.9% |
| Health Planning and Management | 1 | 1 | 0 | 50.0% | 81.6% |
| Hematology | 9 | 2 | 0 | 81.8% | 81.8% |
| Immunology | 4 | 4 | 1 | 44.4% | 82.5% |
| Infectious Diseases | 11 | 16 | 1 | 39.3% | 71.1% |
| Legal Medicine and Bioethics | 3 | 2 | 0 | 60.0% | 67.2% |
| Medical Oncology | 17 | 8 | 0 | 68.0% | 86.3% |
| Nephrology | 10 | 5 | 0 | 66.7% | 78.2% |
| Neurology | 11 | 8 | 1 | 55.0% | 76.2% |
| Ophthalmology | 1 | 4 | 0 | 20.0% | 72.6% |
| Palliative Care | 1 | 3 | 0 | 25.0% | 77.2% |
| Pediatrics | 8 | 16 | 1 | 32.0% | 72.7% |
| Pharmacology | 11 | 6 | 0 | 64.7% | 73.1% |
| Psychiatry | 6 | 2 | 0 | 75.0% | 82.0% |
| Pulmonology | 4 | 10 | 0 | 28.6% | 73.0% |
| Radiology-Emergency | 7 | 7 | 0 | 50.0% | 67.9% |
| Rheumatology | 5 | 9 | 0 | 35.7% | 74.6% |
| Statistics | 3 | 0 | 0 | 100.0% | 74.9% |
| Traumatology | 8 | 9 | 1 | 44.4% | 78.2% |
| Urology | 4 | 3 | 0 | 57.1% | 79.5% |
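
The per-subject accuracy figures are consistent with dividing correct answers by all questions in the subject, unanswered ones included (for example, Endocrinology and Nutrition: 10 of 16 gives 62.5%). The page does not state this formula, so the following is a minimal sketch under that assumption, with an illustrative function name.

```python
def subject_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Percentage of correct answers over all questions in a subject.

    Assumes unanswered questions count in the denominator, which matches
    the rows reported on this page but is not documented by the site.
    """
    total = correct + incorrect + unanswered
    if total == 0:
        return 0.0
    return round(100.0 * correct / total, 1)


if __name__ == "__main__":
    # (correct, incorrect, unanswered) taken from the breakdown above.
    rows = {
        "Endocrinology and Nutrition": (10, 5, 1),
        "Immunology": (4, 4, 1),
        "Infectious Diseases": (11, 16, 1),
    }
    for name, counts in rows.items():
        print(f"{name}: {subject_accuracy(*counts)}%")
```

Leaving the unanswered questions out of the denominator would give 66.7% for Endocrinology and Nutrition instead of the reported 62.5%, which is why the sketch includes them.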

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Anatomy | 3 | 3 | 1 | 42.9% | 77.1% |
| Biostatistics | 4 | 0 | 0 | 100.0% | 78.4% |
| Diagnosis | 44 | 44 | 1 | 49.4% | 77.9% |
| Epidemiology | 2 | 3 | 0 | 40.0% | 75.0% |
| Ethics | 1 | 2 | 0 | 33.3% | 72.0% |
| Interpretation | 18 | 23 | 1 | 42.9% | 69.3% |
| Legal | 3 | 1 | 0 | 75.0% | 63.6% |
| Pathophysiology | 11 | 14 | 2 | 40.7% | 72.6% |
| Pharmacology | 10 | 3 | 0 | 76.9% | 82.4% |
| Prevention | 5 | 7 | 0 | 41.7% | 74.5% |
| Prognosis | 2 | 4 | 0 | 33.3% | 77.8% |
| Risk | 1 | 4 | 0 | 20.0% | 84.3% |
| Tests | 15 | 10 | 1 | 57.7% | 76.3% |
| Treatment | 40 | 41 | 1 | 48.8% | 75.2% |
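
| # | Answer | Correct | Status |
| --- | --- | --- | --- |
| 1 | B | B | |
| 2 | A | A | |
| 3 | D | C | |
| 4 | D | B | |
| 5 | A | A | |
| 6 | C | C | |
| 7 | B | C | |
| 8 | C | A | |
| 9 | C | A | |
| 10 | A | D | |
| 11 | D | D | |
| 12 | C | D | |
| 13 | A | B | |
| 14 | B | D | |
| 15 | | C | |
| 16 | D | B | |
| 17 | A | B | |
| 18 | C | A | |
| 19 | B | C | |
| 20 | A | A | |
| 21 | B | B | |
| 22 | D | D | |
| 23 | | C | |
| 24 | B | D | |
| 25 | C | C | |
| 26 | D | | Annulled |
| 27 | B | C | |
| 28 | D | | Annulled |
| 29 | B | D | |
| 30 | B | B | |
| 31 | D | D | |
| 32 | A | A | |
| 33 | D | D | |
| 34 | A | D | |
| 35 | B | B | |
| 36 | B | D | |
| 37 | D | C | |
| 38 | B | C | |
| 39 | D | D | |
| 40 | A | A | |
| 41 | D | D | |
| 42 | C | C | |
| 43 | D | B | |
| 44 | D | D | |
| 45 | D | D | |
| 46 | A | A | |
| 47 | A | A | |
| 48 | A | A | |
| 49 | B | D | |
| 50 | B | B | |
| 51 | D | C | |
| 52 | C | B | |
| 53 | A | D | |
| 54 | D | B | |
| 55 | C | A | |
| 56 | B | | Annulled |
| 57 | C | C | |
| 58 | B | B | |
| 59 | D | D | |
| 60 | B | A | |
| 61 | B | A | |
| 62 | D | D | |
| 63 | B | B | |
| 64 | B | D | |
| 65 | C | A | |
| 66 | A | A | |
| 67 | B | B | |
| 68 | D | B | |
| 69 | A | B | |
| 70 | D | A | |
| 71 | C | D | |
| 72 | C | A | |
| 73 | B | D | |
| 74 | C | C | |
| 75 | A | A | |
| 76 | B | B | |
| 77 | B | B | |
| 78 | B | B | |
| 79 | D | C | |
| 80 | C | C | |
| 81 | A | C | |
| 82 | C | D | |
| 83 | B | B | |
| 84 | D | D | |
| 85 | D | C | |
| 86 | B | C | |
| 87 | C | A | |
| 88 | D | D | |
| 89 | C | B | |
| 90 | A | A | |
| 91 | | B | |
| 92 | D | C | |
| 93 | B | B | |
| 94 | A | C | |
| 95 | A | A | |
| 96 | D | C | |
| 97 | D | D | |
| 98 | D | C | |
| 99 | B | A | |
| 100 | B | C | |
| 101 | A | B | |
| 102 | D | D | |
| 103 | A | A | |
| 104 | B | C | |
| 105 | A | A | |
| 106 | C | C | |
| 107 | C | B | |
| 108 | B | D | |
| 109 | B | B | |
| 110 | C | C | |
| 111 | B | A | |
| 112 | B | C | |
| 113 | A | B | |
| 114 | A | D | |
| 115 | B | D | |
| 116 | C | C | |
| 117 | A | A | |
| 118 | C | D | |
| 119 | C | C | |
| 120 | D | B | |
| 121 | A | D | |
| 122 | A | C | |
| 123 | C | C | |
| 124 | B | C | |
| 125 | B | D | |
| 126 | D | D | |
| 127 | B | B | |
| 128 | D | D | |
| 129 | B | A | |
| 130 | D | D | |
| 131 | D | D | |
| 132 | A | A | |
| 133 | A | B | |
| 134 | C | C | |
| 135 | C | B | |
| 136 | D | C | |
| 137 | A | A | |
| 138 | D | D | |
| 139 | D | D | |
| 140 | B | B | |
| 141 | A | A | |
| 142 | A | A | |
| 143 | B | B | |
| 144 | B | B | |
| 145 | D | D | |
| 146 | C | C | |
| 147 | D | B | |
| 148 | D | A | |
| 149 | C | A | |
| 150 | D | A | |
| 151 | A | A | |
| 152 | A | A | |
| 153 | | B | |
| 154 | B | B | |
| 155 | A | B | |
| 156 | C | C | |
| 157 | A | A | |
| 158 | C | C | |
| 159 | C | C | |
| 160 | B | A | |
| 161 | C | A | |
| 162 | | C | |
| 163 | C | D | |
| 164 | C | C | |
| 165 | A | A | |
| 166 | B | B | |
| 167 | A | C | |
| 168 | B | D | |
| 169 | C | B | |
| 170 | A | B | |
| 171 | D | C | |
| 172 | B | A | |
| 173 | D | A | |
| 174 | C | B | |
| 175 | C | B | |
| 176 | A | C | |
| 177 | C | C | |
| 178 | B | A | |
| 179 | D | D | |
| 180 | A | A | |
| 181 | B | B | |
| 182 | B | C | |
| 183 | D | B | |
| 184 | B | B | |
| 185 | D | B | |
| 186 | D | | Annulled |
| 187 | C | C | |
| 188 | D | D | |
| 189 | B | D | |
| 190 | A | A | |
| 191 | A | B | |
| 192 | B | A | |
| 193 | C | C | |
| 194 | C | A | |
| 195 | A | A | |
| 196 | A | A | |
| 197 | B | B | |
| 198 | B | C | |
| 199 | D | D | |
| 200 | C | C | |
| 201 | | B | |
| 202 | B | A | |
| 203 | D | D | |
| 204 | B | C | |
| 205 | B | B | |
| 206 | C | D | |
| 207 | A | A | |
| 208 | C | C | |
| 209 | C | C | |
| 210 | B | B | |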