MedicalBenchmark
Provider: AllenAI

Olmo 3 32B Think

Rank: #220 of 290 models (MIR 2025)

Net score: 123.66 pts
Accuracy: 69.0%
Correct / Incorrect: 138 / 43
Total Cost: $0.38

Overall Performance (vs. average)

| Metric | This model | Average |
|---|---|---|
| Accuracy | 69.0% | 75.9% |
| Net score | 123.66 pts | 138.99 pts |
| Correct | 138 | 152 |
| Incorrect | 43 | 38 |
| Total Cost | $0.38 | $3.59 |
| Average response time | 51.1s | 18.1s |
| Output Tokens | 733K | 443K |
| Reasoning Tokens | 669K | 320K |
| Average confidence | 89.5% | 94.7% |
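
The page does not state how the net score is derived from the correct and incorrect counts. The figures are consistent with a penalty of one third of a point per incorrect answer, and the 69.0% accuracy is consistent with 200 scored questions. Both values are inferred from the numbers shown, not confirmed by the source. A minimal sketch under those assumptions (`net_score` and `accuracy_pct` are hypothetical helper names):

```python
from fractions import Fraction

# Assumption: each incorrect answer costs 1/3 of a point (inferred, not stated on the page).
PENALTY = Fraction(1, 3)

# Assumption: 200 questions are scored (inferred from 138 correct at 69.0% accuracy).
TOTAL_SCORED = 200


def net_score(correct: int, incorrect: int, penalty: Fraction = PENALTY) -> Fraction:
    """Return correct answers minus the penalty for incorrect ones, as an exact fraction."""
    return correct - incorrect * penalty


def accuracy_pct(correct: int, total_scored: int = TOTAL_SCORED) -> float:
    """Return the share of scored questions answered correctly, in percent."""
    return 100.0 * correct / total_scored


score = net_score(138, 43)
print(f"net score: {float(score):.4f} pts")
print(f"accuracy:  {accuracy_pct(138):.1f}%")
```

Under these assumptions the net score works out to 371/3, roughly 123.667 points. The page shows 123.66, which would match truncation to two decimals rather than rounding.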

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 4 | 0 | 0 | 100.0% | 86.9% |
| Anesthesiology and Resuscitation | 4 | 2 | 0 | 66.7% | 81.3% |
| Cardiology | 12 | 5 | 5 | 54.5% | 77.4% |
| Dermatology | 8 | 4 | 1 | 61.5% | 62.8% |
| Endocrinology and Nutrition | 14 | 2 | 0 | 87.5% | 82.5% |
| ENT | 5 | 2 | 1 | 62.5% | 73.8% |
| Epidemiology | 5 | 2 | 0 | 71.4% | 67.1% |
| Gastroenterology | 13 | 6 | 2 | 61.9% | 72.9% |
| Genetics | 4 | 1 | 1 | 66.7% | 68.2% |
| Geriatrics | 9 | 2 | 0 | 81.8% | 71.2% |
| Gynecology and Obstetrics | 16 | 0 | 3 | 84.2% | 85.9% |
| Health Planning and Management | 0 | 2 | 0 | 0.0% | 81.6% |
| Hematology | 8 | 2 | 1 | 72.7% | 81.8% |
| Immunology | 7 | 2 | 0 | 77.8% | 82.5% |
| Infectious Diseases | 14 | 10 | 4 | 50.0% | 71.1% |
| Legal Medicine and Bioethics | 4 | 1 | 0 | 80.0% | 67.2% |
| Medical Oncology | 19 | 4 | 2 | 76.0% | 86.3% |
| Nephrology | 10 | 4 | 1 | 66.7% | 78.2% |
| Neurology | 17 | 2 | 1 | 85.0% | 76.2% |
| Ophthalmology | 4 | 0 | 1 | 80.0% | 72.6% |
| Palliative Care | 2 | 2 | 0 | 50.0% | 77.2% |
| Pediatrics | 16 | 8 | 1 | 64.0% | 72.7% |
| Pharmacology | 11 | 4 | 2 | 64.7% | 73.1% |
| Psychiatry | 8 | 0 | 0 | 100.0% | 82.0% |
| Pulmonology | 6 | 7 | 1 | 42.9% | 73.0% |
| Radiology-Emergency | 8 | 4 | 2 | 57.1% | 67.9% |
| Rheumatology | 11 | 3 | 0 | 78.6% | 74.6% |
| Statistics | 3 | 0 | 0 | 100.0% | 74.9% |
| Traumatology | 13 | 3 | 2 | 72.2% | 78.2% |
| Urology | 4 | 2 | 1 | 57.1% | 79.5% |
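
The Accuracy figures in the subject breakdown are consistent with dividing correct answers by all questions in the category, unanswered ones included (for Cardiology, 12 of 22). The page does not spell this out, so the sketch below treats it as an assumption and reproduces three of the rows; `category_accuracy` is a hypothetical helper, not part of the benchmark's tooling.

```python
def category_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Percent of a category's questions answered correctly.

    Assumption: unanswered questions count toward the denominator,
    which matches the figures shown in the breakdown.
    """
    total = correct + incorrect + unanswered
    if total == 0:
        return 0.0
    return round(100.0 * correct / total, 1)


# (correct, incorrect, unanswered) copied from the subject breakdown.
rows = {
    "Cardiology": (12, 5, 5),
    "Infectious Diseases": (14, 10, 4),
    "Pulmonology": (6, 7, 1),
}

for subject, counts in rows.items():
    print(f"{subject}: {category_accuracy(*counts):.1f}%")
```

Counting unanswered questions in the denominator means a skipped question lowers accuracy just as a wrong answer does, even though only wrong answers would reduce a penalty-based net score.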

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 5 | 2 | 0 | 71.4% | 77.1% |
| Biostatistics | 3 | 1 | 0 | 75.0% | 78.4% |
| Diagnosis | 63 | 16 | 10 | 70.8% | 77.9% |
| Epidemiology | 4 | 1 | 0 | 80.0% | 75.0% |
| Ethics | 3 | 0 | 0 | 100.0% | 72.0% |
| Interpretation | 25 | 7 | 10 | 59.5% | 69.3% |
| Legal | 3 | 1 | 0 | 75.0% | 63.6% |
| Pathophysiology | 17 | 7 | 3 | 63.0% | 72.6% |
| Pharmacology | 12 | 0 | 1 | 92.3% | 82.4% |
| Prevention | 9 | 2 | 1 | 75.0% | 74.5% |
| Prognosis | 3 | 2 | 1 | 50.0% | 77.8% |
| Risk | 3 | 1 | 1 | 60.0% | 84.3% |
| Tests | 19 | 7 | 0 | 73.1% | 76.3% |
| Treatment | 55 | 20 | 7 | 67.1% | 75.2% |

# Answer Correct Status
1 B
2 A A
3 C C
4 A B
5 A A
6 C C
7 C C
8 A A
9 A
10 D D
11 D
12 D D
13 B
14 D
15 B
16 C B
17 A B
18 A
19 C C
20 B A
21 B B
22 C D
23 C
24 D
25 C C
26 D Annulled
27 B C
28 A Annulled
29 D D
30 B B
31 D
32 A A
33 D D
34 D D
35 B B
36 D D
37 D C
38 C C
39 D D
40 A A
41 D D
42 B C
43 B B
44 D D
45 D D
46 A A
47 A A
48 A A
49 D D
50 B B
51 D C
52 B B
53 D
54 B B
55 A
56 B Annulled
57 C C
58 B B
59 D D
60 A A
61 A A
62 D D
63 B B
64 D D
65 B A
66 A A
67 B B
68 B
69 A B
70 A A
71 D D
72 A A
73 D D
74 C C
75 A A
76 B B
77 B B
78 B B
79 C C
80 C C
81 C C
82 D D
83 B B
84 D D
85 C C
86 B C
87 A A
88 D D
89 B B
90 B A
91 C B
92 C C
93 B B
94 C C
95 A A
96 C C
97 D D
98 C C
99 A
100 B C
101 B B
102 D
103 B A
104 A C
105 A A
106 C C
107 C B
108 D D
109 A B
110 C C
111 A A
112 C C
113 B B
114 D D
115 A D
116 C C
117 A A
118 D D
119 C C
120 B B
121 B D
122 C C
123 C C
124 C C
125 B D
126 B D
127 A B
128 D D
129 A A
130 D D
131 D D
132 A A
133 B B
134 C
135 D B
136 D C
137 A A
138 D D
139 D D
140 B B
141 A A
142 A A
143 B B
144 B B
145 B D
146 C C
147 B
148 A A
149 D A
150 D A
151 A A
152 A A
153 B B
154 B B
155 B B
156 C C
157 A A
158 C C
159 C C
160 C A
161 A A
162
163 C D
164 C
165 D A
166 B B
167 A C
168 B D
169 C B
170 B B
171 C C
172 B A
173 A A
174 B B
175 B B
176 C C
177 C C
178 A A
179 D D
180 A A
181 C B
182 C C
183 B B
184 B B
185 D B
186 A Annulled
187 C C
188 D D
189 B D
190 A A
191 B B
192 B A
193 C C
194 A A
195 A A
196 A A
197 B B
198 B C
199 D D
200 C C
201 A B
202 A A
203 D D
204 C
205 B B
206 C D
207 A A
208 C C
209 C C
210 B