MedicalBenchmark
AllenAI: Olmo 3.1 32B Think provider

Olmo 3.1 32B Think

246

#246 of 319 modelsMIR 2025

Net score

127.66 pts

Accuracy

70.5%

Correct / Incorrect

141 / 40

Total Cost

$0.41

Overall Performance

(vs. average)
Accuracy

70.5%

avg: 77.9%

Net score

127.66 pts

avg: 143.96 pts

Correct

141

avg: 156

Incorrect

40

avg: 35

Total Cost

$0.41

avg: $3.36

Average response time

54.7s

avg: 19.0s

Output Tokens

787K

avg: 430K

Reasoning Tokens

721K

avg: 306K

Average confidence

88.3%

avg: 95.2%

Subject Breakdown

Allergology
Correct
2
Incorrect
1
Unanswered
1
Accuracy
50.0%
Average
87.9%
Anesthesiology and Resuscitation
Correct
5
Incorrect
0
Unanswered
1
Accuracy
83.3%
Average
82.3%
Cardiology
Correct
12
Incorrect
6
Unanswered
4
Accuracy
54.5%
Average
78.6%
Dermatology
Correct
7
Incorrect
3
Unanswered
2
Accuracy
58.3%
Average
69.4%
Endocrinology and Nutrition
Correct
14
Incorrect
2
Unanswered
0
Accuracy
87.5%
Average
83.5%
ENT
Correct
4
Incorrect
2
Unanswered
2
Accuracy
50.0%
Average
74.8%
Epidemiology
Correct
5
Incorrect
2
Unanswered
0
Accuracy
71.4%
Average
69.1%
Gastroenterology
Correct
15
Incorrect
6
Unanswered
0
Accuracy
71.4%
Average
74.1%
Genetics
Correct
5
Incorrect
1
Unanswered
0
Accuracy
83.3%
Average
69.5%
Geriatrics
Correct
7
Incorrect
3
Unanswered
1
Accuracy
63.6%
Average
77.5%
Gynecology and Obstetrics
Correct
18
Incorrect
0
Unanswered
1
Accuracy
94.7%
Average
86.7%
Health Planning and Management
Correct
1
Incorrect
0
Unanswered
1
Accuracy
50.0%
Average
82.6%
Hematology
Correct
9
Incorrect
2
Unanswered
0
Accuracy
81.8%
Average
82.7%
Immunology
Correct
8
Incorrect
1
Unanswered
0
Accuracy
88.9%
Average
83.3%
Infectious Diseases
Correct
16
Incorrect
10
Unanswered
1
Accuracy
59.3%
Average
74.9%
Legal Medicine and Bioethics
Correct
3
Incorrect
1
Unanswered
1
Accuracy
60.0%
Average
68.4%
Medical Oncology
Correct
20
Incorrect
3
Unanswered
2
Accuracy
80.0%
Average
87.2%
Nephrology
Correct
11
Incorrect
3
Unanswered
0
Accuracy
78.6%
Average
84.8%
Neurology
Correct
16
Incorrect
3
Unanswered
1
Accuracy
80.0%
Average
77.3%
Ophthalmology
Correct
3
Incorrect
0
Unanswered
2
Accuracy
60.0%
Average
74.2%
Palliative Care
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
78.6%
Pediatrics
Correct
15
Incorrect
9
Unanswered
2
Accuracy
57.7%
Average
71.9%
Pharmacology
Correct
11
Incorrect
4
Unanswered
2
Accuracy
64.7%
Average
74.1%
Psychiatry
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
83.0%
Pulmonology
Correct
9
Incorrect
2
Unanswered
3
Accuracy
64.3%
Average
80.4%
Radiology-Emergency
Correct
7
Incorrect
5
Unanswered
2
Accuracy
50.0%
Average
69.4%
Rheumatology
Correct
12
Incorrect
3
Unanswered
0
Accuracy
80.0%
Average
76.6%
Statistics
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
76.6%
Traumatology
Correct
12
Incorrect
3
Unanswered
3
Accuracy
66.7%
Average
79.3%
Urology
Correct
4
Incorrect
1
Unanswered
2
Accuracy
57.1%
Average
80.7%

Question Type Breakdown

Anatomy
Correct
5
Incorrect
1
Unanswered
1
Accuracy
71.4%
Average
78.6%
Biostatistics
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
79.8%
Diagnosis
Correct
60
Incorrect
18
Unanswered
10
Accuracy
68.2%
Average
79.9%
Epidemiology
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
76.7%
Ethics
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
74.1%
Interpretation
Correct
25
Incorrect
9
Unanswered
8
Accuracy
59.5%
Average
70.7%
Legal
Correct
2
Incorrect
1
Unanswered
1
Accuracy
50.0%
Average
64.6%
Pathophysiology
Correct
20
Incorrect
5
Unanswered
2
Accuracy
74.1%
Average
76.1%
Pharmacology
Correct
10
Incorrect
1
Unanswered
2
Accuracy
76.9%
Average
83.3%
Prevention
Correct
10
Incorrect
2
Unanswered
0
Accuracy
83.3%
Average
75.6%
Prognosis
Correct
5
Incorrect
1
Unanswered
1
Accuracy
71.4%
Average
80.8%
Risk
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
85.2%
Tests
Correct
20
Incorrect
6
Unanswered
1
Accuracy
74.1%
Average
77.9%
Treatment
Correct
56
Incorrect
17
Unanswered
8
Accuracy
69.1%
Average
77.3%
#AnswerCorrectStatus
1BB
2A
3CC
4CB
5AA
6CC
7CC
8AA
9CA
10DD
11D
12D
13B
14BD
15BAnnulled
16B
17DB
18BA
19CC
20BA
21BB
22CD
23AC
24DD
25CC
26Annulled
27BC
28AAnnulled
29DD
30BB
31DD
32AA
33DD
34DD
35BB
36DD
37DC
38CC
39CD
40AA
41DD
42CC
43BB
44DD
45DD
46A
47AA
48AA
49D
50B
51DC
52BB
53D
54BB
55A
56BAnnulled
57CC
58BB
59DD
60AA
61AA
62DD
63BB
64DD
65BA
66AA
67BB
68B
69AB
70AA
71CD
72AA
73CD
74CC
75AA
76BB
77BB
78BB
79CC
80CC
81CC
82DD
83BB
84D
85CC
86C
87AA
88DD
89BB
90CA
91BB
92CC
93BB
94CC
95AA
96CC
97DD
98CC
99A
100BC
101BB
102DD
103DA
104DC
105AA
106CC
107CB
108DD
109BB
110CC
111AA
112CC
113B
114DD
115AD
116CC
117AA
118DD
119CC
120BB
121BD
122CC
123CC
124CC
125DD
126BD
127AB
128DD
129AA
130DD
131DD
132AA
133BB
134CC
135BB
136DC
137AA
138DD
139DD
140AB
141AA
142AA
143BB
144BB
145BD
146CC
147BB
148AA
149DA
150DD
151AA
152BA
153BB
154BB
155BB
156CC
157AA
158CC
159CC
160AA
161DA
162Annulled
163DD
164C
165DA
166BB
167AC
168BD
169CB
170BB
171CC
172BA
173AA
174BB
175BB
176CC
177BC
178AA
179DD
180AA
181B
182CC
183BB
184BB
185BB
186AAnnulled
187CC
188DD
189BD
190A
191BB
192AA
193CC
194AA
195AA
196AA
197BB
198CC
199DD
200CC
201CB
202AA
203D
204CC
205BB
206CD
207AA
208AC
209CC
210BB