MedicalBenchmark
AllenAI: Olmo 3.1 32B Think provider

Olmo 3.1 32B Think

215

#215 of 290 modelsMIR 2026

Net score

144.33 pts

Accuracy

76.5%

Correct / Incorrect

153 / 26

Total Cost

$0.35

Overall Performance

(vs. average)
Accuracy

76.5%

avg: 81.6%

Net score

144.33 pts

avg: 154.00 pts

Correct

153

avg: 163

Incorrect

26

avg: 28

Total Cost

$0.35

avg: $3.33

Average response time

42.7s

avg: 16.2s

Output Tokens

659K

avg: 430K

Reasoning Tokens

594K

avg: 310K

Average confidence

87.4%

avg: 95.1%

Subject Breakdown

Allergology
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
96.9%
Anesthesiology and Resuscitation
Correct
4
Incorrect
2
Unanswered
1
Accuracy
57.1%
Average
69.6%
Cardiology
Correct
17
Incorrect
5
Unanswered
3
Accuracy
68.0%
Average
77.3%
Dermatology
Correct
7
Incorrect
3
Unanswered
1
Accuracy
63.6%
Average
72.3%
Endocrinology and Nutrition
Correct
13
Incorrect
1
Unanswered
1
Accuracy
86.7%
Average
84.0%
ENT
Correct
6
Incorrect
0
Unanswered
2
Accuracy
75.0%
Average
84.7%
Epidemiology
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
80.2%
Gastroenterology
Correct
22
Incorrect
5
Unanswered
3
Accuracy
73.3%
Average
79.3%
Genetics
Correct
8
Incorrect
1
Unanswered
2
Accuracy
72.7%
Average
78.7%
Geriatrics
Correct
10
Incorrect
2
Unanswered
1
Accuracy
76.9%
Average
83.0%
Gynecology and Obstetrics
Correct
10
Incorrect
0
Unanswered
2
Accuracy
83.3%
Average
84.3%
Health Planning and Management
Correct
8
Incorrect
1
Unanswered
1
Accuracy
80.0%
Average
78.4%
Hematology
Correct
7
Incorrect
2
Unanswered
0
Accuracy
77.8%
Average
76.6%
Immunology
Correct
6
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.4%
Infectious Diseases
Correct
9
Incorrect
4
Unanswered
1
Accuracy
64.3%
Average
77.9%
Legal Medicine and Bioethics
Correct
7
Incorrect
2
Unanswered
2
Accuracy
63.6%
Average
82.9%
Medical Oncology
Correct
18
Incorrect
2
Unanswered
3
Accuracy
78.3%
Average
83.0%
Nephrology
Correct
9
Incorrect
1
Unanswered
0
Accuracy
90.0%
Average
85.1%
Neurology
Correct
10
Incorrect
1
Unanswered
2
Accuracy
76.9%
Average
88.6%
Ophthalmology
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
83.7%
Palliative Care
Correct
3
Incorrect
2
Unanswered
1
Accuracy
50.0%
Average
80.2%
Pediatrics
Correct
19
Incorrect
3
Unanswered
0
Accuracy
86.4%
Average
87.6%
Pharmacology
Correct
10
Incorrect
1
Unanswered
0
Accuracy
90.9%
Average
78.6%
Psychiatry
Correct
5
Incorrect
2
Unanswered
1
Accuracy
62.5%
Average
87.9%
Pulmonology
Correct
13
Incorrect
2
Unanswered
1
Accuracy
81.3%
Average
82.8%
Radiology-Emergency
Correct
7
Incorrect
3
Unanswered
3
Accuracy
53.8%
Average
67.7%
Rheumatology
Correct
11
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
88.4%
Statistics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
83.8%
Traumatology
Correct
7
Incorrect
3
Unanswered
1
Accuracy
63.6%
Average
65.2%
Urology
Correct
6
Incorrect
1
Unanswered
1
Accuracy
75.0%
Average
82.5%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
82.6%
Biostatistics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
83.8%
Diagnosis
Correct
67
Incorrect
6
Unanswered
8
Accuracy
82.7%
Average
82.2%
Epidemiology
Correct
9
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
88.7%
Ethics
Correct
5
Incorrect
0
Unanswered
1
Accuracy
83.3%
Average
92.0%
Interpretation
Correct
24
Incorrect
5
Unanswered
8
Accuracy
64.9%
Average
72.0%
Legal
Correct
6
Incorrect
1
Unanswered
2
Accuracy
66.7%
Average
82.4%
Pathophysiology
Correct
21
Incorrect
2
Unanswered
3
Accuracy
80.8%
Average
84.3%
Pharmacology
Correct
13
Incorrect
2
Unanswered
0
Accuracy
86.7%
Average
82.3%
Prevention
Correct
13
Incorrect
1
Unanswered
2
Accuracy
81.3%
Average
80.6%
Prognosis
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
93.2%
Risk
Correct
11
Incorrect
2
Unanswered
2
Accuracy
73.3%
Average
84.3%
Tests
Correct
26
Incorrect
4
Unanswered
3
Accuracy
78.8%
Average
80.3%
Treatment
Correct
49
Incorrect
17
Unanswered
6
Accuracy
68.1%
Average
80.1%
#AnswerCorrectStatus
1A
2BB
3AD
4DD
5BB
6C
7BB
8D
9DD
10AA
11B
12A
13CAnnulled
14C
15DA
16CC
17CC
18AD
19BB
20CB
21BB
22BC
23B
24AA
25C
26AA
27BB
28DD
29AA
30DD
31CC
32CC
33BB
34AB
35AC
36CD
37DD
38AA
39CC
40BA
41DD
42AA
43BB
44BB
45BB
46BB
47DD
48CC
49CC
50Annulled
51AA
52CC
53CC
54CC
55A
56AA
57A
58AA
59B
60CB
61CC
62BB
63DB
64CAnnulled
65CC
66DC
67CC
68AA
69BB
70BB
71BB
72BB
73B
74AA
75CC
76C
77DA
78DD
79AA
80DD
81CC
82CD
83BB
84AA
85D
86AA
87BB
88BB
89BB
90CC
91AA
92CC
93D
94BC
95CC
96BB
97BB
98AA
99BB
100CC
101CC
102AA
103CC
104CC
105CC
106BB
107CC
108AA
109CC
110C
111B
112CC
113CC
114DD
115BB
116DD
117CC
118CB
119AA
120AA
121CC
122BB
123DB
124BB
125CC
126B
127CC
128CB
129DC
130CC
131BB
132AB
133CC
134BB
135CC
136BB
137DD
138BB
139AAnnulled
140DD
141AA
142AAnnulled
143AA
144DD
145C
146BC
147BB
148CC
149BB
150BB
151BB
152CC
153CC
154DD
155BB
156AA
157DD
158BB
159CC
160AA
161CAnnulled
162AA
163CC
164CC
165AA
166AA
167D
168CC
169B
170BB
171AA
172AB
173DD
174CC
175BB
176AA
177CD
178DD
179BB
180CC
181BB
182CC
183DD
184DD
185BB
186DD
187DD
188CC
189CC
190BB
191DD
192BD
193CC
194CC
195DD
196AA
197BC
198BA
199BB
200DD
201CC
202AA
203AA
204CC
205BB
206CC
207AA
208BAnnulled
209BB
210CC