MedicalBenchmark
AllenAI: Olmo 3.1 32B Instruct

#276 of 319 models, MIR 2025

Net score: 79.33 pts
Accuracy: 53.5%
Correct / Incorrect: 107 / 83
Total Cost: $0.12

Overall Performance (vs. average)

| Metric | Model | Average |
| --- | --- | --- |
| Accuracy | 53.5% | 77.9% |
| Net score | 79.33 pts | 143.96 pts |
| Correct | 107 | 156 |
| Incorrect | 83 | 35 |
| Total Cost | $0.12 | $3.36 |
| Average response time | 13.7s | 19.0s |
| Output Tokens | 163K | 430K |
| Reasoning Tokens | 0 | 306K |
| Average confidence | 94.0% | 95.2% |
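
The reported net score and accuracy can be reproduced from the raw counts. The sketch below assumes a negative-marking rule of one point per correct answer minus one third of a point per incorrect answer, and a pool of 200 scored questions. Neither assumption is stated on this page, but both are consistent with the figures above (107 correct, 83 incorrect, 79.33 pts, 53.5%). The function names are illustrative only.

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> float:
    """Points under an ASSUMED rule: +1 per correct, -penalty per incorrect."""
    return float(correct - incorrect * penalty)


def accuracy_pct(correct: int, scored_questions: int) -> float:
    """Share of scored questions answered correctly, as a percentage."""
    if scored_questions <= 0:
        raise ValueError("scored_questions must be positive")
    return 100 * correct / scored_questions


if __name__ == "__main__":
    correct, incorrect = 107, 83
    scored = 200  # assumption: inferred from 107 correct at 53.5% accuracy
    print(f"net score: {net_score(correct, incorrect):.2f} pts")
    print(f"accuracy:  {accuracy_pct(correct, scored):.1f}%")
```

Under these assumptions, unanswered questions cost nothing in net score but still lower accuracy, because they stay in the denominator.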

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Allergology | 3 | 1 | 0 | 75.0% | 87.9% |
| Anesthesiology and Resuscitation | 4 | 2 | 0 | 66.7% | 82.3% |
| Cardiology | 11 | 11 | 0 | 50.0% | 78.6% |
| Dermatology | 8 | 4 | 0 | 66.7% | 69.4% |
| Endocrinology and Nutrition | 9 | 6 | 1 | 56.3% | 83.5% |
| ENT | 1 | 5 | 2 | 12.5% | 74.8% |
| Epidemiology | 3 | 4 | 0 | 42.9% | 69.1% |
| Gastroenterology | 11 | 7 | 3 | 52.4% | 74.1% |
| Genetics | 5 | 1 | 0 | 83.3% | 69.5% |
| Geriatrics | 7 | 3 | 1 | 63.6% | 77.5% |
| Gynecology and Obstetrics | 11 | 7 | 1 | 57.9% | 86.7% |
| Health Planning and Management | 0 | 2 | 0 | 0.0% | 82.6% |
| Hematology | 8 | 3 | 0 | 72.7% | 82.7% |
| Immunology | 6 | 3 | 0 | 66.7% | 83.3% |
| Infectious Diseases | 17 | 10 | 0 | 63.0% | 74.9% |
| Legal Medicine and Bioethics | 2 | 3 | 0 | 40.0% | 68.4% |
| Medical Oncology | 18 | 7 | 0 | 72.0% | 87.2% |
| Nephrology | 7 | 7 | 0 | 50.0% | 84.8% |
| Neurology | 12 | 7 | 1 | 60.0% | 77.3% |
| Ophthalmology | 3 | 2 | 0 | 60.0% | 74.2% |
| Palliative Care | 2 | 2 | 0 | 50.0% | 78.6% |
| Pediatrics | 13 | 11 | 2 | 50.0% | 71.9% |
| Pharmacology | 10 | 6 | 1 | 58.8% | 74.1% |
| Psychiatry | 4 | 4 | 0 | 50.0% | 83.0% |
| Pulmonology | 7 | 6 | 1 | 50.0% | 80.4% |
| Radiology-Emergency | 7 | 5 | 2 | 50.0% | 69.4% |
| Rheumatology | 8 | 6 | 1 | 53.3% | 76.6% |
| Statistics | 2 | 1 | 0 | 66.7% | 76.6% |
| Traumatology | 9 | 6 | 3 | 50.0% | 79.3% |
| Urology | 3 | 4 | 0 | 42.9% | 80.7% |
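
The per-subject Accuracy values appear to count unanswered questions in the denominator: Endocrinology and Nutrition, for example, has 9 correct out of 9 + 6 + 1 = 16 and is listed at 56.3%. The sketch below reproduces that calculation and the gap to the listed average for a few rows. The helper name and the half-up rounding mode are assumptions chosen to match the displayed figures, not something the page documents.

```python
from decimal import Decimal, ROUND_HALF_UP


def category_accuracy(correct: int, incorrect: int, unanswered: int) -> Decimal:
    """Accuracy in percent, one decimal place, unanswered counted as misses."""
    total = correct + incorrect + unanswered
    if total == 0:
        raise ValueError("category has no questions")
    pct = Decimal(100 * correct) / Decimal(total)
    # Half-up rounding is an assumption: 56.25 must display as 56.3.
    return pct.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


# (correct, incorrect, unanswered, listed average) copied from the breakdown.
rows = {
    "Endocrinology and Nutrition": (9, 6, 1, Decimal("83.5")),
    "ENT": (1, 5, 2, Decimal("74.8")),
    "Genetics": (5, 1, 0, Decimal("69.5")),
}

for subject, (c, i, u, avg) in rows.items():
    acc = category_accuracy(c, i, u)
    print(f"{subject}: {acc}% ({acc - avg:+} points vs. average)")
```

Python's built-in `round` uses round-half-to-even, so `round(56.25, 1)` would give 56.2 rather than the 56.3 shown here, which is why the sketch uses `decimal` instead.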

Question Type Breakdown

| Question Type | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Anatomy | 2 | 4 | 1 | 28.6% | 78.6% |
| Biostatistics | 2 | 2 | 0 | 50.0% | 79.8% |
| Diagnosis | 50 | 31 | 7 | 56.8% | 79.9% |
| Epidemiology | 2 | 3 | 0 | 40.0% | 76.7% |
| Ethics | 1 | 2 | 0 | 33.3% | 74.1% |
| Interpretation | 20 | 16 | 6 | 47.6% | 70.7% |
| Legal | 1 | 3 | 0 | 25.0% | 64.6% |
| Pathophysiology | 12 | 14 | 1 | 44.4% | 76.1% |
| Pharmacology | 11 | 2 | 0 | 84.6% | 83.3% |
| Prevention | 8 | 4 | 0 | 66.7% | 75.6% |
| Prognosis | 3 | 4 | 0 | 42.9% | 80.8% |
| Risk | 1 | 4 | 0 | 20.0% | 85.2% |
| Tests | 16 | 11 | 0 | 59.3% | 77.9% |
| Treatment | 44 | 35 | 2 | 54.3% | 77.3% |

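
| # | Answer | Correct | Status |
| --- | --- | --- | --- |
| 1 | B | B | |
| 2 | B | A | |
| 3 | C | C | |
| 4 | D | B | |
| 5 | A | A | |
| 6 | C | C | |
| 7 | C | C | |
| 8 | A | A | |
| 9 | | A | |
| 10 | B | D | |
| 11 | | D | |
| 12 | C | D | |
| 13 | B | B | |
| 14 | D | D | |
| 15 | B | | Annulled |
| 16 | | B | |
| 17 | | B | |
| 18 | C | A | |
| 19 | | C | |
| 20 | B | A | |
| 21 | B | B | |
| 22 | C | D | |
| 23 | C | C | |
| 24 | | D | |
| 25 | C | C | |
| 26 | A | | Annulled |
| 27 | | C | |
| 28 | D | | Annulled |
| 29 | D | D | |
| 30 | D | B | |
| 31 | A | D | |
| 32 | D | A | |
| 33 | D | D | |
| 34 | C | D | |
| 35 | B | B | |
| 36 | A | D | |
| 37 | C | C | |
| 38 | C | C | |
| 39 | D | D | |
| 40 | B | A | |
| 41 | D | D | |
| 42 | B | C | |
| 43 | C | B | |
| 44 | D | D | |
| 45 | D | D | |
| 46 | A | A | |
| 47 | A | A | |
| 48 | D | A | |
| 49 | | D | |
| 50 | D | B | |
| 51 | D | C | |
| 52 | B | B | |
| 53 | D | D | |
| 54 | D | B | |
| 55 | C | A | |
| 56 | B | | Annulled |
| 57 | A | C | |
| 58 | A | B | |
| 59 | D | D | |
| 60 | D | A | |
| 61 | B | A | |
| 62 | D | D | |
| 63 | B | B | |
| 64 | D | D | |
| 65 | B | A | |
| 66 | A | A | |
| 67 | A | B | |
| 68 | B | B | |
| 69 | A | B | |
| 70 | A | A | |
| 71 | D | D | |
| 72 | C | A | |
| 73 | C | D | |
| 74 | C | C | |
| 75 | A | A | |
| 76 | C | B | |
| 77 | B | B | |
| 78 | B | B | |
| 79 | A | C | |
| 80 | | C | |
| 81 | C | C | |
| 82 | A | D | |
| 83 | A | B | |
| 84 | B | D | |
| 85 | D | C | |
| 86 | A | C | |
| 87 | A | A | |
| 88 | D | D | |
| 89 | B | B | |
| 90 | A | A | |
| 91 | C | B | |
| 92 | D | C | |
| 93 | B | B | |
| 94 | C | C | |
| 95 | D | A | |
| 96 | C | C | |
| 97 | C | D | |
| 98 | B | C | |
| 99 | A | A | |
| 100 | C | C | |
| 101 | C | B | |
| 102 | C | D | |
| 103 | B | A | |
| 104 | B | C | |
| 105 | C | A | |
| 106 | C | C | |
| 107 | B | B | |
| 108 | D | D | |
| 109 | A | B | |
| 110 | C | C | |
| 111 | A | A | |
| 112 | A | C | |
| 113 | A | B | |
| 114 | D | D | |
| 115 | A | D | |
| 116 | B | C | |
| 117 | A | A | |
| 118 | D | D | |
| 119 | C | C | |
| 120 | B | B | |
| 121 | C | D | |
| 122 | C | C | |
| 123 | C | C | |
| 124 | C | C | |
| 125 | C | D | |
| 126 | D | D | |
| 127 | B | B | |
| 128 | D | D | |
| 129 | A | A | |
| 130 | D | D | |
| 131 | B | D | |
| 132 | D | A | |
| 133 | A | B | |
| 134 | D | C | |
| 135 | D | B | |
| 136 | C | C | |
| 137 | C | A | |
| 138 | A | D | |
| 139 | D | D | |
| 140 | D | B | |
| 141 | A | A | |
| 142 | A | A | |
| 143 | B | B | |
| 144 | B | B | |
| 145 | B | D | |
| 146 | C | C | |
| 147 | D | B | |
| 148 | A | A | |
| 149 | | A | |
| 150 | D | D | |
| 151 | A | A | |
| 152 | A | A | |
| 153 | B | B | |
| 154 | B | B | |
| 155 | B | B | |
| 156 | D | C | |
| 157 | A | A | |
| 158 | D | C | |
| 159 | C | C | |
| 160 | C | A | |
| 161 | A | A | |
| 162 | D | | Annulled |
| 163 | D | D | |
| 164 | C | C | |
| 165 | D | A | |
| 166 | B | B | |
| 167 | A | C | |
| 168 | D | D | |
| 169 | D | B | |
| 170 | A | B | |
| 171 | C | C | |
| 172 | B | A | |
| 173 | A | A | |
| 174 | B | B | |
| 175 | B | B | |
| 176 | C | C | |
| 177 | B | C | |
| 178 | B | A | |
| 179 | B | D | |
| 180 | A | A | |
| 181 | D | B | |
| 182 | C | C | |
| 183 | D | B | |
| 184 | B | B | |
| 185 | B | B | |
| 186 | B | | Annulled |
| 187 | C | C | |
| 188 | D | D | |
| 189 | B | D | |
| 190 | A | A | |
| 191 | C | B | |
| 192 | B | A | |
| 193 | C | C | |
| 194 | C | A | |
| 195 | A | A | |
| 196 | A | A | |
| 197 | B | B | |
| 198 | B | C | |
| 199 | D | D | |
| 200 | C | C | |
| 201 | B | B | |
| 202 | B | A | |
| 203 | D | D | |
| 204 | B | C | |
| 205 | B | B | |
| 206 | D | D | |
| 207 | A | A | |
| 208 | A | C | |
| 209 | C | C | |
| 210 | A | B | |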