MedicalBenchmark
AllenAI: Olmo 3.1 32B Instruct provider

Olmo 3.1 32B Instruct

251

#251 of 291 modelsMIR 2024

Net score

90.66 pts

Accuracy

58.0%

Correct / Incorrect

116 / 76

Total Cost

$0.11

Overall Performance

(vs. average)
Accuracy

58.0%

avg: 80.5%

Net score

90.66 pts

avg: 150.85 pts

Correct

116

avg: 161

Incorrect

76

avg: 30

Total Cost

$0.11

avg: $3.32

Average response time

12.8s

avg: 16.4s

Output Tokens

147K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

94.7%

avg: 95.4%

Subject Breakdown

Allergology
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
90.5%
Anesthesiology and Resuscitation
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
87.1%
Cardiology
Correct
14
Incorrect
5
Unanswered
2
Accuracy
66.7%
Average
79.7%
Dermatology
Correct
7
Incorrect
7
Unanswered
0
Accuracy
50.0%
Average
80.2%
Endocrinology and Nutrition
Correct
12
Incorrect
7
Unanswered
0
Accuracy
63.2%
Average
84.2%
ENT
Correct
2
Incorrect
4
Unanswered
1
Accuracy
28.6%
Average
74.4%
Epidemiology
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
89.3%
Gastroenterology
Correct
10
Incorrect
11
Unanswered
1
Accuracy
45.5%
Average
70.5%
Genetics
Correct
5
Incorrect
2
Unanswered
0
Accuracy
71.4%
Average
86.5%
Geriatrics
Correct
7
Incorrect
3
Unanswered
0
Accuracy
70.0%
Average
86.9%
Gynecology and Obstetrics
Correct
8
Incorrect
6
Unanswered
0
Accuracy
57.1%
Average
81.2%
Health Planning and Management
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
73.2%
Hematology
Correct
11
Incorrect
2
Unanswered
0
Accuracy
84.6%
Average
81.5%
Immunology
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
89.1%
Infectious Diseases
Correct
10
Incorrect
11
Unanswered
2
Accuracy
43.5%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
14
Incorrect
7
Unanswered
0
Accuracy
66.7%
Average
80.2%
Nephrology
Correct
10
Incorrect
3
Unanswered
0
Accuracy
76.9%
Average
80.8%
Neurology
Correct
10
Incorrect
11
Unanswered
1
Accuracy
45.5%
Average
83.7%
Ophthalmology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
80.0%
Palliative Care
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
88.2%
Pediatrics
Correct
9
Incorrect
7
Unanswered
1
Accuracy
52.9%
Average
82.0%
Pharmacology
Correct
18
Incorrect
5
Unanswered
0
Accuracy
78.3%
Average
85.4%
Psychiatry
Correct
9
Incorrect
1
Unanswered
0
Accuracy
90.0%
Average
89.5%
Pulmonology
Correct
10
Incorrect
8
Unanswered
1
Accuracy
52.6%
Average
80.6%
Radiology-Emergency
Correct
6
Incorrect
4
Unanswered
4
Accuracy
42.9%
Average
64.9%
Rheumatology
Correct
5
Incorrect
9
Unanswered
0
Accuracy
35.7%
Average
81.4%
Statistics
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.1%
Traumatology
Correct
5
Incorrect
8
Unanswered
2
Accuracy
33.3%
Average
74.5%
Urology
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
79.8%
Biostatistics
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
90.7%
Diagnosis
Correct
37
Incorrect
31
Unanswered
5
Accuracy
50.7%
Average
79.2%
Epidemiology
Correct
10
Incorrect
2
Unanswered
0
Accuracy
83.3%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
16
Incorrect
15
Unanswered
6
Accuracy
43.2%
Average
69.6%
Pathophysiology
Correct
25
Incorrect
8
Unanswered
0
Accuracy
75.8%
Average
85.4%
Pharmacology
Correct
15
Incorrect
10
Unanswered
0
Accuracy
60.0%
Average
84.0%
Prevention
Correct
8
Incorrect
4
Unanswered
0
Accuracy
66.7%
Average
89.8%
Prognosis
Correct
5
Incorrect
2
Unanswered
0
Accuracy
71.4%
Average
83.9%
Risk
Correct
11
Incorrect
2
Unanswered
0
Accuracy
84.6%
Average
83.6%
Tests
Correct
9
Incorrect
11
Unanswered
1
Accuracy
42.9%
Average
73.9%
Treatment
Correct
37
Incorrect
32
Unanswered
2
Accuracy
52.1%
Average
81.3%
#AnswerCorrectStatus
1AB
2D
3DB
4AC
5CC
6CB
7D
8CC
9BA
10DD
11D
12AA
13C
14BA
15DB
16AA
17CC
18A
19BB
20CC
21D
22CB
23AA
24BA
25AC
26BB
27CC
28AA
29BB
30DC
31DD
32AA
33CC
34BB
35DD
36DD
37BA
38DA
39CC
40BB
41CC
42DD
43AA
44DD
45DD
46BB
47CC
48AC
49BB
50CC
51CA
52DD
53CC
54BB
55CC
56DD
57DA
58AA
59BA
60CA
61CA
62BD
63CD
64BAnnulled
65DD
66CC
67AB
68CAnnulled
69AA
70CB
71CB
72CD
73B
74AC
75CB
76AA
77AD
78CC
79CB
80AA
81CC
82CC
83BB
84CC
85AA
86AA
87BB
88DD
89BB
90AA
91CD
92CA
93AC
94BB
95BD
96BB
97CB
98CB
99AA
100BB
101BA
102BD
103BB
104AD
105BB
106DC
107BC
108BB
109AD
110CD
111BB
112CC
113BAnnulled
114BD
115DD
116DA
117DD
118DD
119AA
120CC
121AA
122AB
123DD
124CD
125CB
126AD
127AA
128DB
129DD
130CC
131BC
132CD
133CA
134CC
135DA
136AD
137AA
138BC
139AA
140AC
141BB
142BC
143AA
144DD
145BC
146BC
147CC
148AA
149AC
150DD
151AA
152AA
153AC
154BB
155DD
156CC
157CC
158DD
159DD
160CB
161BB
162BB
163DB
164AB
165CA
166CC
167AA
168CB
169CC
170CA
171DD
172BB
173DA
174BB
175AA
176CC
177CC
178BB
179BC
180AAnnulled
181B
182DD
183BC
184AA
185CC
186DD
187AA
188CC
189CD
190DD
191BB
192BB
193AC
194CC
195CC
196BB
197DA
198BB
199CD
200AA
201AB
202DD
203BB
204CD
205AD
206BAnnulled
207DA
208AA
209BB
210AD