MedicalBenchmark
AllenAI: Olmo 3 7B Instruct

#280 of 290 models, MIR 2025

Net score: 23.66 pts
Accuracy: 32.5%
Correct / Incorrect: 65 / 124
Total Cost: $0.05

Overall Performance (vs. average)

| Metric | Olmo 3 7B Instruct | Average |
|---|---|---|
| Accuracy | 32.5% | 75.9% |
| Net score | 23.66 pts | 138.99 pts |
| Correct | 65 | 152 |
| Incorrect | 124 | 38 |
| Total Cost | $0.05 | $3.59 |
| Average response time | 19.3s | 18.1s |
| Output Tokens | 187K | 443K |
| Reasoning Tokens | 0 | 320K |
| Average confidence | 84.0% | 94.7% |
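
The net score above is consistent with negative marking of the kind used in the MIR exam, where each wrong answer costs a third of a point. The page itself does not state the formula, so treat that rule as an assumption. A minimal sketch that recomputes the headline figures from the reported counts (the 200-question denominator is inferred from 65 correct at 32.5%, and the function names are illustrative):

```python
def net_score(correct: int, incorrect: int, penalty: float = 1 / 3) -> float:
    """Net score under negative marking (assumed rule: -1/3 point per wrong answer)."""
    return correct - incorrect * penalty


def accuracy(correct: int, total: int) -> float:
    """Share of scored questions answered correctly, as a percentage."""
    return 100 * correct / total


# Counts reported on this page; total=200 is inferred, not stated.
score = net_score(correct=65, incorrect=124)
acc = accuracy(correct=65, total=200)
print(f"net score: {score:.2f} pts")
print(f"accuracy:  {acc:.1f}%")
```

Under these assumptions the sketch gives roughly 23.67 pts, which lines up with the reported 23.66 pts if the page truncates instead of rounding.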

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average accuracy |
|---|---|---|---|---|---|
| Allergology | 1 | 3 | 0 | 25.0% | 86.9% |
| Anesthesiology and Resuscitation | 1 | 5 | 0 | 16.7% | 81.3% |
| Cardiology | 6 | 14 | 2 | 27.3% | 77.4% |
| Dermatology | 3 | 10 | 0 | 23.1% | 62.8% |
| Endocrinology and Nutrition | 6 | 10 | 0 | 37.5% | 82.5% |
| ENT | 1 | 6 | 1 | 12.5% | 73.8% |
| Epidemiology | 2 | 5 | 0 | 28.6% | 67.1% |
| Gastroenterology | 8 | 10 | 3 | 38.1% | 72.9% |
| Genetics | 1 | 5 | 0 | 16.7% | 68.2% |
| Geriatrics | 3 | 8 | 0 | 27.3% | 71.2% |
| Gynecology and Obstetrics | 10 | 7 | 2 | 52.6% | 85.9% |
| Health Planning and Management | 0 | 2 | 0 | 0.0% | 81.6% |
| Hematology | 1 | 8 | 2 | 9.1% | 81.8% |
| Immunology | 5 | 4 | 0 | 55.6% | 82.5% |
| Infectious Diseases | 8 | 18 | 2 | 28.6% | 71.1% |
| Legal Medicine and Bioethics | 1 | 4 | 0 | 20.0% | 67.2% |
| Medical Oncology | 10 | 13 | 2 | 40.0% | 86.3% |
| Nephrology | 6 | 9 | 0 | 40.0% | 78.2% |
| Neurology | 7 | 11 | 2 | 35.0% | 76.2% |
| Ophthalmology | 1 | 4 | 0 | 20.0% | 72.6% |
| Palliative Care | 0 | 4 | 0 | 0.0% | 77.2% |
| Pediatrics | 5 | 18 | 2 | 20.0% | 72.7% |
| Pharmacology | 7 | 10 | 0 | 41.2% | 73.1% |
| Psychiatry | 3 | 5 | 0 | 37.5% | 82.0% |
| Pulmonology | 5 | 8 | 1 | 35.7% | 73.0% |
| Radiology-Emergency | 3 | 11 | 0 | 21.4% | 67.9% |
| Rheumatology | 2 | 12 | 0 | 14.3% | 74.6% |
| Statistics | 1 | 2 | 0 | 33.3% | 74.9% |
| Traumatology | 7 | 10 | 1 | 38.9% | 78.2% |
| Urology | 3 | 4 | 0 | 42.9% | 79.5% |
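
The per-subject accuracy appears to count unanswered questions in the denominator: Cardiology's 6 correct out of 6 + 14 + 2 = 22 questions gives 27.3%, as listed. A small sketch under that assumption, which recomputes accuracy for a few rows copied from the subject breakdown and sorts them by the shortfall against the average (the `SubjectRow` type and `row_accuracy` helper are illustrative names, not part of the benchmark):

```python
from typing import NamedTuple


class SubjectRow(NamedTuple):
    subject: str
    correct: int
    incorrect: int
    unanswered: int
    average: float  # average accuracy (%) as reported on the page


def row_accuracy(row: SubjectRow) -> float:
    """Accuracy in percent, with unanswered questions counted in the total (assumed)."""
    total = row.correct + row.incorrect + row.unanswered
    return 100 * row.correct / total if total else 0.0


# A few rows copied from the subject breakdown.
rows = [
    SubjectRow("Cardiology", 6, 14, 2, 77.4),
    SubjectRow("Hematology", 1, 8, 2, 81.8),
    SubjectRow("Immunology", 5, 4, 0, 82.5),
    SubjectRow("Gynecology and Obstetrics", 10, 7, 2, 85.9),
]

# Largest shortfall against the average first.
by_gap = sorted(rows, key=lambda r: r.average - row_accuracy(r), reverse=True)
for r in by_gap:
    acc = row_accuracy(r)
    print(f"{r.subject:<28} {acc:5.1f}%  gap {r.average - acc:5.1f} pts")
```

Sorting by the gap instead of by raw accuracy helps separate subjects that are hard for every model from those where this model in particular falls behind.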

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average accuracy |
|---|---|---|---|---|---|
| Anatomy | 2 | 4 | 1 | 28.6% | 77.1% |
| Biostatistics | 1 | 3 | 0 | 25.0% | 78.4% |
| Diagnosis | 30 | 54 | 5 | 33.7% | 77.9% |
| Epidemiology | 0 | 4 | 1 | 0.0% | 75.0% |
| Ethics | 0 | 3 | 0 | 0.0% | 72.0% |
| Interpretation | 14 | 25 | 3 | 33.3% | 69.3% |
| Legal | 1 | 3 | 0 | 25.0% | 63.6% |
| Pathophysiology | 9 | 16 | 2 | 33.3% | 72.6% |
| Pharmacology | 7 | 6 | 0 | 53.8% | 82.4% |
| Prevention | 7 | 5 | 0 | 58.3% | 74.5% |
| Prognosis | 0 | 6 | 0 | 0.0% | 77.8% |
| Risk | 1 | 4 | 0 | 20.0% | 84.3% |
| Tests | 8 | 17 | 1 | 30.8% | 76.3% |
| Treatment | 27 | 50 | 5 | 32.9% | 75.2% |

# Answer Correct Status
1 B B
2 D A
3 B C
4 C B
5 A A
6 C C
7 B C
8 A A
9 A A
10 B D
11 C D
12 D
13 B B
14 B D
15 D
16 A B
17 B B
18 C A
19 B C
20 B A
21 A B
22 A D
23 A C
24 D
25 C C
26 B Annulled
27 A C
28 A Annulled
29 B D
30 B B
31 A D
32 A A
33 C D
34 C D
35 B B
36 B D
37 D C
38 A C
39 D D
40 D A
41 D D
42 A C
43 D B
44 C D
45 B D
46 C A
47 A A
48 D A
49 B D
50 A B
51 C C
52 C B
53 B D
54 D B
55 C A
56 B Annulled
57 A C
58 B B
59 D
60 A A
61 B A
62 D D
63 B B
64 B D
65 D A
66 A A
67 A B
68 B B
69 B B
70 A
71 C D
72 A A
73 B D
74 D C
75 A A
76 B B
77 D B
78 B B
79 B C
80 B C
81 A C
82 C D
83 C B
84 D D
85 C C
86 A C
87 B A
88 A D
89 B B
90 A
91 A B
92 A C
93 A B
94 B C
95 B A
96 C C
97 C D
98 D C
99 A A
100 C
101 D B
102 C D
103 B A
104 B C
105 D A
106 D C
107 C B
108 B D
109 B B
110 B C
111 A A
112 C
113 A B
114 A D
115 D D
116 B C
117 A A
118 D D
119 A C
120 B B
121 D
122 C C
123 C C
124 B C
125 C D
126 A D
127 A B
128 D D
129 B A
130 D D
131 C D
132 A A
133 C B
134 C C
135 C B
136 C C
137 C A
138 D D
139 A D
140 C B
141 A A
142 A
143 B B
144 B
145 B D
146 A C
147 A B
148 D A
149 C A
150 A A
151 D A
152 D A
153 A B
154 B B
155 A B
156 D C
157 A A
158 A C
159 A C
160 C A
161 C A
162 C
163 C D
164 C C
165 D A
166 C B
167 A C
168 D D
169 A B
170 A B
171 C C
172 B A
173 C A
174 D B
175 D B
176 C C
177 B C
178 D A
179 B D
180 A A
181 D B
182 B C
183 B B
184 B B
185 D B
186 C Annulled
187 C C
188 D D
189 B D
190 A A
191 D B
192 B A
193 C C
194 C A
195 A A
196 B A
197 B B
198 B C
199 B D
200 C C
201 B
202 B A
203 B D
204 C C
205 D B
206 D D
207 D A
208 B C
209 C C
210 B B
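
To cross-check the per-question listing against the summary counts, each row can be parsed into its question number, its letters, and an optional status. The sketch below is one hedged way to do that in Python. It accepts rows with or without spaces between fields, and it deliberately does not guess which column a lone letter belongs to, since the page does not say whether that letter is the model's answer or the answer key. The names `AnswerRow` and `parse_row` are illustrative.

```python
import re
from typing import NamedTuple, Optional

# Question number, up to two option letters, then an optional "Annulled" flag.
ROW = re.compile(r"(\d+)\s*([A-D])?\s*([A-D])?\s*(Annulled)?")


class AnswerRow(NamedTuple):
    number: int
    letters: str           # one or two letters, in the order shown on the page
    status: Optional[str]  # "Annulled" or None

    @property
    def is_match(self) -> bool:
        """True only when two letters are present and they agree."""
        return len(self.letters) == 2 and self.letters[0] == self.letters[1]


def parse_row(text: str) -> AnswerRow:
    m = ROW.fullmatch(text.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {text!r}")
    number, first, second, status = m.groups()
    return AnswerRow(int(number), (first or "") + (second or ""), status)


# A handful of rows from the listing, in their fused form.
sample = ["1BB", "2DA", "12D", "26BAnnulled", "204CC"]
parsed = [parse_row(s) for s in sample]
matches = [r.number for r in parsed if r.is_match and r.status is None]
print(matches)
```

The literal `Annulled` in the pattern matters: because the word starts with an `A`, a generic trailing-word group would let the regex read that `A` as a second option letter.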