MedicalBenchmark
AllenAI: Olmo 3 7B Instruct provider

Olmo 3 7B Instruct

285

#285 of 291 modelsMIR 2024

Net score

25.66 pts

Accuracy

33.5%

Correct / Incorrect

67 / 124

Total Cost

$0.05

Overall Performance

(vs. average)
Accuracy

33.5%

avg: 80.5%

Net score

25.66 pts

avg: 150.85 pts

Correct

67

avg: 161

Incorrect

124

avg: 30

Total Cost

$0.05

avg: $3.32

Average response time

18.5s

avg: 16.4s

Output Tokens

177K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

88.7%

avg: 95.4%

Subject Breakdown

Allergology
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
90.5%
Anesthesiology and Resuscitation
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
87.1%
Cardiology
Correct
9
Incorrect
12
Unanswered
0
Accuracy
42.9%
Average
79.7%
Dermatology
Correct
3
Incorrect
10
Unanswered
1
Accuracy
21.4%
Average
80.2%
Endocrinology and Nutrition
Correct
6
Incorrect
12
Unanswered
1
Accuracy
31.6%
Average
84.2%
ENT
Correct
2
Incorrect
3
Unanswered
2
Accuracy
28.6%
Average
74.4%
Epidemiology
Correct
5
Incorrect
3
Unanswered
0
Accuracy
62.5%
Average
89.3%
Gastroenterology
Correct
9
Incorrect
13
Unanswered
0
Accuracy
40.9%
Average
70.5%
Genetics
Correct
3
Incorrect
3
Unanswered
1
Accuracy
42.9%
Average
86.5%
Geriatrics
Correct
6
Incorrect
3
Unanswered
1
Accuracy
60.0%
Average
86.9%
Gynecology and Obstetrics
Correct
5
Incorrect
8
Unanswered
1
Accuracy
35.7%
Average
81.2%
Health Planning and Management
Correct
0
Incorrect
2
Unanswered
0
Accuracy
0.0%
Average
73.2%
Hematology
Correct
4
Incorrect
9
Unanswered
0
Accuracy
30.8%
Average
81.5%
Immunology
Correct
3
Incorrect
4
Unanswered
1
Accuracy
37.5%
Average
89.1%
Infectious Diseases
Correct
6
Incorrect
17
Unanswered
0
Accuracy
26.1%
Average
81.8%
Legal Medicine and Bioethics
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
91.7%
Medical Oncology
Correct
11
Incorrect
6
Unanswered
4
Accuracy
52.4%
Average
80.2%
Nephrology
Correct
4
Incorrect
9
Unanswered
0
Accuracy
30.8%
Average
80.8%
Neurology
Correct
6
Incorrect
14
Unanswered
2
Accuracy
27.3%
Average
83.7%
Ophthalmology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
80.0%
Palliative Care
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
88.2%
Pediatrics
Correct
4
Incorrect
12
Unanswered
1
Accuracy
23.5%
Average
82.0%
Pharmacology
Correct
6
Incorrect
15
Unanswered
2
Accuracy
26.1%
Average
85.4%
Psychiatry
Correct
5
Incorrect
5
Unanswered
0
Accuracy
50.0%
Average
89.5%
Pulmonology
Correct
6
Incorrect
13
Unanswered
0
Accuracy
31.6%
Average
80.6%
Radiology-Emergency
Correct
4
Incorrect
9
Unanswered
1
Accuracy
28.6%
Average
64.9%
Rheumatology
Correct
2
Incorrect
12
Unanswered
0
Accuracy
14.3%
Average
81.4%
Statistics
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
91.1%
Traumatology
Correct
5
Incorrect
10
Unanswered
0
Accuracy
33.3%
Average
74.5%
Urology
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
1
Incorrect
5
Unanswered
0
Accuracy
16.7%
Average
79.8%
Biostatistics
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
90.7%
Diagnosis
Correct
27
Incorrect
45
Unanswered
1
Accuracy
37.0%
Average
79.2%
Epidemiology
Correct
5
Incorrect
5
Unanswered
2
Accuracy
41.7%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
8
Incorrect
27
Unanswered
2
Accuracy
21.6%
Average
69.6%
Pathophysiology
Correct
7
Incorrect
23
Unanswered
3
Accuracy
21.2%
Average
85.4%
Pharmacology
Correct
8
Incorrect
16
Unanswered
1
Accuracy
32.0%
Average
84.0%
Prevention
Correct
5
Incorrect
6
Unanswered
1
Accuracy
41.7%
Average
89.8%
Prognosis
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
83.9%
Risk
Correct
6
Incorrect
7
Unanswered
0
Accuracy
46.2%
Average
83.6%
Tests
Correct
8
Incorrect
12
Unanswered
1
Accuracy
38.1%
Average
73.9%
Treatment
Correct
26
Incorrect
42
Unanswered
3
Accuracy
36.6%
Average
81.3%
#AnswerCorrectStatus
1AB
2AD
3BB
4BC
5C
6AB
7CD
8CC
9BA
10BD
11BD
12AA
13BC
14CA
15CB
16DA
17DC
18CA
19BB
20CC
21CD
22AB
23AA
24A
25BC
26BB
27AC
28DA
29AB
30AC
31AD
32BA
33C
34DB
35DD
36BD
37BA
38DA
39CC
40CB
41BC
42AD
43CA
44AD
45DD
46BB
47CC
48CC
49B
50BC
51DA
52CD
53AC
54CB
55CC
56D
57BA
58CA
59AA
60BA
61CA
62CD
63DD
64Annulled
65AD
66CC
67CB
68CAnnulled
69AA
70BB
71BB
72BD
73DB
74DC
75BB
76BA
77CD
78CC
79AB
80BA
81AC
82AC
83BB
84DC
85BA
86AA
87DB
88DD
89BB
90DA
91AD
92DA
93BC
94AB
95BD
96BB
97BB
98DB
99AA
100CB
101AA
102DD
103BB
104CD
105BB
106AC
107BC
108DB
109BD
110CD
111CB
112CC
113BAnnulled
114BD
115DD
116DA
117DD
118DD
119CA
120AC
121BA
122DB
123AD
124CD
125DB
126BD
127CA
128DB
129DD
130CC
131BC
132DD
133AA
134BC
135CA
136BD
137CA
138CC
139AA
140BC
141CB
142CC
143BA
144AD
145CC
146AC
147CC
148BA
149DC
150DD
151AA
152BA
153AC
154BB
155AD
156AC
157CC
158DD
159DD
160CB
161DB
162B
163DB
164AB
165AA
166DC
167CA
168CB
169DC
170BA
171AD
172CB
173BA
174BB
175AA
176AC
177BC
178BB
179BC
180AAnnulled
181AB
182DD
183DC
184DA
185CC
186CD
187A
188CC
189DD
190D
191BB
192DB
193DC
194CC
195CC
196BB
197CA
198BB
199CD
200A
201CB
202DD
203BB
204CD
205DD
206BAnnulled
207CA
208A
209BB
210CD