MedicalBenchmark
Anthropic: Claude 3 Haiku provider

Claude 3 Haiku

253

#253 of 319 modelsMIR 2025

Net score

114.00 pts

Accuracy

67.0%

Correct / Incorrect

134 / 60

Total Cost

$0.14

Overall Performance

(vs. average)
Accuracy

67.0%

avg: 77.9%

Net score

114.00 pts

avg: 143.96 pts

Correct

134

avg: 156

Incorrect

60

avg: 35

Total Cost

$0.14

avg: $3.36

Average response time

3.5s

avg: 19.0s

Output Tokens

86K

avg: 430K

Reasoning Tokens

0

avg: 306K

Average confidence

96.4%

avg: 95.2%

Subject Breakdown

Allergology
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
87.9%
Anesthesiology and Resuscitation
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
82.3%
Cardiology
Correct
14
Incorrect
8
Unanswered
0
Accuracy
63.6%
Average
78.6%
Dermatology
Correct
9
Incorrect
3
Unanswered
0
Accuracy
75.0%
Average
69.4%
Endocrinology and Nutrition
Correct
12
Incorrect
4
Unanswered
0
Accuracy
75.0%
Average
83.5%
ENT
Correct
5
Incorrect
3
Unanswered
0
Accuracy
62.5%
Average
74.8%
Epidemiology
Correct
3
Incorrect
3
Unanswered
1
Accuracy
42.9%
Average
69.1%
Gastroenterology
Correct
13
Incorrect
7
Unanswered
1
Accuracy
61.9%
Average
74.1%
Genetics
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
69.5%
Geriatrics
Correct
5
Incorrect
5
Unanswered
1
Accuracy
45.5%
Average
77.5%
Gynecology and Obstetrics
Correct
16
Incorrect
3
Unanswered
0
Accuracy
84.2%
Average
86.7%
Health Planning and Management
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
82.6%
Hematology
Correct
6
Incorrect
3
Unanswered
2
Accuracy
54.5%
Average
82.7%
Immunology
Correct
7
Incorrect
2
Unanswered
0
Accuracy
77.8%
Average
83.3%
Infectious Diseases
Correct
15
Incorrect
11
Unanswered
1
Accuracy
55.6%
Average
74.9%
Legal Medicine and Bioethics
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
68.4%
Medical Oncology
Correct
20
Incorrect
3
Unanswered
2
Accuracy
80.0%
Average
87.2%
Nephrology
Correct
11
Incorrect
2
Unanswered
1
Accuracy
78.6%
Average
84.8%
Neurology
Correct
12
Incorrect
8
Unanswered
0
Accuracy
60.0%
Average
77.3%
Ophthalmology
Correct
3
Incorrect
1
Unanswered
1
Accuracy
60.0%
Average
74.2%
Palliative Care
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
78.6%
Pediatrics
Correct
17
Incorrect
9
Unanswered
0
Accuracy
65.4%
Average
71.9%
Pharmacology
Correct
9
Incorrect
7
Unanswered
1
Accuracy
52.9%
Average
74.1%
Psychiatry
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
83.0%
Pulmonology
Correct
11
Incorrect
3
Unanswered
0
Accuracy
78.6%
Average
80.4%
Radiology-Emergency
Correct
9
Incorrect
5
Unanswered
0
Accuracy
64.3%
Average
69.4%
Rheumatology
Correct
11
Incorrect
4
Unanswered
0
Accuracy
73.3%
Average
76.6%
Statistics
Correct
2
Incorrect
0
Unanswered
1
Accuracy
66.7%
Average
76.6%
Traumatology
Correct
13
Incorrect
5
Unanswered
0
Accuracy
72.2%
Average
79.3%
Urology
Correct
5
Incorrect
1
Unanswered
1
Accuracy
71.4%
Average
80.7%

Question Type Breakdown

Anatomy
Correct
3
Incorrect
3
Unanswered
1
Accuracy
42.9%
Average
78.6%
Biostatistics
Correct
3
Incorrect
0
Unanswered
1
Accuracy
75.0%
Average
79.8%
Diagnosis
Correct
59
Incorrect
27
Unanswered
2
Accuracy
67.0%
Average
79.9%
Epidemiology
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
76.7%
Ethics
Correct
0
Incorrect
3
Unanswered
0
Accuracy
0.0%
Average
74.1%
Interpretation
Correct
26
Incorrect
14
Unanswered
2
Accuracy
61.9%
Average
70.7%
Legal
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
64.6%
Pathophysiology
Correct
20
Incorrect
6
Unanswered
1
Accuracy
74.1%
Average
76.1%
Pharmacology
Correct
8
Incorrect
3
Unanswered
2
Accuracy
61.5%
Average
83.3%
Prevention
Correct
7
Incorrect
5
Unanswered
0
Accuracy
58.3%
Average
75.6%
Prognosis
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
80.8%
Risk
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
85.2%
Tests
Correct
17
Incorrect
10
Unanswered
0
Accuracy
63.0%
Average
77.9%
Treatment
Correct
51
Incorrect
28
Unanswered
2
Accuracy
63.0%
Average
77.3%
#AnswerCorrectStatus
1BB
2CA
3CC
4AB
5AA
6CC
7CC
8AA
9CA
10DD
11DD
12DD
13AB
14DD
15BAnnulled
16CB
17BB
18AA
19CC
20AA
21B
22CD
23AC
24DD
25CC
26Annulled
27DC
28DAnnulled
29D
30BB
31DD
32AA
33DD
34DD
35BB
36DD
37AC
38CC
39DD
40AA
41DD
42CC
43CB
44DD
45D
46AA
47AA
48AA
49DD
50BB
51CC
52BB
53D
54BB
55CA
56BAnnulled
57CC
58BB
59DD
60BA
61AA
62DD
63BB
64DD
65AA
66AA
67BB
68AB
69AB
70AA
71DD
72AA
73DD
74CC
75AA
76CB
77BB
78BB
79AC
80CC
81CC
82CD
83BB
84DD
85DC
86CC
87AA
88DD
89CB
90BA
91CB
92DC
93BB
94CC
95AA
96CC
97DD
98DC
99AA
100DC
101BB
102CD
103CA
104CC
105AA
106CC
107BB
108DD
109BB
110CC
111AA
112CC
113BB
114AD
115CD
116CC
117AA
118DD
119C
120AB
121CD
122AC
123CC
124CC
125BD
126BD
127BB
128DD
129DA
130DD
131BD
132AA
133BB
134CC
135BB
136CC
137CA
138DD
139DD
140BB
141AA
142A
143BB
144BB
145DD
146CC
147BB
148BA
149DA
150DD
151DA
152AA
153BB
154BB
155BB
156CC
157AA
158BC
159CC
160CA
161CA
162Annulled
163DD
164AC
165DA
166CB
167CC
168DD
169DB
170CB
171CC
172BA
173DA
174BB
175BB
176CC
177BC
178BA
179DD
180AA
181BB
182BC
183DB
184BB
185DB
186DAnnulled
187CC
188DD
189CD
190AA
191CB
192AA
193CC
194AA
195AA
196AA
197BB
198BC
199DD
200CC
201DB
202AA
203DD
204DC
205BB
206CD
207AA
208CC
209CC
210BB