MedicalBenchmark
Anthropic: Claude 3 Haiku provider

Claude 3 Haiku

221

#221 of 291 modelsMIR 2024

Net score

138.00 pts

Accuracy

76.0%

Correct / Incorrect

152 / 42

Total Cost

$0.13

Overall Performance

(vs. average)
Accuracy

76.0%

avg: 80.5%

Net score

138.00 pts

avg: 150.85 pts

Correct

152

avg: 161

Incorrect

42

avg: 30

Total Cost

$0.13

avg: $3.32

Average response time

3.4s

avg: 16.4s

Output Tokens

79K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

97.7%

avg: 95.4%

Subject Breakdown

Allergology
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
90.5%
Anesthesiology and Resuscitation
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
87.1%
Cardiology
Correct
15
Incorrect
5
Unanswered
1
Accuracy
71.4%
Average
79.7%
Dermatology
Correct
12
Incorrect
2
Unanswered
0
Accuracy
85.7%
Average
80.2%
Endocrinology and Nutrition
Correct
14
Incorrect
4
Unanswered
1
Accuracy
73.7%
Average
84.2%
ENT
Correct
5
Incorrect
2
Unanswered
0
Accuracy
71.4%
Average
74.4%
Epidemiology
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
89.3%
Gastroenterology
Correct
15
Incorrect
6
Unanswered
1
Accuracy
68.2%
Average
70.5%
Genetics
Correct
5
Incorrect
2
Unanswered
0
Accuracy
71.4%
Average
86.5%
Geriatrics
Correct
9
Incorrect
1
Unanswered
0
Accuracy
90.0%
Average
86.9%
Gynecology and Obstetrics
Correct
12
Incorrect
2
Unanswered
0
Accuracy
85.7%
Average
81.2%
Health Planning and Management
Correct
0
Incorrect
2
Unanswered
0
Accuracy
0.0%
Average
73.2%
Hematology
Correct
9
Incorrect
4
Unanswered
0
Accuracy
69.2%
Average
81.5%
Immunology
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
89.1%
Infectious Diseases
Correct
16
Incorrect
5
Unanswered
2
Accuracy
69.6%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
16
Incorrect
4
Unanswered
1
Accuracy
76.2%
Average
80.2%
Nephrology
Correct
12
Incorrect
1
Unanswered
0
Accuracy
92.3%
Average
80.8%
Neurology
Correct
19
Incorrect
2
Unanswered
1
Accuracy
86.4%
Average
83.7%
Ophthalmology
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
80.0%
Palliative Care
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
88.2%
Pediatrics
Correct
13
Incorrect
2
Unanswered
2
Accuracy
76.5%
Average
82.0%
Pharmacology
Correct
18
Incorrect
5
Unanswered
0
Accuracy
78.3%
Average
85.4%
Psychiatry
Correct
10
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
89.5%
Pulmonology
Correct
13
Incorrect
4
Unanswered
2
Accuracy
68.4%
Average
80.6%
Radiology-Emergency
Correct
10
Incorrect
4
Unanswered
0
Accuracy
71.4%
Average
64.9%
Rheumatology
Correct
11
Incorrect
3
Unanswered
0
Accuracy
78.6%
Average
81.4%
Statistics
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.1%
Traumatology
Correct
9
Incorrect
6
Unanswered
0
Accuracy
60.0%
Average
74.5%
Urology
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
3
Unanswered
1
Accuracy
33.3%
Average
79.8%
Biostatistics
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
90.7%
Diagnosis
Correct
55
Incorrect
16
Unanswered
2
Accuracy
75.3%
Average
79.2%
Epidemiology
Correct
7
Incorrect
4
Unanswered
1
Accuracy
58.3%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
26
Incorrect
11
Unanswered
0
Accuracy
70.3%
Average
69.6%
Pathophysiology
Correct
29
Incorrect
3
Unanswered
1
Accuracy
87.9%
Average
85.4%
Pharmacology
Correct
18
Incorrect
7
Unanswered
0
Accuracy
72.0%
Average
84.0%
Prevention
Correct
8
Incorrect
2
Unanswered
2
Accuracy
66.7%
Average
89.8%
Prognosis
Correct
7
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
83.9%
Risk
Correct
12
Incorrect
1
Unanswered
0
Accuracy
92.3%
Average
83.6%
Tests
Correct
13
Incorrect
6
Unanswered
2
Accuracy
61.9%
Average
73.9%
Treatment
Correct
53
Incorrect
15
Unanswered
3
Accuracy
74.6%
Average
81.3%
#AnswerCorrectStatus
1AB
2CD
3BB
4CC
5CC
6BB
7DD
8CC
9BA
10BD
11DD
12AA
13DC
14BA
15BB
16BA
17CC
18AA
19BB
20CC
21DD
22BB
23AA
24CA
25CC
26BB
27CC
28AA
29AB
30C
31DD
32AA
33CC
34CB
35DD
36DD
37AA
38DA
39CC
40BB
41CC
42DD
43DA
44DD
45DD
46BB
47CC
48CC
49BB
50CC
51CA
52DD
53CC
54BB
55CC
56DD
57DA
58AA
59AA
60AA
61AA
62CD
63DD
64DAnnulled
65DD
66CC
67DB
68BAnnulled
69AA
70BB
71BB
72DD
73CB
74CC
75B
76AA
77AD
78CC
79BB
80AA
81CC
82CC
83BB
84CC
85AA
86AA
87BB
88DD
89BB
90AA
91DD
92AA
93C
94BB
95AD
96BB
97BB
98DB
99AA
100BB
101DA
102DD
103BB
104DD
105BB
106CC
107CC
108BB
109DD
110CD
111CB
112CC
113DAnnulled
114DD
115DD
116AA
117DD
118DD
119AA
120CC
121AA
122DB
123DD
124D
125AB
126DD
127AA
128BB
129DD
130CC
131CC
132CD
133CA
134BC
135AA
136DD
137AA
138CC
139AA
140CC
141BB
142CC
143BA
144DD
145CC
146DC
147BC
148AA
149CC
150DD
151AA
152AA
153DC
154BB
155DD
156CC
157CC
158DD
159DD
160AB
161BB
162BB
163B
164BB
165CA
166CC
167CA
168BB
169CC
170AA
171DD
172BB
173AA
174BB
175AA
176CC
177DC
178DB
179CC
180Annulled
181BB
182DD
183CC
184DA
185CC
186DD
187A
188CC
189CD
190DD
191BB
192DB
193CC
194CC
195CC
196BB
197AA
198BB
199CD
200AA
201BB
202DD
203CB
204DD
205DD
206CAnnulled
207AA
208AA
209DB
210AD