MedicalBenchmark
Provider: Meta

Llama 3 8B Instruct

Rank: #263 of 290 models (MIR 2026)

Net score: 73.00 pts
Accuracy: 49.5%
Correct / Incorrect: 99 / 78
Total Cost: $0.01

Overall Performance (vs. average)

| Metric | Llama 3 8B Instruct | Average |
|---|---|---|
| Accuracy | 49.5% | 81.6% |
| Net score | 73.00 pts | 154.00 pts |
| Correct | 99 | 163 |
| Incorrect | 78 | 28 |
| Total Cost | $0.01 | $3.33 |
| Average response time | 11.4s | 16.2s |
| Output Tokens | 151K | 430K |
| Reasoning Tokens | 0 | 310K |
| Average confidence | 85.5% | 95.1% |
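The page does not state how the net score is derived. The figures above are consistent with a penalty of one third of a point per incorrect answer (99 - 78/3 = 73) and with accuracy measured over 200 scored questions (99/200 = 49.5%). The sketch below reproduces that arithmetic. Both the penalty value and the 200-question denominator are assumptions inferred from the reported numbers, and the function names are hypothetical.

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> float:
    """Correct answers minus a fractional penalty per incorrect answer.

    The 1/3 penalty is an assumption that matches the reported figures;
    unanswered questions neither add nor subtract points.
    """
    return float(correct - penalty * incorrect)


def accuracy(correct: int, scored_questions: int) -> float:
    """Percentage of scored questions answered correctly, to one decimal."""
    return round(100 * correct / scored_questions, 1)


# Reported counts for this model; 200 scored questions is an assumption.
score = net_score(99, 78)
acc = accuracy(99, 200)
print(f"net score: {score:.2f} pts")  # net score: 73.00 pts
print(f"accuracy: {acc}%")            # accuracy: 49.5%
```

Applying the same rule to the averaged counts (163 correct, 28 incorrect) gives roughly 153.67, close to the listed average of 154.00 pts. An average computed from rounded counts need not match the listed value exactly.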

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Avg. accuracy |
|---|---|---|---|---|---|
| Allergology | 1 | 0 | 0 | 100.0% | 96.9% |
| Anesthesiology and Resuscitation | 2 | 4 | 1 | 28.6% | 69.6% |
| Cardiology | 8 | 11 | 6 | 32.0% | 77.3% |
| Dermatology | 3 | 6 | 2 | 27.3% | 72.3% |
| Endocrinology and Nutrition | 8 | 6 | 1 | 53.3% | 84.0% |
| ENT | 5 | 2 | 1 | 62.5% | 84.7% |
| Epidemiology | 6 | 0 | 1 | 85.7% | 80.2% |
| Gastroenterology | 15 | 14 | 1 | 50.0% | 79.3% |
| Genetics | 4 | 5 | 2 | 36.4% | 78.7% |
| Geriatrics | 5 | 4 | 4 | 38.5% | 83.0% |
| Gynecology and Obstetrics | 5 | 5 | 2 | 41.7% | 84.3% |
| Health Planning and Management | 7 | 2 | 1 | 70.0% | 78.4% |
| Hematology | 2 | 5 | 2 | 22.2% | 76.6% |
| Immunology | 5 | 1 | 0 | 83.3% | 91.4% |
| Infectious Diseases | 7 | 7 | 0 | 50.0% | 77.9% |
| Legal Medicine and Bioethics | 9 | 1 | 1 | 81.8% | 82.9% |
| Medical Oncology | 14 | 8 | 1 | 60.9% | 83.0% |
| Nephrology | 6 | 4 | 0 | 60.0% | 85.1% |
| Neurology | 7 | 3 | 3 | 53.8% | 88.6% |
| Ophthalmology | 2 | 3 | 0 | 40.0% | 83.7% |
| Palliative Care | 6 | 0 | 0 | 100.0% | 80.2% |
| Pediatrics | 15 | 5 | 2 | 68.2% | 87.6% |
| Pharmacology | 3 | 8 | 0 | 27.3% | 78.6% |
| Psychiatry | 3 | 4 | 1 | 37.5% | 87.9% |
| Pulmonology | 7 | 8 | 1 | 43.8% | 82.8% |
| Radiology-Emergency | 3 | 8 | 2 | 23.1% | 67.7% |
| Rheumatology | 6 | 4 | 1 | 54.5% | 88.4% |
| Statistics | 1 | 0 | 1 | 50.0% | 83.8% |
| Traumatology | 3 | 6 | 2 | 27.3% | 65.2% |
| Urology | 7 | 1 | 0 | 87.5% | 82.5% |

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Avg. accuracy |
|---|---|---|---|---|---|
| Anatomy | 2 | 1 | 0 | 66.7% | 82.6% |
| Biostatistics | 1 | 0 | 1 | 50.0% | 83.8% |
| Diagnosis | 34 | 37 | 10 | 42.0% | 82.2% |
| Epidemiology | 8 | 0 | 1 | 88.9% | 88.7% |
| Ethics | 5 | 0 | 1 | 83.3% | 92.0% |
| Interpretation | 11 | 21 | 5 | 29.7% | 72.0% |
| Legal | 6 | 1 | 2 | 66.7% | 82.4% |
| Pathophysiology | 15 | 7 | 4 | 57.7% | 84.3% |
| Pharmacology | 6 | 8 | 1 | 40.0% | 82.3% |
| Prevention | 9 | 6 | 1 | 56.3% | 80.6% |
| Prognosis | 5 | 0 | 0 | 100.0% | 93.2% |
| Risk | 10 | 5 | 0 | 66.7% | 84.3% |
| Tests | 10 | 17 | 6 | 30.3% | 80.3% |
| Treatment | 33 | 33 | 6 | 45.8% | 80.1% |
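
| # | Answer | Correct | Status |
|---|---|---|---|
| 1 | B | A | |
| 2 | B | B | |
| 3 | A | D | |
| 4 | C | D | |
| 5 | B | B | |
| 6 | D | C | |
| 7 | A | B | |
| 8 | B | D | |
| 9 | | D | |
| 10 | B | A | |
| 11 | | B | |
| 12 | C | A | |
| 13 | A | | Annulled |
| 14 | A | C | |
| 15 | D | A | |
| 16 | C | C | |
| 17 | B | C | |
| 18 | B | D | |
| 19 | B | B | |
| 20 | D | B | |
| 21 | C | B | |
| 22 | A | C | |
| 23 | B | B | |
| 24 | D | A | |
| 25 | C | C | |
| 26 | C | A | |
| 27 | B | B | |
| 28 | D | D | |
| 29 | A | A | |
| 30 | D | D | |
| 31 | C | C | |
| 32 | C | C | |
| 33 | | B | |
| 34 | B | B | |
| 35 | B | C | |
| 36 | B | D | |
| 37 | C | D | |
| 38 | D | A | |
| 39 | A | C | |
| 40 | A | A | |
| 41 | D | D | |
| 42 | B | A | |
| 43 | B | B | |
| 44 | B | B | |
| 45 | B | B | |
| 46 | B | B | |
| 47 | D | D | |
| 48 | C | C | |
| 49 | C | C | |
| 50 | D | | Annulled |
| 51 | C | A | |
| 52 | A | C | |
| 53 | C | C | |
| 54 | A | C | |
| 55 | B | A | |
| 56 | C | A | |
| 57 | | A | |
| 58 | B | A | |
| 59 | A | B | |
| 60 | C | B | |
| 61 | | C | |
| 62 | C | B | |
| 63 | B | B | |
| 64 | C | | Annulled |
| 65 | C | C | |
| 66 | D | C | |
| 67 | A | C | |
| 68 | A | A | |
| 69 | C | B | |
| 70 | B | B | |
| 71 | B | B | |
| 72 | | B | |
| 73 | B | B | |
| 74 | A | A | |
| 75 | C | C | |
| 76 | C | C | |
| 77 | C | A | |
| 78 | D | D | |
| 79 | A | A | |
| 80 | A | D | |
| 81 | C | C | |
| 82 | D | D | |
| 83 | | B | |
| 84 | A | A | |
| 85 | D | D | |
| 86 | A | A | |
| 87 | B | B | |
| 88 | B | B | |
| 89 | | B | |
| 90 | C | C | |
| 91 | B | A | |
| 92 | C | C | |
| 93 | A | D | |
| 94 | A | C | |
| 95 | C | C | |
| 96 | C | B | |
| 97 | B | B | |
| 98 | A | A | |
| 99 | B | B | |
| 100 | C | C | |
| 101 | | C | |
| 102 | A | A | |
| 103 | C | C | |
| 104 | C | C | |
| 105 | C | C | |
| 106 | A | B | |
| 107 | D | C | |
| 108 | A | A | |
| 109 | A | C | |
| 110 | A | C | |
| 111 | A | B | |
| 112 | A | C | |
| 113 | | C | |
| 114 | A | D | |
| 115 | A | B | |
| 116 | C | D | |
| 117 | B | C | |
| 118 | D | B | |
| 119 | A | A | |
| 120 | | A | |
| 121 | | C | |
| 122 | B | B | |
| 123 | | B | |
| 124 | B | B | |
| 125 | C | C | |
| 126 | | B | |
| 127 | C | C | |
| 128 | B | B | |
| 129 | C | C | |
| 130 | C | C | |
| 131 | B | B | |
| 132 | C | B | |
| 133 | | C | |
| 134 | | B | |
| 135 | C | C | |
| 136 | C | B | |
| 137 | C | D | |
| 138 | B | B | |
| 139 | | | Annulled |
| 140 | C | D | |
| 141 | A | A | |
| 142 | C | | Annulled |
| 143 | B | A | |
| 144 | B | D | |
| 145 | C | C | |
| 146 | D | C | |
| 147 | B | B | |
| 148 | C | C | |
| 149 | B | B | |
| 150 | D | B | |
| 151 | B | B | |
| 152 | C | C | |
| 153 | B | C | |
| 154 | C | D | |
| 155 | | B | |
| 156 | D | A | |
| 157 | D | D | |
| 158 | B | B | |
| 159 | | C | |
| 160 | A | A | |
| 161 | | | Annulled |
| 162 | A | A | |
| 163 | A | C | |
| 164 | A | C | |
| 165 | | A | |
| 166 | A | A | |
| 167 | D | D | |
| 168 | C | C | |
| 169 | A | B | |
| 170 | B | B | |
| 171 | A | A | |
| 172 | | B | |
| 173 | A | D | |
| 174 | A | C | |
| 175 | B | B | |
| 176 | | A | |
| 177 | A | D | |
| 178 | B | D | |
| 179 | A | B | |
| 180 | B | C | |
| 181 | B | B | |
| 182 | C | C | |
| 183 | D | D | |
| 184 | D | D | |
| 185 | D | B | |
| 186 | D | D | |
| 187 | D | D | |
| 188 | C | C | |
| 189 | C | C | |
| 190 | A | B | |
| 191 | D | D | |
| 192 | A | D | |
| 193 | C | C | |
| 194 | C | C | |
| 195 | D | D | |
| 196 | D | A | |
| 197 | C | C | |
| 198 | B | A | |
| 199 | | B | |
| 200 | D | D | |
| 201 | C | C | |
| 202 | | A | |
| 203 | A | A | |
| 204 | C | C | |
| 205 | B | B | |
| 206 | B | C | |
| 207 | | A | |
| 208 | A | | Annulled |
| 209 | C | B | |
| 210 | C | C | |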