MedicalBenchmark
Llama 3.1 8B Instruct
Provider: Meta

Rank: #271 of 290 models (MIR 2026)

Net score: 63.33 pts
Accuracy: 43.0%
Correct / Incorrect: 86 / 68
Total cost: $0.02

Overall Performance (vs. average)

| Metric | Llama 3.1 8B Instruct | Average |
|---|---|---|
| Accuracy | 43.0% | 81.6% |
| Net score | 63.33 pts | 154.00 pts |
| Correct | 86 | 163 |
| Incorrect | 68 | 28 |
| Total cost | $0.02 | $3.33 |
| Average response time | 29.1s | 16.2s |
| Output tokens | 293K | 430K |
| Reasoning tokens | 0 | 310K |
| Average confidence | 78.6% | 95.1% |
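
The headline figures fit a simple negative-marking rule. The sketch below assumes +1 per correct answer, -1/3 per incorrect answer, and 0 for an unanswered question, over 200 scored questions. Neither the rule nor the question count is stated on this page; both are inferred from the reported numbers, and the names `net_score`, `accuracy_pct`, and `PENALTY` are illustrative.

```python
from fractions import Fraction

# Assumed scoring rule (not stated on this page): +1 per correct answer,
# -1/3 per incorrect answer, 0 for an unanswered question.
PENALTY = Fraction(1, 3)


def net_score(correct: int, incorrect: int) -> float:
    """Net score under the assumed +1 / -1/3 / 0 rule."""
    return float(correct - incorrect * PENALTY)


def accuracy_pct(correct: int, scored: int) -> float:
    """Share of scored questions answered correctly, as a percentage."""
    return 100 * correct / scored


correct, incorrect = 86, 68  # reported for this model
scored = 200                 # assumption: implies 46 unanswered questions

print(round(net_score(correct, incorrect), 2))  # 63.33
print(round(accuracy_pct(correct, scored), 1))  # 43.0
```

Under these assumptions both printed values match the reported net score and accuracy.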

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 1 | 0 | 0 | 100.0% | 96.9% |
| Anesthesiology and Resuscitation | 2 | 5 | 0 | 28.6% | 69.6% |
| Cardiology | 7 | 9 | 9 | 28.0% | 77.3% |
| Dermatology | 4 | 2 | 5 | 36.4% | 72.3% |
| Endocrinology and Nutrition | 6 | 5 | 4 | 40.0% | 84.0% |
| ENT | 4 | 3 | 1 | 50.0% | 84.7% |
| Epidemiology | 4 | 3 | 0 | 57.1% | 80.2% |
| Gastroenterology | 14 | 8 | 8 | 46.7% | 79.3% |
| Genetics | 6 | 3 | 2 | 54.5% | 78.7% |
| Geriatrics | 3 | 6 | 4 | 23.1% | 83.0% |
| Gynecology and Obstetrics | 4 | 4 | 4 | 33.3% | 84.3% |
| Health Planning and Management | 4 | 3 | 3 | 40.0% | 78.4% |
| Hematology | 3 | 4 | 2 | 33.3% | 76.6% |
| Immunology | 5 | 0 | 1 | 83.3% | 91.4% |
| Infectious Diseases | 3 | 9 | 2 | 21.4% | 77.9% |
| Legal Medicine and Bioethics | 6 | 3 | 2 | 54.5% | 82.9% |
| Medical Oncology | 10 | 5 | 8 | 43.5% | 83.0% |
| Nephrology | 5 | 3 | 2 | 50.0% | 85.1% |
| Neurology | 7 | 4 | 2 | 53.8% | 88.6% |
| Ophthalmology | 2 | 2 | 1 | 40.0% | 83.7% |
| Palliative Care | 4 | 2 | 0 | 66.7% | 80.2% |
| Pediatrics | 11 | 6 | 5 | 50.0% | 87.6% |
| Pharmacology | 5 | 3 | 3 | 45.5% | 78.6% |
| Psychiatry | 2 | 4 | 2 | 25.0% | 87.9% |
| Pulmonology | 6 | 6 | 4 | 37.5% | 82.8% |
| Radiology-Emergency | 5 | 6 | 2 | 38.5% | 67.7% |
| Rheumatology | 6 | 3 | 2 | 54.5% | 88.4% |
| Statistics | 2 | 0 | 0 | 100.0% | 83.8% |
| Traumatology | 3 | 6 | 2 | 27.3% | 65.2% |
| Urology | 7 | 1 | 0 | 87.5% | 82.5% |

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 1 | 2 | 0 | 33.3% | 82.6% |
| Biostatistics | 2 | 0 | 0 | 100.0% | 83.8% |
| Diagnosis | 31 | 25 | 25 | 38.3% | 82.2% |
| Epidemiology | 7 | 2 | 0 | 77.8% | 88.7% |
| Ethics | 5 | 1 | 0 | 83.3% | 92.0% |
| Interpretation | 12 | 10 | 15 | 32.4% | 72.0% |
| Legal | 4 | 2 | 3 | 44.4% | 82.4% |
| Pathophysiology | 13 | 8 | 5 | 50.0% | 84.3% |
| Pharmacology | 7 | 3 | 5 | 46.7% | 82.3% |
| Prevention | 7 | 8 | 1 | 43.8% | 80.6% |
| Prognosis | 4 | 1 | 0 | 80.0% | 93.2% |
| Risk | 4 | 9 | 2 | 26.7% | 84.3% |
| Tests | 14 | 12 | 7 | 42.4% | 80.3% |
| Treatment | 29 | 26 | 17 | 40.3% | 80.1% |
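
The Accuracy values in both breakdowns are consistent with dividing correct answers by all questions in the row, unanswered ones included. A minimal sketch of that calculation follows; the formula is inferred from the listed counts rather than stated by the page, and `row_accuracy` is a hypothetical helper name.

```python
def row_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Accuracy in percent, counting unanswered questions in the denominator.

    Assumption: this denominator is inferred from the breakdown rows,
    not documented by the source.
    """
    total = correct + incorrect + unanswered
    if total == 0:
        raise ValueError("row has no questions")
    return round(100 * correct / total, 1)


print(row_accuracy(7, 9, 9))     # Cardiology: 28.0
print(row_accuracy(7, 1, 0))     # Urology: 87.5
print(row_accuracy(12, 10, 15))  # Interpretation: 32.4
```

Counting unanswered questions in the denominator means that abstaining lowers accuracy, even though it would not lower a penalty-based net score.
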
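| # | Answer | Correct | Status |
|---|---|---|---|
| 1 |  | A |  |
| 2 | B | B |  |
| 3 |  | D |  |
| 4 | D | D |  |
| 5 | B | B |  |
| 6 |  | C |  |
| 7 | A | B |  |
| 8 | A | D |  |
| 9 | D | D |  |
| 10 |  | A |  |
| 11 | A | B |  |
| 12 |  | A |  |
| 13 |  |  | Annulled |
| 14 | B | C |  |
| 15 |  | A |  |
| 16 |  | C |  |
| 17 |  | C |  |
| 18 | B | D |  |
| 19 | D | B |  |
| 20 |  | B |  |
| 21 | B | B |  |
| 22 |  | C |  |
| 23 | A | B |  |
| 24 | A | A |  |
| 25 | C | C |  |
| 26 | D | A |  |
| 27 | B | B |  |
| 28 | D | D |  |
| 29 | A | A |  |
| 30 | B | D |  |
| 31 | D | C |  |
| 32 | C | C |  |
| 33 | B | B |  |
| 34 | A | B |  |
| 35 |  | C |  |
| 36 | C | D |  |
| 37 | C | D |  |
| 38 | D | A |  |
| 39 | A | C |  |
| 40 | A | A |  |
| 41 | C | D |  |
| 42 | A | A |  |
| 43 | B | B |  |
| 44 | B | B |  |
| 45 | B | B |  |
| 46 |  | B |  |
| 47 |  | D |  |
| 48 |  | C |  |
| 49 | A | C |  |
| 50 | A |  | Annulled |
| 51 |  | A |  |
| 52 | C | C |  |
| 53 | C | C |  |
| 54 | A | C |  |
| 55 | D | A |  |
| 56 | C | A |  |
| 57 |  | A |  |
| 58 | B | A |  |
| 59 | B | B |  |
| 60 | C | B |  |
| 61 |  | C |  |
| 62 |  | B |  |
| 63 | D | B |  |
| 64 |  |  | Annulled |
| 65 | C | C |  |
| 66 |  | C |  |
| 67 | C | C |  |
| 68 | A | A |  |
| 69 | B | B |  |
| 70 | C | B |  |
| 71 | B | B |  |
| 72 | B | B |  |
| 73 | D | B |  |
| 74 | A | A |  |
| 75 | C | C |  |
| 76 |  | C |  |
| 77 | A | A |  |
| 78 | D | D |  |
| 79 | A | A |  |
| 80 | B | D |  |
| 81 |  | C |  |
| 82 |  | D |  |
| 83 |  | B |  |
| 84 | A | A |  |
| 85 | D | D |  |
| 86 | A | A |  |
| 87 | B | B |  |
| 88 | B | B |  |
| 89 | B | B |  |
| 90 |  | C |  |
| 91 | B | A |  |
| 92 | A | C |  |
| 93 |  | D |  |
| 94 | B | C |  |
| 95 | C | C |  |
| 96 | A | B |  |
| 97 |  | B |  |
| 98 | B | A |  |
| 99 |  | B |  |
| 100 | C | C |  |
| 101 | C | C |  |
| 102 | D | A |  |
| 103 | C | C |  |
| 104 | C | C |  |
| 105 | C | C |  |
| 106 | C | B |  |
| 107 | C | C |  |
| 108 | A | A |  |
| 109 |  | C |  |
| 110 | D | C |  |
| 111 | A | B |  |
| 112 |  | C |  |
| 113 |  | C |  |
| 114 | D | D |  |
| 115 | D | B |  |
| 116 | C | D |  |
| 117 | C | C |  |
| 118 | A | B |  |
| 119 | D | A |  |
| 120 |  | A |  |
| 121 |  | C |  |
| 122 |  | B |  |
| 123 |  | B |  |
| 124 | A | B |  |
| 125 | C | C |  |
| 126 | C | B |  |
| 127 |  | C |  |
| 128 | B | B |  |
| 129 | A | C |  |
| 130 | C | C |  |
| 131 | B | B |  |
| 132 | A | B |  |
| 133 |  | C |  |
| 134 | B | B |  |
| 135 | C | C |  |
| 136 | C | B |  |
| 137 | D | D |  |
| 138 | D | B |  |
| 139 | A |  | Annulled |
| 140 |  | D |  |
| 141 | C | A |  |
| 142 | A |  | Annulled |
| 143 | C | A |  |
| 144 | D | D |  |
| 145 | D | C |  |
| 146 | B | C |  |
| 147 |  | B |  |
| 148 | C | C |  |
| 149 | B | B |  |
| 150 | B | B |  |
| 151 | B | B |  |
| 152 | C | C |  |
| 153 | C | C |  |
| 154 |  | D |  |
| 155 | B | B |  |
| 156 | A | A |  |
| 157 | D | D |  |
| 158 | B | B |  |
| 159 | B | C |  |
| 160 | A | A |  |
| 161 |  |  | Annulled |
| 162 |  | A |  |
| 163 | B | C |  |
| 164 | C | C |  |
| 165 | B | A |  |
| 166 | A | A |  |
| 167 | B | D |  |
| 168 | C | C |  |
| 169 | A | B |  |
| 170 | B | B |  |
| 171 | A | A |  |
| 172 | A | B |  |
| 173 | C | D |  |
| 174 |  | C |  |
| 175 | B | B |  |
| 176 |  | A |  |
| 177 | A | D |  |
| 178 | B | D |  |
| 179 |  | B |  |
| 180 | B | C |  |
| 181 | B | B |  |
| 182 | C | C |  |
| 183 | D | D |  |
| 184 | A | D |  |
| 185 | D | B |  |
| 186 | D | D |  |
| 187 | D | D |  |
| 188 | B | C |  |
| 189 | C | C |  |
| 190 | A | B |  |
| 191 | D | D |  |
| 192 | B | D |  |
| 193 | C | C |  |
| 194 |  | C |  |
| 195 |  | D |  |
| 196 | A | A |  |
| 197 | A | C |  |
| 198 | B | A |  |
| 199 | D | B |  |
| 200 | D | D |  |
| 201 | C | C |  |
| 202 | A | A |  |
| 203 | A | A |  |
| 204 |  | C |  |
| 205 | B | B |  |
| 206 | A | C |  |
| 207 | B | A |  |
| 208 | A |  | Annulled |
| 209 |  | B |  |
| 210 | C | C |  |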
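
Each row of the answer table pairs the model's answer with the keyed answer, so a per-question outcome can be derived from it. The sketch below assumes that a row with only one letter records the keyed answer for a question the model left unanswered. This is an inference, not something the page states, though under that reading questions 1 to 206, excluding annulled ones, tally to the reported 86 correct and 68 incorrect. The `classify` helper and the tuple layout are hypothetical.

```python
from collections import Counter
from typing import Optional


def classify(answer: Optional[str], correct: Optional[str],
             annulled: bool = False) -> str:
    """Label one question as annulled, unanswered, correct, or incorrect."""
    if annulled:
        return "annulled"
    if not answer:
        # Assumption: a missing model answer means the question was skipped.
        return "unanswered"
    return "correct" if answer == correct else "incorrect"


# Sample rows: (question number, model answer, keyed answer, annulled).
rows = [
    (1, None, "A", False),
    (2, "B", "B", False),
    (7, "A", "B", False),
    (13, None, None, True),
]

tally = Counter(classify(a, c, x) for _, a, c, x in rows)
print(dict(tally))
```

Checking the annulled flag first keeps annulled questions out of the correct and incorrect counts regardless of what the model answered.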