MedicalBenchmark
Meta: Llama 3 70B Instruct provider

Llama 3 70B Instruct

224

#224 of 290 modelsMIR 2025

Net score

118.66 pts

Accuracy

66.5%

Correct / Incorrect

133 / 43

Total Cost

$0.09

Overall Performance

(vs. average)
Accuracy

66.5%

avg: 75.9%

Net score

118.66 pts

avg: 138.99 pts

Correct

133

avg: 152

Incorrect

43

avg: 38

Total Cost

$0.09

avg: $3.59

Average response time

12.5s

avg: 18.1s

Output Tokens

64K

avg: 443K

Reasoning Tokens

0

avg: 320K

Average confidence

88.5%

avg: 94.7%

Subject Breakdown

Allergology
Correct
3
Incorrect
0
Unanswered
1
Accuracy
75.0%
Average
86.9%
Anesthesiology and Resuscitation
Correct
4
Incorrect
1
Unanswered
1
Accuracy
66.7%
Average
81.3%
Cardiology
Correct
18
Incorrect
1
Unanswered
3
Accuracy
81.8%
Average
77.4%
Dermatology
Correct
7
Incorrect
2
Unanswered
4
Accuracy
53.8%
Average
62.8%
Endocrinology and Nutrition
Correct
12
Incorrect
2
Unanswered
2
Accuracy
75.0%
Average
82.5%
ENT
Correct
6
Incorrect
1
Unanswered
1
Accuracy
75.0%
Average
73.8%
Epidemiology
Correct
2
Incorrect
4
Unanswered
1
Accuracy
28.6%
Average
67.1%
Gastroenterology
Correct
12
Incorrect
8
Unanswered
1
Accuracy
57.1%
Average
72.9%
Genetics
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
68.2%
Geriatrics
Correct
8
Incorrect
1
Unanswered
2
Accuracy
72.7%
Average
71.2%
Gynecology and Obstetrics
Correct
17
Incorrect
0
Unanswered
2
Accuracy
89.5%
Average
85.9%
Health Planning and Management
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
81.6%
Hematology
Correct
8
Incorrect
3
Unanswered
0
Accuracy
72.7%
Average
81.8%
Immunology
Correct
7
Incorrect
2
Unanswered
0
Accuracy
77.8%
Average
82.5%
Infectious Diseases
Correct
17
Incorrect
8
Unanswered
3
Accuracy
60.7%
Average
71.1%
Legal Medicine and Bioethics
Correct
2
Incorrect
1
Unanswered
2
Accuracy
40.0%
Average
67.2%
Medical Oncology
Correct
20
Incorrect
2
Unanswered
3
Accuracy
80.0%
Average
86.3%
Nephrology
Correct
12
Incorrect
3
Unanswered
0
Accuracy
80.0%
Average
78.2%
Neurology
Correct
13
Incorrect
4
Unanswered
3
Accuracy
65.0%
Average
76.2%
Ophthalmology
Correct
2
Incorrect
2
Unanswered
1
Accuracy
40.0%
Average
72.6%
Palliative Care
Correct
3
Incorrect
0
Unanswered
1
Accuracy
75.0%
Average
77.2%
Pediatrics
Correct
16
Incorrect
6
Unanswered
3
Accuracy
64.0%
Average
72.7%
Pharmacology
Correct
14
Incorrect
2
Unanswered
1
Accuracy
82.4%
Average
73.1%
Psychiatry
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
82.0%
Pulmonology
Correct
9
Incorrect
3
Unanswered
2
Accuracy
64.3%
Average
73.0%
Radiology-Emergency
Correct
8
Incorrect
4
Unanswered
2
Accuracy
57.1%
Average
67.9%
Rheumatology
Correct
7
Incorrect
7
Unanswered
0
Accuracy
50.0%
Average
74.6%
Statistics
Correct
0
Incorrect
2
Unanswered
1
Accuracy
0.0%
Average
74.9%
Traumatology
Correct
10
Incorrect
6
Unanswered
2
Accuracy
55.6%
Average
78.2%
Urology
Correct
4
Incorrect
1
Unanswered
2
Accuracy
57.1%
Average
79.5%

Question Type Breakdown

Anatomy
Correct
4
Incorrect
1
Unanswered
2
Accuracy
57.1%
Average
77.1%
Biostatistics
Correct
1
Incorrect
2
Unanswered
1
Accuracy
25.0%
Average
78.4%
Diagnosis
Correct
62
Incorrect
19
Unanswered
8
Accuracy
69.7%
Average
77.9%
Epidemiology
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
75.0%
Ethics
Correct
1
Incorrect
0
Unanswered
2
Accuracy
33.3%
Average
72.0%
Interpretation
Correct
26
Incorrect
10
Unanswered
6
Accuracy
61.9%
Average
69.3%
Legal
Correct
2
Incorrect
1
Unanswered
1
Accuracy
50.0%
Average
63.6%
Pathophysiology
Correct
20
Incorrect
6
Unanswered
1
Accuracy
74.1%
Average
72.6%
Pharmacology
Correct
11
Incorrect
1
Unanswered
1
Accuracy
84.6%
Average
82.4%
Prevention
Correct
9
Incorrect
2
Unanswered
1
Accuracy
75.0%
Average
74.5%
Prognosis
Correct
4
Incorrect
1
Unanswered
1
Accuracy
66.7%
Average
77.8%
Risk
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
84.3%
Tests
Correct
15
Incorrect
6
Unanswered
5
Accuracy
57.7%
Average
76.3%
Treatment
Correct
51
Incorrect
18
Unanswered
13
Accuracy
62.2%
Average
75.2%
#AnswerCorrectStatus
1BB
2A
3CC
4B
5AA
6C
7CC
8CA
9CA
10DD
11DD
12DD
13BB
14DD
15B
16CB
17CB
18CA
19CC
20BA
21BB
22CD
23AC
24DD
25C
26Annulled
27DC
28CAnnulled
29DD
30BB
31DD
32AA
33DD
34DD
35BB
36DD
37BC
38CC
39DD
40BA
41BD
42CC
43DB
44DD
45D
46AA
47A
48A
49DD
50B
51DC
52BB
53DD
54DB
55AA
56BAnnulled
57CC
58BB
59DD
60AA
61AA
62DD
63BB
64DD
65AA
66AA
67BB
68BB
69AB
70AA
71D
72CA
73CD
74CC
75AA
76BB
77BB
78BB
79AC
80CC
81CC
82CD
83BB
84DD
85AC
86CC
87CA
88AD
89BB
90A
91B
92BC
93BB
94CC
95AA
96CC
97DD
98CC
99AA
100CC
101BB
102DD
103AA
104CC
105AA
106CC
107BB
108DD
109BB
110CC
111AA
112CC
113B
114DD
115D
116CC
117A
118DD
119AC
120BB
121BD
122CC
123CC
124CC
125BD
126BD
127AB
128DD
129AA
130DD
131DD
132AA
133BB
134CC
135CB
136CC
137A
138D
139DD
140BB
141AA
142AA
143BB
144BB
145DD
146CC
147BB
148BA
149DA
150AA
151AA
152AA
153BB
154BB
155BB
156C
157AA
158CC
159BC
160AA
161AA
162
163DD
164CC
165DA
166CB
167CC
168BD
169CB
170BB
171CC
172BA
173AA
174BB
175BB
176AC
177CC
178A
179DD
180AA
181DB
182C
183B
184B
185BB
186DAnnulled
187CC
188DD
189DD
190AA
191BB
192AA
193CC
194AA
195AA
196AA
197BB
198C
199DD
200CC
201AB
202BA
203DD
204C
205BB
206CD
207A
208BC
209CC
210BB