MedicalBenchmark
Sao10k: Llama 3 Euryale 70B v2.1 provider

Llama 3 Euryale 70B v2.1

237

#237 of 290 modelsMIR 2025

Net score

100.00 pts

Accuracy

61.0%

Correct / Incorrect

122 / 66

Total Cost

$0.28

Overall Performance

(vs. average)
Accuracy

61.0%

avg: 75.9%

Net score

100.00 pts

avg: 138.99 pts

Correct

122

avg: 152

Incorrect

66

avg: 38

Total Cost

$0.28

avg: $3.59

Average response time

9.0s

avg: 18.1s

Output Tokens

89K

avg: 443K

Reasoning Tokens

0

avg: 320K

Average confidence

93.1%

avg: 94.7%

Subject Breakdown

Allergology
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
86.9%
Anesthesiology and Resuscitation
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
81.3%
Cardiology
Correct
13
Incorrect
7
Unanswered
2
Accuracy
59.1%
Average
77.4%
Dermatology
Correct
8
Incorrect
5
Unanswered
0
Accuracy
61.5%
Average
62.8%
Endocrinology and Nutrition
Correct
13
Incorrect
2
Unanswered
1
Accuracy
81.3%
Average
82.5%
ENT
Correct
4
Incorrect
2
Unanswered
2
Accuracy
50.0%
Average
73.8%
Epidemiology
Correct
2
Incorrect
5
Unanswered
0
Accuracy
28.6%
Average
67.1%
Gastroenterology
Correct
13
Incorrect
7
Unanswered
1
Accuracy
61.9%
Average
72.9%
Genetics
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
68.2%
Geriatrics
Correct
8
Incorrect
2
Unanswered
1
Accuracy
72.7%
Average
71.2%
Gynecology and Obstetrics
Correct
16
Incorrect
2
Unanswered
1
Accuracy
84.2%
Average
85.9%
Health Planning and Management
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
81.6%
Hematology
Correct
8
Incorrect
2
Unanswered
1
Accuracy
72.7%
Average
81.8%
Immunology
Correct
7
Incorrect
2
Unanswered
0
Accuracy
77.8%
Average
82.5%
Infectious Diseases
Correct
14
Incorrect
12
Unanswered
2
Accuracy
50.0%
Average
71.1%
Legal Medicine and Bioethics
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
67.2%
Medical Oncology
Correct
19
Incorrect
5
Unanswered
1
Accuracy
76.0%
Average
86.3%
Nephrology
Correct
9
Incorrect
5
Unanswered
1
Accuracy
60.0%
Average
78.2%
Neurology
Correct
13
Incorrect
6
Unanswered
1
Accuracy
65.0%
Average
76.2%
Ophthalmology
Correct
1
Incorrect
3
Unanswered
1
Accuracy
20.0%
Average
72.6%
Palliative Care
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
77.2%
Pediatrics
Correct
12
Incorrect
12
Unanswered
1
Accuracy
48.0%
Average
72.7%
Pharmacology
Correct
12
Incorrect
3
Unanswered
2
Accuracy
70.6%
Average
73.1%
Psychiatry
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
82.0%
Pulmonology
Correct
10
Incorrect
4
Unanswered
0
Accuracy
71.4%
Average
73.0%
Radiology-Emergency
Correct
10
Incorrect
4
Unanswered
0
Accuracy
71.4%
Average
67.9%
Rheumatology
Correct
7
Incorrect
5
Unanswered
2
Accuracy
50.0%
Average
74.6%
Statistics
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
74.9%
Traumatology
Correct
8
Incorrect
9
Unanswered
1
Accuracy
44.4%
Average
78.2%
Urology
Correct
4
Incorrect
2
Unanswered
1
Accuracy
57.1%
Average
79.5%

Question Type Breakdown

Anatomy
Correct
5
Incorrect
1
Unanswered
1
Accuracy
71.4%
Average
77.1%
Biostatistics
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
78.4%
Diagnosis
Correct
55
Incorrect
29
Unanswered
5
Accuracy
61.8%
Average
77.9%
Epidemiology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
75.0%
Ethics
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
72.0%
Interpretation
Correct
17
Incorrect
21
Unanswered
4
Accuracy
40.5%
Average
69.3%
Legal
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
63.6%
Pathophysiology
Correct
20
Incorrect
5
Unanswered
2
Accuracy
74.1%
Average
72.6%
Pharmacology
Correct
10
Incorrect
2
Unanswered
1
Accuracy
76.9%
Average
82.4%
Prevention
Correct
7
Incorrect
5
Unanswered
0
Accuracy
58.3%
Average
74.5%
Prognosis
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
77.8%
Risk
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
84.3%
Tests
Correct
18
Incorrect
8
Unanswered
0
Accuracy
69.2%
Average
76.3%
Treatment
Correct
46
Incorrect
29
Unanswered
7
Accuracy
56.1%
Average
75.2%
#AnswerCorrectStatus
1BB
2DA
3C
4AB
5AA
6CC
7CC
8CA
9CA
10BD
11AD
12BD
13AB
14DD
15B
16CB
17DB
18CA
19CC
20BA
21BB
22CD
23C
24DD
25CC
26AAnnulled
27AC
28DAnnulled
29DD
30BB
31DD
32AA
33DD
34DD
35BB
36DD
37DC
38CC
39BD
40BA
41DD
42CC
43CB
44DD
45BD
46AA
47AA
48AA
49CD
50BB
51DC
52BB
53D
54DB
55AA
56BAnnulled
57CC
58BB
59DD
60DA
61A
62DD
63BB
64DD
65AA
66AA
67AB
68B
69AB
70AA
71DD
72AA
73AD
74CC
75AA
76BB
77BB
78BB
79AC
80CC
81CC
82DD
83DB
84DD
85AC
86AC
87CA
88AD
89CB
90AA
91BB
92C
93BB
94CC
95BA
96CC
97DD
98CC
99A
100CC
101DB
102DD
103AA
104CC
105DA
106CC
107BB
108BD
109BB
110CC
111AA
112CC
113BB
114DD
115CD
116CC
117AA
118CD
119AC
120BB
121D
122CC
123CC
124CC
125BD
126DD
127AB
128DD
129AA
130DD
131BD
132AA
133DB
134CC
135DB
136CC
137BA
138DD
139DD
140BB
141AA
142BA
143BB
144BB
145DD
146CC
147BB
148BA
149A
150DA
151AA
152AA
153BB
154BB
155BB
156CC
157AA
158C
159CC
160CA
161CA
162C
163DD
164C
165AA
166B
167CC
168DD
169AB
170BB
171CC
172CA
173DA
174BB
175BB
176AC
177CC
178BA
179CD
180AA
181BB
182CC
183DB
184BB
185DB
186DAnnulled
187CC
188DD
189DD
190AA
191BB
192BA
193CC
194AA
195AA
196AA
197BB
198AC
199DD
200CC
201CB
202BA
203DD
204DC
205BB
206CD
207AA
208BC
209CC
210BB