MedicalBenchmark
Sao10K: Llama 3.1 Euryale 70B v2.2 provider

Llama 3.1 Euryale 70B v2.2

#252 of 319 models
MIR 2026

Net score: 133.66 pts
Accuracy: 72.5%
Correct / Incorrect: 145 / 34
Total Cost: $0.20

Overall Performance (vs. average)

Metric                   This model    Average
Accuracy                 72.5%         82.4%
Net score                133.66 pts    156.14 pts
Correct                  145           165
Incorrect                34            26
Total Cost               $0.20         $3.12
Average response time    22.8s         17.4s
Output Tokens            141K          415K
Reasoning Tokens         0             295K
Average confidence       88.9%         95.4%
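
The headline figures appear to be arithmetically related, and a short sketch can make that explicit. The page does not state the scoring rule, so the one-third-point penalty per incorrect answer used here is an assumption, chosen because it reproduces the displayed net score to within rounding. The total number of scored questions is likewise not stated; it is inferred from the accuracy figure. The names `net_score` and `implied_total` are hypothetical helpers for illustration only.

```python
# Figures copied from the results above.
CORRECT = 145
INCORRECT = 34
ACCURACY_PCT = 72.5

# Assumption (not stated on the page): each incorrect answer
# costs one third of a point, and unanswered questions cost nothing.
ASSUMED_PENALTY = 1 / 3


def net_score(correct: int, incorrect: int, penalty: float = ASSUMED_PENALTY) -> float:
    """Net score under the assumed penalty rule."""
    return correct - incorrect * penalty


def implied_total(correct: int, accuracy_pct: float) -> int:
    """Number of scored questions implied by accuracy = correct / total."""
    return round(correct * 100 / accuracy_pct)


score = net_score(CORRECT, INCORRECT)
total = implied_total(CORRECT, ACCURACY_PCT)
unanswered = total - CORRECT - INCORRECT

print(f"net score under assumed rule: {score:.4f}")
print(f"implied scored questions: {total}")
print(f"implied unanswered: {unanswered}")
```

If the assumed rule is right, the low total cost and zero reasoning tokens come with a visible penalty: 34 incorrect answers remove roughly 11 points from the 145 earned.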

Subject Breakdown

Subject                             Correct  Incorrect  Unanswered  Accuracy  Average
Allergology                         1        0          0           100.0%    96.9%
Anesthesiology and Resuscitation    3        4          0           42.9%     71.0%
Cardiology                          13       7          5           52.0%     78.4%
Dermatology                         9        0          2           81.8%     73.2%
Endocrinology and Nutrition         11       3          1           73.3%     84.7%
ENT                                 5        2          1           62.5%     85.0%
Epidemiology                        4        2          1           57.1%     81.3%
Gastroenterology                    22       4          4           73.3%     80.2%
Genetics                            7        4          0           63.6%     79.7%
Geriatrics                          10       3          0           76.9%     83.7%
Gynecology and Obstetrics           9        3          0           75.0%     85.2%
Health Planning and Management      9        1          0           90.0%     79.4%
Hematology                          6        1          2           66.7%     77.6%
Immunology                          5        1          0           83.3%     91.8%
Infectious Diseases                 10       3          1           71.4%     79.0%
Legal Medicine and Bioethics        10       1          0           90.9%     83.4%
Medical Oncology                    18       4          1           78.3%     83.7%
Nephrology                          8        1          1           80.0%     85.6%
Neurology                           10       2          1           76.9%     89.0%
Ophthalmology                       3        0          2           60.0%     84.5%
Palliative Care                     4        1          1           66.7%     80.8%
Pediatrics                          19       2          1           86.4%     88.1%
Pharmacology                        6        2          3           54.5%     79.6%
Psychiatry                          7        0          1           87.5%     88.4%
Pulmonology                         10       4          2           62.5%     83.6%
Radiology-Emergency                 5        5          3           38.5%     69.4%
Rheumatology                        9        0          2           81.8%     89.0%
Statistics                          1        1          0           50.0%     84.5%
Traumatology                        7        3          1           63.6%     66.6%
Urology                             7        1          0           87.5%     83.1%

Question Type Breakdown

Question Type      Correct  Incorrect  Unanswered  Accuracy  Average
Anatomy            3        0          0           100.0%    83.4%
Biostatistics      1        1          0           50.0%     84.5%
Diagnosis          59       9          13          72.8%     83.0%
Epidemiology       6        2          1           66.7%     89.1%
Ethics             6        0          0           100.0%    92.2%
Interpretation     20       9          8           54.1%     73.2%
Legal              8        1          0           88.9%     82.9%
Pathophysiology    17       7          2           65.4%     85.0%
Pharmacology       11       2          2           73.3%     83.1%
Prevention         12       3          1           75.0%     81.5%
Prognosis          5        0          0           100.0%    93.2%
Risk               13       2          0           86.7%     85.0%
Tests              22       9          2           66.7%     81.2%
Treatment          50       14         8           69.4%     80.9%
# Answer Correct Status
1 D A
2 B B
3 D
4 D D
5 B
6 A C
7 B B
8 C D
9 D D
10 A A
11 B
12 C A
13 D Annulled
14 B C
15 A
16 C C
17 C C
18 D D
19 B
20 B B
21 B B
22 B C
23 B
24 A A
25 A C
26 A A
27 B B
28 D D
29 A A
30 D D
31 B C
32 C C
33 C B
34 B B
35 C C
36 B D
37 D D
38 A
39 C C
40 A A
41 D D
42 A A
43 B B
44 B B
45 B B
46 B B
47 D D
48 C C
49 C C
50 B Annulled
51 A A
52 C C
53 C C
54 D C
55 A A
56 A A
57 C A
58 A
59 B B
60 B B
61 C C
62 B B
63 B B
64 C Annulled
65 C
66 D C
67 C
68 A A
69 C B
70 B B
71 B
72 B B
73 B B
74 A A
75 C C
76 C C
77 A A
78 A D
79 A A
80 D D
81 C C
82 A D
83 B B
84 A A
85 D D
86 A A
87 B B
88 B B
89 B B
90 C C
91 A
92 C C
93 D D
94 C C
95 C C
96 B B
97 B B
98 A A
99 B B
100 C
101 C C
102 A A
103 C C
104 C C
105 C C
106 B B
107 C
108 A
109 C C
110 B C
111 A B
112 C C
113 C C
114 D D
115 B B
116 D D
117 C C
118 A B
119 A A
120 A A
121 C C
122 C B
123 B
124 B B
125 C C
126 A B
127 C C
128 C B
129 C C
130 C C
131 B B
132 A B
133 C C
134 B B
135 C
136 B B
137 D D
138 B B
139 A Annulled
140 B D
141 A A
142 Annulled
143 A A
144 D D
145 C C
146 C C
147 B B
148 C C
149 B B
150 B B
151 B B
152 C C
153 C C
154 D D
155 B B
156 A A
157 D D
158 B B
159 C C
160 A A
161 C Annulled
162 A A
163 A C
164 C C
165 A A
166 A A
167 D D
168 C C
169 B B
170 A B
171 A A
172 A B
173 D D
174 A C
175 B B
176 A
177 D D
178 A D
179 B B
180 C C
181 B B
182 C C
183 D D
184 D
185 D B
186 D D
187 D D
188 B C
189 C C
190 B B
191 D
192 D D
193 C C
194 C C
195 D D
196 A A
197 D C
198 B A
199 A B
200 D D
201 C C
202 A
203 A A
204 C C
205 B B
206 C C
207 A A
208 B Annulled
209 B B
210 C C