MedicalBenchmark
Provider: Sao10K

Llama 3.1 Euryale 70B v2.2

#225 of 290 models (MIR 2026)

Net score: 133.66 pts
Accuracy: 72.5%
Correct / Incorrect: 145 / 34
Total Cost: $0.20

Overall Performance (vs. average)

| Metric | This model | Average |
|---|---|---|
| Accuracy | 72.5% | 81.6% |
| Net score | 133.66 pts | 154.00 pts |
| Correct | 145 | 163 |
| Incorrect | 34 | 28 |
| Total Cost | $0.20 | $3.33 |
| Average response time | 22.8s | 16.2s |
| Output Tokens | 141K | 430K |
| Reasoning Tokens | 0 | 310K |
| Average confidence | 88.9% | 95.1% |
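The page does not state how the net score is derived. The reported figures are, however, consistent with a rule in which each incorrect answer costs one third of a point and unanswered questions cost nothing. The sketch below checks that assumption against the numbers above; the function name `net_score` and the 1/3 penalty are assumptions for illustration, not something the page confirms.

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> Fraction:
    """Correct answers minus a fractional penalty per incorrect answer.

    The 1/3 penalty is an assumed rule, inferred from the reported figures.
    """
    return correct - incorrect * penalty


# Figures reported for this model: 145 correct, 34 incorrect.
score = net_score(145, 34)
print(float(score))  # close to the 133.66 pts shown on the page
```

Under this assumption the exact value is 401/3, which agrees with the displayed 133.66 pts to within rounding or truncation in the last digit.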

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 1 | 0 | 0 | 100.0% | 96.9% |
| Anesthesiology and Resuscitation | 3 | 4 | 0 | 42.9% | 69.6% |
| Cardiology | 13 | 7 | 5 | 52.0% | 77.3% |
| Dermatology | 9 | 0 | 2 | 81.8% | 72.3% |
| Endocrinology and Nutrition | 11 | 3 | 1 | 73.3% | 84.0% |
| ENT | 5 | 2 | 1 | 62.5% | 84.7% |
| Epidemiology | 4 | 2 | 1 | 57.1% | 80.2% |
| Gastroenterology | 22 | 4 | 4 | 73.3% | 79.3% |
| Genetics | 7 | 4 | 0 | 63.6% | 78.7% |
| Geriatrics | 10 | 3 | 0 | 76.9% | 83.0% |
| Gynecology and Obstetrics | 9 | 3 | 0 | 75.0% | 84.3% |
| Health Planning and Management | 9 | 1 | 0 | 90.0% | 78.4% |
| Hematology | 6 | 1 | 2 | 66.7% | 76.6% |
| Immunology | 5 | 1 | 0 | 83.3% | 91.4% |
| Infectious Diseases | 10 | 3 | 1 | 71.4% | 77.9% |
| Legal Medicine and Bioethics | 10 | 1 | 0 | 90.9% | 82.9% |
| Medical Oncology | 18 | 4 | 1 | 78.3% | 83.0% |
| Nephrology | 8 | 1 | 1 | 80.0% | 85.1% |
| Neurology | 10 | 2 | 1 | 76.9% | 88.6% |
| Ophthalmology | 3 | 0 | 2 | 60.0% | 83.7% |
| Palliative Care | 4 | 1 | 1 | 66.7% | 80.2% |
| Pediatrics | 19 | 2 | 1 | 86.4% | 87.6% |
| Pharmacology | 6 | 2 | 3 | 54.5% | 78.6% |
| Psychiatry | 7 | 0 | 1 | 87.5% | 87.9% |
| Pulmonology | 10 | 4 | 2 | 62.5% | 82.8% |
| Radiology-Emergency | 5 | 5 | 3 | 38.5% | 67.7% |
| Rheumatology | 9 | 0 | 2 | 81.8% | 88.4% |
| Statistics | 1 | 1 | 0 | 50.0% | 83.8% |
| Traumatology | 7 | 3 | 1 | 63.6% | 65.2% |
| Urology | 7 | 1 | 0 | 87.5% | 82.5% |
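The per-subject accuracy figures appear to count unanswered questions in the denominator: Cardiology, for example, has 13 correct out of 13 + 7 + 5 = 25 questions, which gives the listed 52.0%. A minimal sketch of that calculation follows; the helper name `accuracy_pct` is hypothetical, and the formula is inferred from the listed values rather than documented on the page.

```python
def accuracy_pct(correct: int, incorrect: int, unanswered: int) -> float:
    """Percentage of correct answers over all questions, including unanswered.

    Inferred formula: correct / (correct + incorrect + unanswered) * 100.
    """
    total = correct + incorrect + unanswered
    if total == 0:
        return 0.0
    return 100.0 * correct / total


# Rows taken from the subject breakdown: (correct, incorrect, unanswered).
rows = {
    "Cardiology": (13, 7, 5),
    "Dermatology": (9, 0, 2),
    "Radiology-Emergency": (5, 5, 3),
}

for subject, counts in rows.items():
    print(f"{subject}: {accuracy_pct(*counts):.1f}%")
```

Because skipped questions still lower this percentage, a subject with no incorrect answers (such as Dermatology) can still score below 100%.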

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 3 | 0 | 0 | 100.0% | 82.6% |
| Biostatistics | 1 | 1 | 0 | 50.0% | 83.8% |
| Diagnosis | 59 | 9 | 13 | 72.8% | 82.2% |
| Epidemiology | 6 | 2 | 1 | 66.7% | 88.7% |
| Ethics | 6 | 0 | 0 | 100.0% | 92.0% |
| Interpretation | 20 | 9 | 8 | 54.1% | 72.0% |
| Legal | 8 | 1 | 0 | 88.9% | 82.4% |
| Pathophysiology | 17 | 7 | 2 | 65.4% | 84.3% |
| Pharmacology | 11 | 2 | 2 | 73.3% | 82.3% |
| Prevention | 12 | 3 | 1 | 75.0% | 80.6% |
| Prognosis | 5 | 0 | 0 | 100.0% | 93.2% |
| Risk | 13 | 2 | 0 | 86.7% | 84.3% |
| Tests | 22 | 9 | 2 | 66.7% | 80.3% |
| Treatment | 50 | 14 | 8 | 69.4% | 80.1% |
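
| # | Answer | Correct | Status |
|---|---|---|---|
| 1 | D | A | |
| 2 | B | B | |
| 3 | | D | |
| 4 | D | D | |
| 5 | | B | |
| 6 | A | C | |
| 7 | B | B | |
| 8 | C | D | |
| 9 | D | D | |
| 10 | A | A | |
| 11 | | B | |
| 12 | C | A | |
| 13 | D | | Annulled |
| 14 | B | C | |
| 15 | | A | |
| 16 | C | C | |
| 17 | C | C | |
| 18 | D | D | |
| 19 | | B | |
| 20 | B | B | |
| 21 | B | B | |
| 22 | B | C | |
| 23 | | B | |
| 24 | A | A | |
| 25 | A | C | |
| 26 | A | A | |
| 27 | B | B | |
| 28 | D | D | |
| 29 | A | A | |
| 30 | D | D | |
| 31 | B | C | |
| 32 | C | C | |
| 33 | C | B | |
| 34 | B | B | |
| 35 | C | C | |
| 36 | B | D | |
| 37 | D | D | |
| 38 | | A | |
| 39 | C | C | |
| 40 | A | A | |
| 41 | D | D | |
| 42 | A | A | |
| 43 | B | B | |
| 44 | B | B | |
| 45 | B | B | |
| 46 | B | B | |
| 47 | D | D | |
| 48 | C | C | |
| 49 | C | C | |
| 50 | B | | Annulled |
| 51 | A | A | |
| 52 | C | C | |
| 53 | C | C | |
| 54 | D | C | |
| 55 | A | A | |
| 56 | A | A | |
| 57 | C | A | |
| 58 | | A | |
| 59 | B | B | |
| 60 | B | B | |
| 61 | C | C | |
| 62 | B | B | |
| 63 | B | B | |
| 64 | C | | Annulled |
| 65 | | C | |
| 66 | D | C | |
| 67 | | C | |
| 68 | A | A | |
| 69 | C | B | |
| 70 | B | B | |
| 71 | | B | |
| 72 | B | B | |
| 73 | B | B | |
| 74 | A | A | |
| 75 | C | C | |
| 76 | C | C | |
| 77 | A | A | |
| 78 | A | D | |
| 79 | A | A | |
| 80 | D | D | |
| 81 | C | C | |
| 82 | A | D | |
| 83 | B | B | |
| 84 | A | A | |
| 85 | D | D | |
| 86 | A | A | |
| 87 | B | B | |
| 88 | B | B | |
| 89 | B | B | |
| 90 | C | C | |
| 91 | | A | |
| 92 | C | C | |
| 93 | D | D | |
| 94 | C | C | |
| 95 | C | C | |
| 96 | B | B | |
| 97 | B | B | |
| 98 | A | A | |
| 99 | B | B | |
| 100 | | C | |
| 101 | C | C | |
| 102 | A | A | |
| 103 | C | C | |
| 104 | C | C | |
| 105 | C | C | |
| 106 | B | B | |
| 107 | | C | |
| 108 | | A | |
| 109 | C | C | |
| 110 | B | C | |
| 111 | A | B | |
| 112 | C | C | |
| 113 | C | C | |
| 114 | D | D | |
| 115 | B | B | |
| 116 | D | D | |
| 117 | C | C | |
| 118 | A | B | |
| 119 | A | A | |
| 120 | A | A | |
| 121 | C | C | |
| 122 | C | B | |
| 123 | | B | |
| 124 | B | B | |
| 125 | C | C | |
| 126 | A | B | |
| 127 | C | C | |
| 128 | C | B | |
| 129 | C | C | |
| 130 | C | C | |
| 131 | B | B | |
| 132 | A | B | |
| 133 | C | C | |
| 134 | B | B | |
| 135 | | C | |
| 136 | B | B | |
| 137 | D | D | |
| 138 | B | B | |
| 139 | A | | Annulled |
| 140 | B | D | |
| 141 | A | A | |
| 142 | | | Annulled |
| 143 | A | A | |
| 144 | D | D | |
| 145 | C | C | |
| 146 | C | C | |
| 147 | B | B | |
| 148 | C | C | |
| 149 | B | B | |
| 150 | B | B | |
| 151 | B | B | |
| 152 | C | C | |
| 153 | C | C | |
| 154 | D | D | |
| 155 | B | B | |
| 156 | A | A | |
| 157 | D | D | |
| 158 | B | B | |
| 159 | C | C | |
| 160 | A | A | |
| 161 | C | | Annulled |
| 162 | A | A | |
| 163 | A | C | |
| 164 | C | C | |
| 165 | A | A | |
| 166 | A | A | |
| 167 | D | D | |
| 168 | C | C | |
| 169 | B | B | |
| 170 | A | B | |
| 171 | A | A | |
| 172 | A | B | |
| 173 | D | D | |
| 174 | A | C | |
| 175 | B | B | |
| 176 | | A | |
| 177 | D | D | |
| 178 | A | D | |
| 179 | B | B | |
| 180 | C | C | |
| 181 | B | B | |
| 182 | C | C | |
| 183 | D | D | |
| 184 | | D | |
| 185 | D | B | |
| 186 | D | D | |
| 187 | D | D | |
| 188 | B | C | |
| 189 | C | C | |
| 190 | B | B | |
| 191 | | D | |
| 192 | D | D | |
| 193 | C | C | |
| 194 | C | C | |
| 195 | D | D | |
| 196 | A | A | |
| 197 | D | C | |
| 198 | B | A | |
| 199 | A | B | |
| 200 | D | D | |
| 201 | C | C | |
| 202 | | A | |
| 203 | A | A | |
| 204 | C | C | |
| 205 | B | B | |
| 206 | C | C | |
| 207 | A | A | |
| 208 | B | | Annulled |
| 209 | B | B | |
| 210 | C | C | |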