MedicalBenchmark
Sao10K: Llama 3.1 Euryale 70B v2.2

Rank: #226 of 290 models (MIR 2025)

Net score: 118.00 pts
Accuracy: 67.5%
Correct / Incorrect: 135 / 51
Total cost: $0.18

Overall Performance (vs. average)

| Metric | This model | Average |
|---|---|---|
| Accuracy | 67.5% | 75.9% |
| Net score | 118.00 pts | 138.99 pts |
| Correct | 135 | 152 |
| Incorrect | 51 | 38 |
| Total cost | $0.18 | $3.59 |
| Average response time | 18.2s | 18.1s |
| Output tokens | 112K | 443K |
| Reasoning tokens | 0 | 320K |
| Average confidence | 91.4% | 94.7% |
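The headline figures for this model appear to be mutually consistent under a simple scoring rule. The sketch below assumes, since the page does not state it, that accuracy is computed over 200 scored questions and that the net score awards one point per correct answer and deducts one third of a point per incorrect answer. The names `SCORED_QUESTIONS`, `PENALTY`, `accuracy`, and `net_score` are hypothetical and used only for illustration.

```python
from fractions import Fraction

# Figures reported on this page.
CORRECT = 135
INCORRECT = 51

# Assumptions, not stated in the source: 200 scored questions and a
# penalty of 1/3 point per incorrect answer.
SCORED_QUESTIONS = 200
PENALTY = Fraction(1, 3)


def accuracy(correct: int, total: int) -> float:
    """Share of scored questions answered correctly, as a percentage."""
    return 100 * correct / total


def net_score(correct: int, incorrect: int, penalty: Fraction = PENALTY) -> float:
    """Points for correct answers minus the penalty for incorrect ones."""
    return float(correct - incorrect * penalty)


print(f"{accuracy(CORRECT, SCORED_QUESTIONS):.1f}%")  # 67.5%
print(f"{net_score(CORRECT, INCORRECT):.2f} pts")     # 118.00 pts
```

Under these assumptions both printed values match the reported accuracy and net score. If the benchmark uses a different denominator or penalty, only the two constants need to change.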

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 3 | 1 | 0 | 75.0% | 86.9% |
| Anesthesiology and Resuscitation | 5 | 1 | 0 | 83.3% | 81.3% |
| Cardiology | 16 | 6 | 0 | 72.7% | 77.4% |
| Dermatology | 8 | 4 | 1 | 61.5% | 62.8% |
| Endocrinology and Nutrition | 10 | 5 | 1 | 62.5% | 82.5% |
| ENT | 6 | 2 | 0 | 75.0% | 73.8% |
| Epidemiology | 3 | 4 | 0 | 42.9% | 67.1% |
| Gastroenterology | 14 | 6 | 1 | 66.7% | 72.9% |
| Genetics | 3 | 2 | 1 | 50.0% | 68.2% |
| Geriatrics | 10 | 1 | 0 | 90.9% | 71.2% |
| Gynecology and Obstetrics | 13 | 3 | 3 | 68.4% | 85.9% |
| Health Planning and Management | 2 | 0 | 0 | 100.0% | 81.6% |
| Hematology | 8 | 3 | 0 | 72.7% | 81.8% |
| Immunology | 6 | 1 | 2 | 66.7% | 82.5% |
| Infectious Diseases | 17 | 9 | 2 | 60.7% | 71.1% |
| Legal Medicine and Bioethics | 4 | 1 | 0 | 80.0% | 67.2% |
| Medical Oncology | 23 | 2 | 0 | 92.0% | 86.3% |
| Nephrology | 8 | 5 | 2 | 53.3% | 78.2% |
| Neurology | 12 | 5 | 3 | 60.0% | 76.2% |
| Ophthalmology | 3 | 1 | 1 | 60.0% | 72.6% |
| Palliative Care | 3 | 1 | 0 | 75.0% | 77.2% |
| Pediatrics | 14 | 9 | 2 | 56.0% | 72.7% |
| Pharmacology | 13 | 3 | 1 | 76.5% | 73.1% |
| Psychiatry | 7 | 1 | 0 | 87.5% | 82.0% |
| Pulmonology | 11 | 3 | 0 | 78.6% | 73.0% |
| Radiology-Emergency | 9 | 4 | 1 | 64.3% | 67.9% |
| Rheumatology | 12 | 2 | 0 | 85.7% | 74.6% |
| Statistics | 2 | 1 | 0 | 66.7% | 74.9% |
| Traumatology | 12 | 5 | 1 | 66.7% | 78.2% |
| Urology | 5 | 1 | 1 | 71.4% | 79.5% |
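The per-subject accuracy values are consistent with dividing the correct count by all questions in that subject, unanswered ones included. A minimal sketch under that assumption, using three rows from the subject breakdown; `subject_accuracy` is a hypothetical helper, not something the benchmark publishes.

```python
def subject_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Percentage of a subject's questions answered correctly.

    Assumes unanswered questions count in the denominator, which matches
    the figures on this page but is not stated by the source.
    """
    total = correct + incorrect + unanswered
    if total == 0:
        raise ValueError("subject has no questions")
    return round(100 * correct / total, 1)


# (correct, incorrect, unanswered) taken from the subject breakdown.
rows = {
    "Dermatology": (8, 4, 1),
    "Gynecology and Obstetrics": (13, 3, 3),
    "Nephrology": (8, 5, 2),
}

for name, counts in rows.items():
    print(name, subject_accuracy(*counts))
```

The three results agree with the reported 61.5%, 68.4%, and 53.3%, which suggests that leaving a question unanswered lowers accuracy just as an incorrect answer does.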

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 5 | 2 | 0 | 71.4% | 77.1% |
| Biostatistics | 3 | 1 | 0 | 75.0% | 78.4% |
| Diagnosis | 63 | 18 | 8 | 70.8% | 77.9% |
| Epidemiology | 3 | 2 | 0 | 60.0% | 75.0% |
| Ethics | 2 | 1 | 0 | 66.7% | 72.0% |
| Interpretation | 21 | 14 | 7 | 50.0% | 69.3% |
| Legal | 3 | 1 | 0 | 75.0% | 63.6% |
| Pathophysiology | 15 | 10 | 2 | 55.6% | 72.6% |
| Pharmacology | 11 | 2 | 0 | 84.6% | 82.4% |
| Prevention | 5 | 5 | 2 | 41.7% | 74.5% |
| Prognosis | 5 | 1 | 0 | 83.3% | 77.8% |
| Risk | 4 | 1 | 0 | 80.0% | 84.3% |
| Tests | 20 | 5 | 1 | 76.9% | 76.3% |
| Treatment | 56 | 21 | 5 | 68.3% | 75.2% |

#  Answer  Correct  Status
1 B B
2 A
3 C C
4 A B
5 A
6 C C
7 D C
8 C A
9 A A
10 D
11 D D
12 D D
13 A B
14 D D
15 B
16 C B
17 A B
18 C A
19 C C
20 A A
21 B B
22 C D
23 C
24 D D
25 C C
26 C Annulled
27 D C
28 C Annulled
29 D D
30 A B
31 A D
32 A A
33 D D
34 D D
35 B B
36 D D
37 A C
38 C C
39 B D
40 A A
41 D D
42 C C
43 D B
44 D D
45 A D
46 A A
47 A A
48 A A
49 A D
50 B B
51 D C
52 B B
53 D D
54 D B
55 A A
56 B Annulled
57 C C
58 B B
59 D D
60 A
61 B A
62 D D
63 B B
64 D D
65 A
66 A A
67 B B
68 D B
69 C B
70 A A
71 D D
72 A A
73 B D
74 C C
75 A A
76 B B
77 B B
78 B B
79 C
80 C C
81 C C
82 A D
83 B B
84 D D
85 A C
86 C C
87 B A
88 D D
89 A B
90 A A
91 D B
92 C C
93 B B
94 C C
95 A A
96 C C
97 C D
98 C C
99 B A
100 C C
101 B B
102 D D
103 A A
104 C C
105 A A
106 C C
107 B B
108 D D
109 B B
110 C C
111 A A
112 C C
113 B B
114 D D
115 D D
116 C
117 A A
118 D D
119 B C
120 B B
121 D D
122 C C
123 C C
124 C C
125 D D
126 B D
127 B B
128 D D
129 A A
130 D D
131 B D
132 A
133 B B
134 A C
135 B B
136 C C
137 A
138 D D
139 D D
140 D B
141 A A
142 A A
143 B B
144 B B
145 C D
146 C C
147 B B
148 B A
149 A A
150 D A
151 A A
152 A A
153 B B
154 B B
155 A B
156 C C
157 A A
158 C C
159 B C
160 A
161 C A
162 C
163 D D
164 A C
165 A A
166 C B
167 A C
168 D D
169 D B
170 B B
171 C C
172 C A
173 A
174 B B
175 A B
176 C C
177 C C
178 B A
179 A D
180 A A
181 B B
182 C C
183 B B
184 B B
185 B B
186 D Annulled
187 C
188 D D
189 D D
190 A A
191 B B
192 A A
193 C C
194 A A
195 A
196 A A
197 B B
198 C C
199 D D
200 C C
201 B B
202 A A
203 D D
204 D C
205 B B
206 C D
207 A A
208 A C
209 C C
210 B B