MedicalBenchmark
Sao10K: Llama 3.1 70B Hanami x1 provider

Llama 3.1 70B Hanami x1

193

#193 of 290 modelsMIR 2026

Net score

155.66 pts

Accuracy

81.5%

Correct / Incorrect

163 / 22

Total Cost

$0.61

Overall Performance

(vs. average)
Accuracy

81.5%

avg: 81.6%

Net score

155.66 pts

avg: 154.00 pts

Correct

163

avg: 163

Incorrect

22

avg: 28

Total Cost

$0.61

avg: $3.33

Average response time

31.2s

avg: 16.2s

Output Tokens

103K

avg: 430K

Reasoning Tokens

0

avg: 310K

Average confidence

91.7%

avg: 95.1%

Subject Breakdown

Allergology
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
96.9%
Anesthesiology and Resuscitation
Correct
5
Incorrect
2
Unanswered
0
Accuracy
71.4%
Average
69.6%
Cardiology
Correct
20
Incorrect
2
Unanswered
3
Accuracy
80.0%
Average
77.3%
Dermatology
Correct
9
Incorrect
1
Unanswered
1
Accuracy
81.8%
Average
72.3%
Endocrinology and Nutrition
Correct
13
Incorrect
2
Unanswered
0
Accuracy
86.7%
Average
84.0%
ENT
Correct
5
Incorrect
2
Unanswered
1
Accuracy
62.5%
Average
84.7%
Epidemiology
Correct
4
Incorrect
1
Unanswered
2
Accuracy
57.1%
Average
80.2%
Gastroenterology
Correct
21
Incorrect
5
Unanswered
4
Accuracy
70.0%
Average
79.3%
Genetics
Correct
7
Incorrect
2
Unanswered
2
Accuracy
63.6%
Average
78.7%
Geriatrics
Correct
9
Incorrect
1
Unanswered
3
Accuracy
69.2%
Average
83.0%
Gynecology and Obstetrics
Correct
9
Incorrect
2
Unanswered
1
Accuracy
75.0%
Average
84.3%
Health Planning and Management
Correct
6
Incorrect
2
Unanswered
2
Accuracy
60.0%
Average
78.4%
Hematology
Correct
7
Incorrect
2
Unanswered
0
Accuracy
77.8%
Average
76.6%
Immunology
Correct
6
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.4%
Infectious Diseases
Correct
9
Incorrect
4
Unanswered
1
Accuracy
64.3%
Average
77.9%
Legal Medicine and Bioethics
Correct
9
Incorrect
2
Unanswered
0
Accuracy
81.8%
Average
82.9%
Medical Oncology
Correct
19
Incorrect
2
Unanswered
2
Accuracy
82.6%
Average
83.0%
Nephrology
Correct
10
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
85.1%
Neurology
Correct
12
Incorrect
1
Unanswered
0
Accuracy
92.3%
Average
88.6%
Ophthalmology
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
83.7%
Palliative Care
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
80.2%
Pediatrics
Correct
20
Incorrect
2
Unanswered
0
Accuracy
90.9%
Average
87.6%
Pharmacology
Correct
10
Incorrect
1
Unanswered
0
Accuracy
90.9%
Average
78.6%
Psychiatry
Correct
8
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
87.9%
Pulmonology
Correct
14
Incorrect
2
Unanswered
0
Accuracy
87.5%
Average
82.8%
Radiology-Emergency
Correct
9
Incorrect
3
Unanswered
1
Accuracy
69.2%
Average
67.7%
Rheumatology
Correct
11
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
88.4%
Statistics
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
83.8%
Traumatology
Correct
7
Incorrect
2
Unanswered
2
Accuracy
63.6%
Average
65.2%
Urology
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
82.5%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
0
Unanswered
1
Accuracy
66.7%
Average
82.6%
Biostatistics
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
83.8%
Diagnosis
Correct
70
Incorrect
8
Unanswered
3
Accuracy
86.4%
Average
82.2%
Epidemiology
Correct
7
Incorrect
1
Unanswered
1
Accuracy
77.8%
Average
88.7%
Ethics
Correct
5
Incorrect
1
Unanswered
0
Accuracy
83.3%
Average
92.0%
Interpretation
Correct
24
Incorrect
7
Unanswered
6
Accuracy
64.9%
Average
72.0%
Legal
Correct
7
Incorrect
2
Unanswered
0
Accuracy
77.8%
Average
82.4%
Pathophysiology
Correct
23
Incorrect
2
Unanswered
1
Accuracy
88.5%
Average
84.3%
Pharmacology
Correct
14
Incorrect
1
Unanswered
0
Accuracy
93.3%
Average
82.3%
Prevention
Correct
10
Incorrect
2
Unanswered
4
Accuracy
62.5%
Average
80.6%
Prognosis
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
93.2%
Risk
Correct
11
Incorrect
2
Unanswered
2
Accuracy
73.3%
Average
84.3%
Tests
Correct
24
Incorrect
4
Unanswered
5
Accuracy
72.7%
Average
80.3%
Treatment
Correct
60
Incorrect
7
Unanswered
5
Accuracy
83.3%
Average
80.1%
#AnswerCorrectStatus
1A
2BB
3AD
4DD
5BB
6C
7CB
8D
9DD
10AA
11BB
12CA
13Annulled
14CC
15DA
16CC
17CC
18D
19BB
20BB
21BB
22CC
23B
24AA
25CC
26CA
27B
28DD
29AA
30DD
31C
32CC
33CB
34BB
35DC
36BD
37DD
38AA
39CC
40AA
41DD
42AA
43BB
44BB
45BB
46BB
47DD
48CC
49CC
50AAnnulled
51AA
52CC
53CC
54CC
55CA
56AA
57A
58AA
59BB
60BB
61CC
62BB
63BB
64CAnnulled
65CC
66DC
67CC
68AA
69BB
70BB
71BB
72BB
73DB
74AA
75CC
76CC
77CA
78DD
79AA
80DD
81CC
82DD
83BB
84AA
85DD
86AA
87BB
88BB
89BB
90CC
91AA
92CC
93DD
94CC
95CC
96BB
97BB
98AA
99BB
100BC
101CC
102AA
103CC
104CC
105CC
106BB
107CC
108AA
109CC
110CC
111AB
112CC
113CC
114DD
115BB
116DD
117CC
118B
119CA
120AA
121CC
122B
123BB
124BB
125CC
126BB
127C
128CB
129CC
130CC
131BB
132AB
133CC
134BB
135CC
136BB
137DD
138CB
139AAnnulled
140DD
141AA
142AAnnulled
143AA
144DD
145CC
146C
147BB
148CC
149BB
150BB
151BB
152CC
153C
154DD
155B
156AA
157DD
158BB
159CC
160AA
161AAnnulled
162AA
163CC
164CC
165AA
166AA
167DD
168CC
169BB
170BB
171AA
172AB
173DD
174CC
175BB
176AA
177DD
178AD
179BB
180CC
181BB
182CC
183DD
184DD
185BB
186DD
187DD
188AC
189CC
190BB
191DD
192DD
193CC
194CC
195DD
196AA
197C
198BA
199BB
200DD
201CC
202AA
203AA
204CC
205BB
206CC
207AA
208BAnnulled
209BB
210CC