MedicalBenchmark
Sao10K: Llama 3 8B Lunaris provider

Llama 3 8B Lunaris

#289 of 319 models, MIR 2025

Overall Performance (vs. average)

Metric                 Model       Average
Net score              55.33 pts   143.96 pts
Accuracy               43.0%       77.9%
Correct                86          156
Incorrect              92          35
Total cost             $0.01       $3.36
Average response time  5.7s        19.0s
Output tokens          72K         430K
Reasoning tokens       0           306K
Average confidence     87.9%       95.2%
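
The net score and accuracy reported above are consistent with a simple penalty scheme. The sketch below is only an illustration: it assumes one third of a point is deducted per incorrect answer and that 200 questions are scored, neither of which is stated on this page, and the function names are hypothetical.

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> float:
    """Correct answers minus a fractional penalty per incorrect answer.

    The 1/3 penalty is an assumption, not something the page states.
    """
    return float(correct - penalty * incorrect)


def accuracy(correct: int, scored_questions: int) -> float:
    """Percentage of scored questions answered correctly."""
    return 100 * correct / scored_questions


correct, incorrect = 86, 92   # figures reported on the page
scored_questions = 200        # assumption: number of scored questions

print(f"Net score: {net_score(correct, incorrect):.2f} pts")
print(f"Accuracy:  {accuracy(correct, scored_questions):.1f}%")
print(f"Unanswered (implied): {scored_questions - correct - incorrect}")
```

Under these assumptions the reported 55.33 pts and 43.0% are reproduced, and 22 scored questions are left unanswered.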

Subject Breakdown

Subject                           Correct  Incorrect  Unanswered  Accuracy  Average
Allergology                       3        1          0           75.0%     87.9%
Anesthesiology and Resuscitation  1        4          1           16.7%     82.3%
Cardiology                        7        11         4           31.8%     78.6%
Dermatology                       5        3          4           41.7%     69.4%
Endocrinology and Nutrition       10       3          3           62.5%     83.5%
ENT                               2        5          1           25.0%     74.8%
Epidemiology                      2        4          1           28.6%     69.1%
Gastroenterology                  8        10         3           38.1%     74.1%
Genetics                          2        3          1           33.3%     69.5%
Geriatrics                        5        6          0           45.5%     77.5%
Gynecology and Obstetrics         12       6          1           63.2%     86.7%
Health Planning and Management    0        2          0           0.0%      82.6%
Hematology                        5        5          1           45.5%     82.7%
Immunology                        4        4          1           44.4%     83.3%
Infectious Diseases               12       11         4           44.4%     74.9%
Legal Medicine and Bioethics      2        2          1           40.0%     68.4%
Medical Oncology                  12       11         2           48.0%     87.2%
Nephrology                        10       2          2           71.4%     84.8%
Neurology                         7        11         2           35.0%     77.3%
Ophthalmology                     2        3          0           40.0%     74.2%
Palliative Care                   0        3          1           0.0%      78.6%
Pediatrics                        14       11         1           53.8%     71.9%
Pharmacology                      7        7          3           41.2%     74.1%
Psychiatry                        3        5          0           37.5%     83.0%
Pulmonology                       4        8          2           28.6%     80.4%
Radiology-Emergency               6        8          0           42.9%     69.4%
Rheumatology                      6        7          2           40.0%     76.6%
Statistics                        1        2          0           33.3%     76.6%
Traumatology                      8        9          1           44.4%     79.3%
Urology                           3        4          0           42.9%     80.7%
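
The accuracy figures in the subject breakdown above appear to be the share of correct answers among all questions in the subject, with unanswered questions counted in the denominator. A minimal sketch of that arithmetic, using three rows from the breakdown (the function name and the rounding to one decimal are assumptions):

```python
def subject_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Percent correct over all questions in a subject, rounded to 0.1.

    Unanswered questions count in the denominator (an assumption that
    matches the figures shown in the breakdown).
    """
    total = correct + incorrect + unanswered
    return round(100 * correct / total, 1) if total else 0.0


# (correct, incorrect, unanswered) taken from the subject breakdown
rows = {
    "Allergology": (3, 1, 0),
    "Cardiology": (7, 11, 4),
    "Nephrology": (10, 2, 2),
}

for subject, counts in rows.items():
    print(f"{subject}: {subject_accuracy(*counts)}%")
```

These three rows come out at 75.0%, 31.8%, and 71.4%, matching the values listed for those subjects.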

Question Type Breakdown

Question type    Correct  Incorrect  Unanswered  Accuracy  Average
Anatomy          2        4          1           28.6%     78.6%
Biostatistics    1        3          0           25.0%     79.8%
Diagnosis        40       41         7           45.5%     79.9%
Epidemiology     1        3          1           20.0%     76.7%
Ethics           1        2          0           33.3%     74.1%
Interpretation   18       19         5           42.9%     70.7%
Legal            2        1          1           50.0%     64.6%
Pathophysiology  8        11         8           29.6%     76.1%
Pharmacology     6        5          2           46.2%     83.3%
Prevention       8        2          2           66.7%     75.6%
Prognosis        2        3          2           28.6%     80.8%
Risk             1        4          0           20.0%     85.2%
Tests            9        14         4           33.3%     77.9%
Treatment        41       34         6           50.6%     77.3%

#    Answer  Correct  Status
1    A       B
2    C       A
3    C       C
4    A       B
5    A       A
6    C       C
7    C       C
8    A       A
9    C       A
10   D       D
11   C       D
12   D       D
13   A       B
14   B       D
15   C       -        Annulled
16   B       B
17   A       B
18   B       A
19   C       C
20   D       A
21   B       B
22   D       D
23   A       C
24   A       D
25   C       C
26   -       -        Annulled
27   D       C
28   A       -        Annulled
29   B       D
30   B       B
31   B       D
32   D       A
33   D       D
34   A       D
35   B       B
36   A       D
37   C       C
38   B       C
39   D       D
40   B       A
41   D       D
42   B       C
43   B       B
44   D       D
45   A       D
46   A       A
47   -       A
48   C       A
49   B       D
50   B       B
51   D       C
52   B       B
53   C       D
54   D       B
55   C       A
56   C       -        Annulled
57   C       C
58   B       B
59   B       D
60   B       A
61   A       A
62   D       D
63   B       B
64   D       D
65   A       A
66   A       A
67   B       B
68   C       B
69   A       B
70   D       A
71   B       D
72   C       A
73   A       D
74   D       C
75   A       A
76   B       B
77   B       B
78   B       B
79   A       C
80   B       C
81   A       C
82   C       D
83   D       B
84   D       D
85   B       C
86   B       C
87   A       A
88   C       D
89   C       B
90   A       A
91   B       B
92   -       C
93   B       B
94   C       C
95   C       A
96   C       C
97   -       D
98   D       C
99   -       A
100  -       C
101  A       B
102  D       D
103  A       A
104  C       C
105  A       A
106  D       C
107  C       B
108  A       D
109  A       B
110  C       C
111  -       A
112  B       C
113  B       B
114  B       D
115  C       D
116  B       C
117  A       A
118  C       D
119  C       C
120  A       B
121  -       D
122  C       C
123  C       C
124  B       C
125  D       D
126  B       D
127  A       B
128  D       D
129  -       A
130  D       D
131  D       D
132  A       A
133  B       B
134  C       C
135  C       B
136  C       C
137  C       A
138  A       D
139  D       D
140  -       B
141  B       A
142  B       A
143  B       B
144  B       B
145  B       D
146  C       C
147  C       B
148  B       A
149  D       A
150  A       D
151  D       A
152  A       A
153  -       B
154  B       B
155  B       B
156  C       C
157  A       A
158  -       C
159  B       C
160  A       A
161  -       A
162  B       -        Annulled
163  D       D
164  B       C
165  D       A
166  -       B
167  A       C
168  B       D
169  -       B
170  -       B
171  D       C
172  B       A
173  -       A
174  B       B
175  C       B
176  C       C
177  C       C
178  B       A
179  -       D
180  A       A
181  -       B
182  A       C
183  D       B
184  B       B
185  C       B
186  D       -        Annulled
187  C       C
188  D       D
189  B       D
190  B       A
191  C       B
192  B       A
193  C       C
194  A       A
195  A       A
196  A       A
197  B       B
198  -       C
199  -       D
200  B       C
201  B       B
202  B       A
203  D       D
204  -       C
205  -       B
206  C       D
207  A       A
208  C       C
209  C       C
210  -       B
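
How each row of the answer list maps to an outcome can be sketched as follows. This is an interpretation, not something the page states: it assumes a row with no model answer is an unanswered question, a row marked Annulled is excluded from scoring, and any other row is correct when the model's answer equals the correct option. The `classify` helper and the tuple layout are hypothetical.

```python
from collections import Counter


def classify(answer: str, correct: str, status: str = "") -> str:
    """Map one row of the answer list to an outcome label.

    Assumptions: an empty answer means the model did not respond,
    and annulled questions are not scored at all.
    """
    if status == "Annulled":
        return "annulled"
    if not answer:
        return "unanswered"
    return "correct" if answer == correct else "incorrect"


# (question, model answer, correct option, status) for a few rows
sample = [
    (1, "A", "B", ""),
    (3, "C", "C", ""),
    (15, "C", "", "Annulled"),
    (26, "", "", "Annulled"),
    (47, "", "A", ""),
]

tally = Counter(classify(a, c, s) for _, a, c, s in sample)
for outcome in ("correct", "incorrect", "unanswered", "annulled"):
    print(f"{outcome}: {tally[outcome]}")
```

Applied to questions 1 through 206, this reading gives 86 correct answers, which agrees with the total reported at the top of the page.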