MedicalBenchmark
NeverSleep: Lumimaid v0.2 8B provider

Lumimaid v0.2 8B

Rank: #276 of 291 models (MIR 2024)

Net score: 40.00 pts
Accuracy: 38.5%
Correct / Incorrect: 77 / 111
Total Cost: $0.04
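
The headline figures can be cross-checked with a little arithmetic. The sketch below is not taken from the benchmark's code: it assumes the usual MIR-style rule of a one-third point penalty per incorrect answer and a denominator of 200 scored questions, both of which happen to reproduce the 40.00 pts and 38.5% reported here. The function names are illustrative.

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> float:
    """Correct answers minus a fractional penalty per incorrect answer.

    The 1/3 penalty is an assumption, not something the page states.
    """
    return float(correct - incorrect * penalty)


def accuracy(correct: int, total: int) -> float:
    """Percentage of scored questions answered correctly, to one decimal."""
    return round(100 * correct / total, 1)


correct, incorrect = 77, 111
total_scored = 200  # assumption: implied by 77 correct at 38.5% accuracy

print(f"Net score: {net_score(correct, incorrect):.2f} pts")
print(f"Accuracy:  {accuracy(correct, total_scored)}%")
```

Under these assumptions, 200 - 77 - 111 leaves 12 scored questions without an answer.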

Overall Performance (vs. average)

| Metric | Model | Average |
|---|---|---|
| Accuracy | 38.5% | 80.5% |
| Net score | 40.00 pts | 150.85 pts |
| Correct | 77 | 161 |
| Incorrect | 111 | 30 |
| Total Cost | $0.04 | $3.32 |
| Average response time | 4.1s | 16.4s |
| Output Tokens | 56K | 427K |
| Reasoning Tokens | 0 | 310K |
| Average confidence | 92.1% | 95.4% |

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 1 | 1 | 1 | 33.3% | 90.5% |
| Anesthesiology and Resuscitation | 2 | 2 | 0 | 50.0% | 87.1% |
| Cardiology | 9 | 12 | 0 | 42.9% | 79.7% |
| Dermatology | 3 | 10 | 1 | 21.4% | 80.2% |
| Endocrinology and Nutrition | 8 | 8 | 3 | 42.1% | 84.2% |
| ENT | 4 | 3 | 0 | 57.1% | 74.4% |
| Epidemiology | 5 | 2 | 1 | 62.5% | 89.3% |
| Gastroenterology | 9 | 12 | 1 | 40.9% | 70.5% |
| Genetics | 3 | 4 | 0 | 42.9% | 86.5% |
| Geriatrics | 4 | 5 | 1 | 40.0% | 86.9% |
| Gynecology and Obstetrics | 2 | 11 | 1 | 14.3% | 81.2% |
| Health Planning and Management | 0 | 2 | 0 | 0.0% | 73.2% |
| Hematology | 3 | 10 | 0 | 23.1% | 81.5% |
| Immunology | 2 | 5 | 1 | 25.0% | 89.1% |
| Infectious Diseases | 8 | 15 | 0 | 34.8% | 81.8% |
| Legal Medicine and Bioethics | 2 | 0 | 0 | 100.0% | 91.7% |
| Medical Oncology | 8 | 13 | 0 | 38.1% | 80.2% |
| Nephrology | 4 | 8 | 1 | 30.8% | 80.8% |
| Neurology | 9 | 11 | 2 | 40.9% | 83.7% |
| Ophthalmology | 1 | 4 | 0 | 20.0% | 80.0% |
| Palliative Care | 2 | 2 | 0 | 50.0% | 88.2% |
| Pediatrics | 6 | 11 | 0 | 35.3% | 82.0% |
| Pharmacology | 10 | 11 | 2 | 43.5% | 85.4% |
| Psychiatry | 3 | 5 | 2 | 30.0% | 89.5% |
| Pulmonology | 5 | 14 | 0 | 26.3% | 80.6% |
| Radiology-Emergency | 5 | 8 | 1 | 35.7% | 64.9% |
| Rheumatology | 8 | 5 | 1 | 57.1% | 81.4% |
| Statistics | 3 | 0 | 0 | 100.0% | 91.1% |
| Traumatology | 7 | 6 | 2 | 46.7% | 74.5% |
| Urology | 2 | 3 | 1 | 33.3% | 78.2% |
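
The per-subject accuracy figures appear to count unanswered questions in the denominator: Allergology, with 1 correct, 1 incorrect and 1 unanswered, is listed at 33.3% rather than 50%. A minimal sketch of that calculation, using a hypothetical helper name and three rows copied from the breakdown above:

```python
def category_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Accuracy in percent, with unanswered questions counted as misses."""
    total = correct + incorrect + unanswered
    return round(100 * correct / total, 1) if total else 0.0


# (correct, incorrect, unanswered) as listed in the Subject Breakdown
rows = {
    "Allergology": (1, 1, 1),
    "Dermatology": (3, 10, 1),
    "Endocrinology and Nutrition": (8, 8, 3),
}

for subject, counts in rows.items():
    print(f"{subject}: {category_accuracy(*counts)}%")
```

The same rule is consistent with the other rows checked, though the page itself does not document the formula.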

Question Type Breakdown

| Question Type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 2 | 4 | 0 | 33.3% | 79.8% |
| Biostatistics | 4 | 1 | 0 | 80.0% | 90.7% |
| Diagnosis | 30 | 38 | 5 | 41.1% | 79.2% |
| Epidemiology | 6 | 5 | 1 | 50.0% | 81.2% |
| Ethics | 1 | 0 | 0 | 100.0% | 94.5% |
| Interpretation | 13 | 23 | 1 | 35.1% | 69.6% |
| Pathophysiology | 11 | 22 | 0 | 33.3% | 85.4% |
| Pharmacology | 10 | 11 | 4 | 40.0% | 84.0% |
| Prevention | 5 | 6 | 1 | 41.7% | 89.8% |
| Prognosis | 3 | 4 | 0 | 42.9% | 83.9% |
| Risk | 8 | 3 | 2 | 61.5% | 83.6% |
| Tests | 6 | 14 | 1 | 28.6% | 73.9% |
| Treatment | 26 | 40 | 5 | 36.6% | 81.3% |
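
| # | Answer | Correct | Status |
|---|---|---|---|
| 1 | B | B |  |
| 2 |  | D |  |
| 3 | B | B |  |
| 4 | A | C |  |
| 5 | B | C |  |
| 6 | B | B |  |
| 7 | D | D |  |
| 8 | C | C |  |
| 9 | C | A |  |
| 10 | B | D |  |
| 11 | D | D |  |
| 12 | C | A |  |
| 13 | D | C |  |
| 14 | D | A |  |
| 15 | A | B |  |
| 16 | D | A |  |
| 17 | C | C |  |
| 18 | A | A |  |
| 19 | D | B |  |
| 20 | C | C |  |
| 21 | C | D |  |
| 22 | A | B |  |
| 23 | C | A |  |
| 24 | D | A |  |
| 25 | D | C |  |
| 26 | D | B |  |
| 27 | D | C |  |
| 28 | D | A |  |
| 29 | B | B |  |
| 30 | B | C |  |
| 31 | C | D |  |
| 32 | D | A |  |
| 33 | A | C |  |
| 34 | B | B |  |
| 35 | D | D |  |
| 36 | A | D |  |
| 37 | C | A |  |
| 38 | D | A |  |
| 39 | D | C |  |
| 40 | B | B |  |
| 41 | B | C |  |
| 42 | D | D |  |
| 43 | C | A |  |
| 44 | D | D |  |
| 45 | D | D |  |
| 46 | B | B |  |
| 47 | C | C |  |
| 48 | C | C |  |
| 49 | B | B |  |
| 50 | C | C |  |
| 51 | A | A |  |
| 52 | C | D |  |
| 53 | C | C |  |
| 54 | D | B |  |
| 55 | C | C |  |
| 56 | C | D |  |
| 57 | C | A |  |
| 58 | D | A |  |
| 59 | C | A |  |
| 60 | C | A |  |
| 61 | C | A |  |
| 62 | D | D |  |
| 63 | D | D |  |
| 64 | A |  | Annulled |
| 65 |  | D |  |
| 66 | D | C |  |
| 67 | A | B |  |
| 68 | C |  | Annulled |
| 69 | A | A |  |
| 70 | C | B |  |
| 71 | A | B |  |
| 72 | D | D |  |
| 73 | D | B |  |
| 74 | B | C |  |
| 75 | B | B |  |
| 76 | C | A |  |
| 77 | D | D |  |
| 78 | B | C |  |
| 79 | C | B |  |
| 80 | A | A |  |
| 81 | D | C |  |
| 82 | A | C |  |
| 83 |  | B |  |
| 84 |  | C |  |
| 85 | B | A |  |
| 86 | D | A |  |
| 87 | D | B |  |
| 88 | D | D |  |
| 89 | C | B |  |
| 90 | C | A |  |
| 91 | B | D |  |
| 92 |  | A |  |
| 93 | C | C |  |
| 94 | B | B |  |
| 95 | B | D |  |
| 96 | B | B |  |
| 97 | B | B |  |
| 98 |  | B |  |
| 99 | A | A |  |
| 100 | B | B |  |
| 101 | A | A |  |
| 102 | D | D |  |
| 103 | B | B |  |
| 104 | A | D |  |
| 105 | C | B |  |
| 106 | D | C |  |
| 107 | C | C |  |
| 108 | B | B |  |
| 109 | C | D |  |
| 110 | D | D |  |
| 111 | A | B |  |
| 112 | A | C |  |
| 113 | B |  | Annulled |
| 114 | D | D |  |
| 115 | D | D |  |
| 116 | C | A |  |
| 117 | D | D |  |
| 118 | D | D |  |
| 119 | C | A |  |
| 120 | C | C |  |
| 121 | B | A |  |
| 122 | A | B |  |
| 123 | D | D |  |
| 124 | D | D |  |
| 125 | C | B |  |
| 126 | B | D |  |
| 127 | D | A |  |
| 128 | C | B |  |
| 129 | D | D |  |
| 130 | C | C |  |
| 131 | C | C |  |
| 132 | D | D |  |
| 133 | B | A |  |
| 134 | D | C |  |
| 135 | D | A |  |
| 136 | D | D |  |
| 137 | A | A |  |
| 138 |  | C |  |
| 139 | C | A |  |
| 140 | B | C |  |
| 141 | B | B |  |
| 142 | C | C |  |
| 143 | B | A |  |
| 144 | C | D |  |
| 145 | C | C |  |
| 146 | C | C |  |
| 147 | B | C |  |
| 148 |  | A |  |
| 149 | A | C |  |
| 150 | C | D |  |
| 151 | D | A |  |
| 152 | D | A |  |
| 153 | A | C |  |
| 154 | B | B |  |
| 155 | A | D |  |
| 156 | A | C |  |
| 157 | D | C |  |
| 158 | C | D |  |
| 159 | D | D |  |
| 160 | C | B |  |
| 161 | B | B |  |
| 162 | B | B |  |
| 163 | B | B |  |
| 164 | B | B |  |
| 165 | B | A |  |
| 166 | D | C |  |
| 167 | C | A |  |
| 168 | C | B |  |
| 169 | A | C |  |
| 170 | D | A |  |
| 171 | A | D |  |
| 172 | B | B |  |
| 173 |  | A |  |
| 174 | D | B |  |
| 175 | A | A |  |
| 176 | D | C |  |
| 177 | A | C |  |
| 178 | B | B |  |
| 179 | C | C |  |
| 180 | C |  | Annulled |
| 181 | C | B |  |
| 182 | D | D |  |
| 183 | C | C |  |
| 184 | D | A |  |
| 185 | D | C |  |
| 186 | D | D |  |
| 187 | A | A |  |
| 188 |  | C |  |
| 189 | C | D |  |
| 190 |  | D |  |
| 191 | A | B |  |
| 192 | A | B |  |
| 193 | B | C |  |
| 194 | D | C |  |
| 195 | C | C |  |
| 196 | C | B |  |
| 197 | D | A |  |
| 198 |  | B |  |
| 199 | C | D |  |
| 200 | B | A |  |
| 201 | A | B |  |
| 202 | D | D |  |
| 203 | C | B |  |
| 204 | A | D |  |
| 205 | D | D |  |
| 206 | A |  | Annulled |
| 207 | A | A |  |
| 208 | A | A |  |
| 209 | C | B |  |
| 210 | C | D |  |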
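
Each row of the per-question table can be reduced to one of four outcomes. The sketch below shows one way to do that for a handful of rows. It rests on two assumptions about the extracted data: a row with a single letter and no status is an unanswered question whose letter is the answer key, and a row marked Annulled carries only the model's answer. The `classify` helper is hypothetical, not part of the benchmark.

```python
from collections import Counter


def classify(answer: str, correct: str, status: str = "") -> str:
    """Map one table row to correct / incorrect / unanswered / annulled."""
    if status == "Annulled":
        return "annulled"      # annulled questions are excluded from scoring
    if not answer:
        return "unanswered"    # the model gave no answer
    return "correct" if answer == correct else "incorrect"


# (question number, model answer, answer key, status) for a few rows
sample_rows = [
    (1, "B", "B", ""),
    (2, "", "D", ""),           # assumption: lone letter is the answer key
    (4, "A", "C", ""),
    (64, "A", "", "Annulled"),  # assumption: lone letter is the model's answer
]

tally = Counter(classify(a, c, s) for _, a, c, s in sample_rows)
print(dict(tally))
```

Running the same classification over all 210 rows would give per-outcome counts that can be compared with the headline Correct / Incorrect figures.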