MedicalBenchmark
EleutherAI: Llemma 7b provider

Llemma 7b

#289 of 290 models (MIR 2026)

Net score: 9.33 pts
Accuracy: 8.5%
Correct / Incorrect: 17 / 23
Total Cost: $0.68

Overall Performance (vs. average)

| Metric | Llemma 7b | Average |
|---|---|---|
| Accuracy | 8.5% | 81.6% |
| Net score | 9.33 pts | 154.00 pts |
| Correct | 17 | 163 |
| Incorrect | 23 | 28 |
| Total Cost | $0.68 | $3.33 |
| Average response time | 72.4s | 16.2s |
| Output Tokens | 484K | 430K |
| Reasoning Tokens | 0 | 310K |
| Average confidence | 19.4% | 95.1% |
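
The headline figures above are consistent with a scoring rule in which each wrong answer costs one third of a point, unanswered questions score zero, and accuracy is taken over 200 scored questions. The page does not state this rule; it is an assumption inferred from the reported numbers (17 correct, 23 incorrect, 9.33 pts, 8.5%). A minimal Python sketch under that assumption, with illustrative function names:

```python
def net_score(correct: int, incorrect: int, penalty: float = 1 / 3) -> float:
    """Net score assuming +1 per correct answer and -penalty per wrong one.

    The 1/3 penalty is an assumption inferred from the reported figures,
    not something the page states.
    """
    return correct - incorrect * penalty


def accuracy(correct: int, scored_questions: int) -> float:
    """Accuracy as a percentage of all scored questions (unanswered count as misses)."""
    return 100 * correct / scored_questions


# Figures reported for Llemma 7b; 200 scored questions is an assumption.
print(round(net_score(17, 23), 2))  # 9.33
print(accuracy(17, 200))            # 8.5
```

Under this reading, leaving a question blank is cheaper than answering it wrongly, which is why the unanswered counts matter when comparing models.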

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 0 | 0 | 1 | 0.0% | 96.9% |
| Anesthesiology and Resuscitation | 0 | 1 | 6 | 0.0% | 69.6% |
| Cardiology | 1 | 4 | 20 | 4.0% | 77.3% |
| Dermatology | 1 | 0 | 10 | 9.1% | 72.3% |
| Endocrinology and Nutrition | 1 | 0 | 14 | 6.7% | 84.0% |
| ENT | 0 | 3 | 5 | 0.0% | 84.7% |
| Epidemiology | 1 | 1 | 5 | 14.3% | 80.2% |
| Gastroenterology | 2 | 6 | 22 | 6.7% | 79.3% |
| Genetics | 2 | 0 | 9 | 18.2% | 78.7% |
| Geriatrics | 2 | 2 | 9 | 15.4% | 83.0% |
| Gynecology and Obstetrics | 1 | 1 | 10 | 8.3% | 84.3% |
| Health Planning and Management | 2 | 0 | 8 | 20.0% | 78.4% |
| Hematology | 0 | 0 | 9 | 0.0% | 76.6% |
| Immunology | 0 | 1 | 5 | 0.0% | 91.4% |
| Infectious Diseases | 2 | 2 | 10 | 14.3% | 77.9% |
| Legal Medicine and Bioethics | 2 | 0 | 9 | 18.2% | 82.9% |
| Medical Oncology | 3 | 4 | 16 | 13.0% | 83.0% |
| Nephrology | 0 | 1 | 9 | 0.0% | 85.1% |
| Neurology | 0 | 3 | 10 | 0.0% | 88.6% |
| Ophthalmology | 0 | 0 | 5 | 0.0% | 83.7% |
| Palliative Care | 1 | 1 | 4 | 16.7% | 80.2% |
| Pediatrics | 2 | 3 | 17 | 9.1% | 87.6% |
| Pharmacology | 4 | 0 | 7 | 36.4% | 78.6% |
| Psychiatry | 0 | 0 | 8 | 0.0% | 87.9% |
| Pulmonology | 1 | 2 | 13 | 6.3% | 82.8% |
| Radiology-Emergency | 1 | 2 | 10 | 7.7% | 67.7% |
| Rheumatology | 2 | 0 | 9 | 18.2% | 88.4% |
| Statistics | 1 | 1 | 0 | 50.0% | 83.8% |
| Traumatology | 1 | 0 | 10 | 9.1% | 65.2% |
| Urology | 1 | 1 | 6 | 12.5% | 82.5% |

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 0 | 0 | 3 | 0.0% | 82.6% |
| Biostatistics | 1 | 1 | 0 | 50.0% | 83.8% |
| Diagnosis | 5 | 11 | 65 | 6.2% | 82.2% |
| Epidemiology | 1 | 1 | 7 | 11.1% | 88.7% |
| Ethics | 1 | 0 | 5 | 16.7% | 92.0% |
| Interpretation | 3 | 4 | 30 | 8.1% | 72.0% |
| Legal | 2 | 0 | 7 | 22.2% | 82.4% |
| Pathophysiology | 1 | 1 | 24 | 3.8% | 84.3% |
| Pharmacology | 3 | 2 | 10 | 20.0% | 82.3% |
| Prevention | 2 | 1 | 13 | 12.5% | 80.6% |
| Prognosis | 0 | 2 | 3 | 0.0% | 93.2% |
| Risk | 2 | 0 | 13 | 13.3% | 84.3% |
| Tests | 4 | 7 | 22 | 12.1% | 80.3% |
| Treatment | 4 | 10 | 58 | 5.6% | 80.1% |
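
| # | Answer | Correct | Status |
|---|---|---|---|
| 1 |  | A |  |
| 2 | B | B |  |
| 3 | C | D |  |
| 4 |  | D |  |
| 5 |  | B |  |
| 6 |  | C |  |
| 7 |  | B |  |
| 8 |  | D |  |
| 9 |  | D |  |
| 10 | A | A |  |
| 11 | C | B |  |
| 12 |  | A |  |
| 13 |  |  | Annulled |
| 14 | C | C |  |
| 15 |  | A |  |
| 16 |  | C |  |
| 17 |  | C |  |
| 18 |  | D |  |
| 19 |  | B |  |
| 20 |  | B |  |
| 21 |  | B |  |
| 22 |  | C |  |
| 23 |  | B |  |
| 24 |  | A |  |
| 25 | A | C |  |
| 26 |  | A |  |
| 27 |  | B |  |
| 28 |  | D |  |
| 29 | A | A |  |
| 30 |  | D |  |
| 31 |  | C |  |
| 32 |  | C |  |
| 33 | D | B |  |
| 34 |  | B |  |
| 35 | D | C |  |
| 36 |  | D |  |
| 37 |  | D |  |
| 38 | A | A |  |
| 39 |  | C |  |
| 40 |  | A |  |
| 41 |  | D |  |
| 42 |  | A |  |
| 43 |  | B |  |
| 44 | C | B |  |
| 45 | A | B |  |
| 46 |  | B |  |
| 47 |  | D |  |
| 48 |  | C |  |
| 49 |  | C |  |
| 50 |  |  | Annulled |
| 51 |  | A |  |
| 52 |  | C |  |
| 53 | A | C |  |
| 54 |  | C |  |
| 55 | A | A |  |
| 56 |  | A |  |
| 57 |  | A |  |
| 58 |  | A |  |
| 59 |  | B |  |
| 60 |  | B |  |
| 61 |  | C |  |
| 62 |  | B |  |
| 63 |  | B |  |
| 64 |  |  | Annulled |
| 65 |  | C |  |
| 66 | C | C |  |
| 67 |  | C |  |
| 68 |  | A |  |
| 69 |  | B |  |
| 70 |  | B |  |
| 71 | B | B |  |
| 72 |  | B |  |
| 73 |  | B |  |
| 74 | A | A |  |
| 75 |  | C |  |
| 76 |  | C |  |
| 77 |  | A |  |
| 78 |  | D |  |
| 79 |  | A |  |
| 80 |  | D |  |
| 81 | A | C |  |
| 82 |  | D |  |
| 83 |  | B |  |
| 84 |  | A |  |
| 85 |  | D |  |
| 86 | A | A |  |
| 87 |  | B |  |
| 88 | D | B |  |
| 89 |  | B |  |
| 90 |  | C |  |
| 91 |  | A |  |
| 92 |  | C |  |
| 93 |  | D |  |
| 94 |  | C |  |
| 95 |  | C |  |
| 96 |  | B |  |
| 97 | A | B |  |
| 98 |  | A |  |
| 99 |  | B |  |
| 100 |  | C |  |
| 101 | B | C |  |
| 102 |  | A |  |
| 103 | B | C |  |
| 104 | C | C |  |
| 105 |  | C |  |
| 106 |  | B |  |
| 107 |  | C |  |
| 108 |  | A |  |
| 109 | A | C |  |
| 110 |  | C |  |
| 111 |  | B |  |
| 112 |  | C |  |
| 113 | C | C |  |
| 114 |  | D |  |
| 115 |  | B |  |
| 116 |  | D |  |
| 117 |  | C |  |
| 118 |  | B |  |
| 119 |  | A |  |
| 120 |  | A |  |
| 121 |  | C |  |
| 122 |  | B |  |
| 123 | A | B |  |
| 124 |  | B |  |
| 125 |  | C |  |
| 126 |  | B |  |
| 127 |  | C |  |
| 128 | C | B |  |
| 129 |  | C |  |
| 130 |  | C |  |
| 131 |  | B |  |
| 132 |  | B |  |
| 133 |  | C |  |
| 134 |  | B |  |
| 135 |  | C |  |
| 136 | B | B |  |
| 137 |  | D |  |
| 138 |  | B |  |
| 139 |  |  | Annulled |
| 140 |  | D |  |
| 141 |  | A |  |
| 142 |  |  | Annulled |
| 143 |  | A |  |
| 144 |  | D |  |
| 145 |  | C |  |
| 146 |  | C |  |
| 147 |  | B |  |
| 148 |  | C |  |
| 149 | A | B |  |
| 150 |  | B |  |
| 151 |  | B |  |
| 152 |  | C |  |
| 153 | C | C |  |
| 154 |  | D |  |
| 155 |  | B |  |
| 156 |  | A |  |
| 157 | B | D |  |
| 158 |  | B |  |
| 159 |  | C |  |
| 160 |  | A |  |
| 161 | A |  | Annulled |
| 162 | A | A |  |
| 163 | D | C |  |
| 164 | D | C |  |
| 165 |  | A |  |
| 166 |  | A |  |
| 167 | A | D |  |
| 168 |  | C |  |
| 169 |  | B |  |
| 170 |  | B |  |
| 171 |  | A |  |
| 172 |  | B |  |
| 173 |  | D |  |
| 174 |  | C |  |
| 175 |  | B |  |
| 176 |  | A |  |
| 177 |  | D |  |
| 178 |  | D |  |
| 179 | B | B |  |
| 180 |  | C |  |
| 181 |  | B |  |
| 182 |  | C |  |
| 183 |  | D |  |
| 184 |  | D |  |
| 185 |  | B |  |
| 186 |  | D |  |
| 187 |  | D |  |
| 188 |  | C |  |
| 189 |  | C |  |
| 190 |  | B |  |
| 191 |  | D |  |
| 192 |  | D |  |
| 193 |  | C |  |
| 194 |  | C |  |
| 195 |  | D |  |
| 196 | A | A |  |
| 197 | A | C |  |
| 198 |  | A |  |
| 199 |  | B |  |
| 200 |  | D |  |
| 201 |  | C |  |
| 202 |  | A |  |
| 203 |  | A |  |
| 204 |  | C |  |
| 205 |  | B |  |
| 206 | B | C |  |
| 207 |  | A |  |
| 208 | D |  | Annulled |
| 209 |  | B |  |
| 210 |  | C |  |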