MedicalBenchmark
AlfredPros: CodeLLaMa 7B Instruct Solidity provider

CodeLLaMa 7B Instruct Solidity

290

#290 of 290 modelsMIR 2026

Net score

0.00 pts

Accuracy

1.5%

Correct / Incorrect

3 / 21

Total Cost

$0.32

Overall Performance

(vs. average)
Accuracy

1.5%

avg: 81.6%

Net score

0.00 pts

avg: 154.00 pts

Correct

3

avg: 163

Incorrect

21

avg: 28

Total Cost

$0.32

avg: $3.33

Average response time

27.9s

avg: 16.2s

Output Tokens

185K

avg: 430K

Reasoning Tokens

0

avg: 310K

Average confidence

16.5%

avg: 95.1%

Subject Breakdown

Allergology
Correct
0
Incorrect
0
Unanswered
1
Accuracy
0.0%
Average
96.9%
Anesthesiology and Resuscitation
Correct
0
Incorrect
2
Unanswered
5
Accuracy
0.0%
Average
69.6%
Cardiology
Correct
0
Incorrect
2
Unanswered
23
Accuracy
0.0%
Average
77.3%
Dermatology
Correct
0
Incorrect
1
Unanswered
10
Accuracy
0.0%
Average
72.3%
Endocrinology and Nutrition
Correct
0
Incorrect
3
Unanswered
12
Accuracy
0.0%
Average
84.0%
ENT
Correct
0
Incorrect
0
Unanswered
8
Accuracy
0.0%
Average
84.7%
Epidemiology
Correct
0
Incorrect
2
Unanswered
5
Accuracy
0.0%
Average
80.2%
Gastroenterology
Correct
1
Incorrect
5
Unanswered
24
Accuracy
3.3%
Average
79.3%
Genetics
Correct
0
Incorrect
1
Unanswered
10
Accuracy
0.0%
Average
78.7%
Geriatrics
Correct
1
Incorrect
1
Unanswered
11
Accuracy
7.7%
Average
83.0%
Gynecology and Obstetrics
Correct
0
Incorrect
2
Unanswered
10
Accuracy
0.0%
Average
84.3%
Health Planning and Management
Correct
0
Incorrect
1
Unanswered
9
Accuracy
0.0%
Average
78.4%
Hematology
Correct
0
Incorrect
0
Unanswered
9
Accuracy
0.0%
Average
76.6%
Immunology
Correct
0
Incorrect
1
Unanswered
5
Accuracy
0.0%
Average
91.4%
Infectious Diseases
Correct
0
Incorrect
3
Unanswered
11
Accuracy
0.0%
Average
77.9%
Legal Medicine and Bioethics
Correct
0
Incorrect
0
Unanswered
11
Accuracy
0.0%
Average
82.9%
Medical Oncology
Correct
1
Incorrect
3
Unanswered
19
Accuracy
4.3%
Average
83.0%
Nephrology
Correct
0
Incorrect
1
Unanswered
9
Accuracy
0.0%
Average
85.1%
Neurology
Correct
1
Incorrect
2
Unanswered
10
Accuracy
7.7%
Average
88.6%
Ophthalmology
Correct
0
Incorrect
1
Unanswered
4
Accuracy
0.0%
Average
83.7%
Palliative Care
Correct
0
Incorrect
0
Unanswered
6
Accuracy
0.0%
Average
80.2%
Pediatrics
Correct
1
Incorrect
1
Unanswered
20
Accuracy
4.5%
Average
87.6%
Pharmacology
Correct
0
Incorrect
1
Unanswered
10
Accuracy
0.0%
Average
78.6%
Psychiatry
Correct
0
Incorrect
0
Unanswered
8
Accuracy
0.0%
Average
87.9%
Pulmonology
Correct
1
Incorrect
1
Unanswered
14
Accuracy
6.3%
Average
82.8%
Radiology-Emergency
Correct
0
Incorrect
2
Unanswered
11
Accuracy
0.0%
Average
67.7%
Rheumatology
Correct
0
Incorrect
1
Unanswered
10
Accuracy
0.0%
Average
88.4%
Statistics
Correct
0
Incorrect
0
Unanswered
2
Accuracy
0.0%
Average
83.8%
Traumatology
Correct
0
Incorrect
1
Unanswered
10
Accuracy
0.0%
Average
65.2%
Urology
Correct
0
Incorrect
1
Unanswered
7
Accuracy
0.0%
Average
82.5%

Question Type Breakdown

Anatomy
Correct
0
Incorrect
0
Unanswered
3
Accuracy
0.0%
Average
82.6%
Biostatistics
Correct
0
Incorrect
0
Unanswered
2
Accuracy
0.0%
Average
83.8%
Diagnosis
Correct
1
Incorrect
6
Unanswered
74
Accuracy
1.2%
Average
82.2%
Epidemiology
Correct
0
Incorrect
1
Unanswered
8
Accuracy
0.0%
Average
88.7%
Ethics
Correct
0
Incorrect
0
Unanswered
6
Accuracy
0.0%
Average
92.0%
Interpretation
Correct
0
Incorrect
3
Unanswered
34
Accuracy
0.0%
Average
72.0%
Legal
Correct
0
Incorrect
0
Unanswered
9
Accuracy
0.0%
Average
82.4%
Pathophysiology
Correct
0
Incorrect
4
Unanswered
22
Accuracy
0.0%
Average
84.3%
Pharmacology
Correct
0
Incorrect
2
Unanswered
13
Accuracy
0.0%
Average
82.3%
Prevention
Correct
0
Incorrect
2
Unanswered
14
Accuracy
0.0%
Average
80.6%
Prognosis
Correct
0
Incorrect
0
Unanswered
5
Accuracy
0.0%
Average
93.2%
Risk
Correct
0
Incorrect
3
Unanswered
12
Accuracy
0.0%
Average
84.3%
Tests
Correct
1
Incorrect
2
Unanswered
30
Accuracy
3.0%
Average
80.3%
Treatment
Correct
2
Incorrect
8
Unanswered
62
Accuracy
2.8%
Average
80.1%
#AnswerCorrectStatus
1A
2B
3AD
4D
5B
6C
7B
8D
9D
10A
11B
12A
13Annulled
14DC
15A
16C
17C
18D
19B
20B
21B
22C
23B
24A
25C
26A
27B
28D
29A
30AD
31BC
32C
33B
34B
35C
36AD
37D
38A
39C
40A
41D
42A
43B
44B
45B
46B
47AD
48CC
49C
50Annulled
51CA
52C
53C
54DC
55A
56A
57A
58A
59B
60B
61C
62B
63B
64CAnnulled
65C
66C
67C
68A
69AB
70AB
71B
72B
73B
74A
75C
76C
77A
78D
79A
80D
81C
82D
83B
84A
85D
86A
87B
88AB
89B
90C
91A
92C
93D
94C
95C
96B
97B
98A
99DB
100C
101C
102A
103C
104C
105C
106B
107C
108A
109C
110C
111B
112C
113C
114D
115B
116AD
117C
118B
119A
120A
121C
122B
123B
124B
125C
126AB
127C
128B
129C
130C
131AB
132B
133C
134B
135AC
136B
137D
138B
139Annulled
140D
141A
142Annulled
143A
144D
145C
146C
147B
148BC
149B
150B
151B
152C
153C
154D
155B
156A
157D
158B
159C
160A
161Annulled
162A
163C
164C
165AA
166A
167BD
168C
169B
170B
171A
172B
173D
174C
175B
176A
177D
178D
179B
180C
181B
182C
183D
184D
185B
186AD
187CD
188C
189C
190B
191D
192D
193C
194C
195D
196A
197C
198A
199CB
200D
201C
202A
203A
204C
205B
206CC
207A
208Annulled
209B
210C