MedicalBenchmark
Provider: Meta

Llama 3.2 1B Instruct

#315 of 319 models (MIR 2025)

Overall Performance (vs. average)

| Metric | Llama 3.2 1B Instruct | Average |
|---|---|---|
| Accuracy | 14.5% | 77.9% |
| Net score | 6.66 pts | 143.96 pts |
| Correct | 29 | 156 |
| Incorrect | 67 | 35 |
| Total Cost | $0.03 | $3.36 |
| Average response time | 4.9s | 19.0s |
| Output Tokens | 121K | 430K |
| Reasoning Tokens | 0 | 306K |
| Average confidence | 47.1% | 95.2% |
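
The page does not state how the net score is derived. As a rough sketch, the reported figures are consistent with an assumed penalty of one third of a point per incorrect answer (29 correct and 67 incorrect give about 6.67, close to the 6.66 shown), and with accuracy being measured over 200 scored questions, since 29 correct at 14.5% implies that total. The function names, the `penalty` parameter, and the inferred count of 200 are illustrative assumptions, not values published on the page.

```python
def net_score(correct: int, incorrect: int, penalty: float = 1 / 3) -> float:
    """Net score under an ASSUMED penalty per incorrect answer (not stated on the page)."""
    return correct - penalty * incorrect


def accuracy_pct(correct: int, scored: int) -> float:
    """Accuracy as a percentage of all scored questions, unanswered included."""
    return 100.0 * correct / scored


correct, incorrect = 29, 67   # reported on the page
scored = 200                  # inferred from 29 correct at 14.5%, an assumption
unanswered = scored - correct - incorrect  # implied by the inference above

print(f"net score: {net_score(correct, incorrect):.2f} pts")
print(f"accuracy:  {accuracy_pct(correct, scored):.1f}%")
print(f"implied unanswered: {unanswered}")
```

If the benchmark uses a different penalty or truncates rather than rounds, the last digit of the net score may differ from what this sketch prints.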

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 0 | 0 | 4 | 0.0% | 87.9% |
| Anesthesiology and Resuscitation | 2 | 2 | 2 | 33.3% | 82.3% |
| Cardiology | 1 | 8 | 13 | 4.5% | 78.6% |
| Dermatology | 1 | 4 | 7 | 8.3% | 69.4% |
| Endocrinology and Nutrition | 2 | 7 | 7 | 12.5% | 83.5% |
| ENT | 2 | 4 | 2 | 25.0% | 74.8% |
| Epidemiology | 1 | 4 | 2 | 14.3% | 69.1% |
| Gastroenterology | 4 | 10 | 7 | 19.0% | 74.1% |
| Genetics | 3 | 1 | 2 | 50.0% | 69.5% |
| Geriatrics | 1 | 3 | 7 | 9.1% | 77.5% |
| Gynecology and Obstetrics | 3 | 7 | 9 | 15.8% | 86.7% |
| Health Planning and Management | 0 | 2 | 0 | 0.0% | 82.6% |
| Hematology | 3 | 2 | 6 | 27.3% | 82.7% |
| Immunology | 3 | 2 | 4 | 33.3% | 83.3% |
| Infectious Diseases | 4 | 10 | 13 | 14.8% | 74.9% |
| Legal Medicine and Bioethics | 1 | 2 | 2 | 20.0% | 68.4% |
| Medical Oncology | 7 | 8 | 10 | 28.0% | 87.2% |
| Nephrology | 0 | 3 | 11 | 0.0% | 84.8% |
| Neurology | 4 | 6 | 10 | 20.0% | 77.3% |
| Ophthalmology | 2 | 1 | 2 | 40.0% | 74.2% |
| Palliative Care | 0 | 1 | 3 | 0.0% | 78.6% |
| Pediatrics | 4 | 11 | 11 | 15.4% | 71.9% |
| Pharmacology | 1 | 5 | 11 | 5.9% | 74.1% |
| Psychiatry | 4 | 2 | 2 | 50.0% | 83.0% |
| Pulmonology | 0 | 5 | 9 | 0.0% | 80.4% |
| Radiology-Emergency | 1 | 5 | 8 | 7.1% | 69.4% |
| Rheumatology | 1 | 3 | 11 | 6.7% | 76.6% |
| Statistics | 1 | 2 | 0 | 33.3% | 76.6% |
| Traumatology | 2 | 6 | 10 | 11.1% | 79.3% |
| Urology | 0 | 4 | 3 | 0.0% | 80.7% |
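
The per-subject accuracy figures above appear to count unanswered questions in the denominator: each listed accuracy matches correct / (correct + incorrect + unanswered), rounded to one decimal. The sketch below recomputes a few rows to illustrate that reading. The helper `row_accuracy` is a hypothetical name, and the formula is inferred from the numbers rather than documented on the page.

```python
def row_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Percentage of correct answers over all questions in a row, unanswered included."""
    total = correct + incorrect + unanswered
    if total == 0:
        return 0.0  # empty row: nothing to score
    return round(100.0 * correct / total, 1)


# (subject, correct, incorrect, unanswered, accuracy shown on the page)
rows = [
    ("Anesthesiology and Resuscitation", 2, 2, 2, 33.3),
    ("Cardiology", 1, 8, 13, 4.5),
    ("Genetics", 3, 1, 2, 50.0),
    ("Medical Oncology", 7, 8, 10, 28.0),
]

for subject, c, i, u, shown in rows:
    computed = row_accuracy(c, i, u)
    status = "matches" if computed == shown else "differs"
    print(f"{subject}: computed {computed}%, page shows {shown}% ({status})")
```

Under this reading a low accuracy can reflect abstention as much as wrong answers: Cardiology has 13 unanswered questions against 8 incorrect ones.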

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 2 | 2 | 3 | 28.6% | 78.6% |
| Biostatistics | 1 | 3 | 0 | 25.0% | 79.8% |
| Diagnosis | 15 | 24 | 49 | 17.0% | 79.9% |
| Epidemiology | 1 | 3 | 1 | 20.0% | 76.7% |
| Ethics | 0 | 1 | 2 | 0.0% | 74.1% |
| Interpretation | 5 | 12 | 25 | 11.9% | 70.7% |
| Legal | 1 | 2 | 1 | 25.0% | 64.6% |
| Pathophysiology | 4 | 15 | 8 | 14.8% | 76.1% |
| Pharmacology | 3 | 4 | 6 | 23.1% | 83.3% |
| Prevention | 2 | 4 | 6 | 16.7% | 75.6% |
| Prognosis | 1 | 4 | 2 | 14.3% | 80.8% |
| Risk | 0 | 4 | 1 | 0.0% | 85.2% |
| Tests | 5 | 11 | 11 | 18.5% | 77.9% |
| Treatment | 8 | 23 | 50 | 9.9% | 77.3% |

# Answer Correct Status
1BB
2AA
3AC
4DB
5A
6CC
7BC
8BA
9A
10D
11D
12D
13B
14D
15BAnnulled
16B
17B
18A
19C
20A
21B
22D
23C
24AD
25C
26BAnnulled
27C
28CAnnulled
29AD
30DB
31AD
32A
33DD
34CD
35BB
36AD
37CC
38C
39D
40AA
41CD
42AC
43B
44CD
45AD
46AA
47A
48AA
49DD
50BB
51BC
52B
53BD
54B
55CA
56Annulled
57C
58BB
59D
60A
61A
62AD
63B
64BD
65A
66A
67B
68B
69B
70CA
71D
72AA
73DD
74BC
75BA
76B
77BB
78AB
79C
80AC
81CC
82D
83BB
84AD
85C
86DC
87A
88D
89B
90BA
91B
92AC
93B
94C
95A
96C
97D
98C
99CA
100C
101B
102CD
103BA
104C
105A
106C
107CB
108D
109B
110C
111DA
112C
113B
114BD
115D
116AC
117BA
118CD
119CC
120B
121DD
122C
123C
124CC
125BD
126CD
127BB
128BD
129BA
130D
131D
132BA
133B
134C
135B
136C
137BA
138D
139AD
140DB
141BA
142AA
143B
144BB
145D
146C
147CB
148A
149A
150D
151A
152A
153BB
154CB
155B
156C
157A
158C
159CC
160A
161BA
162BAnnulled
163D
164BC
165A
166B
167AC
168D
169B
170CB
171CC
172BA
173A
174B
175B
176CC
177C
178BA
179CD
180AA
181AB
182C
183DB
184AB
185B
186Annulled
187AC
188AD
189D
190CA
191B
192A
193C
194AA
195A
196BA
197B
198C
199D
200BC
201CB
202A
203AD
204C
205AB
206CD
207A
208C
209CC
210B