MedicalBenchmark
Meta: Llama 3.2 1B Instruct provider

Llama 3.2 1B Instruct

287

#287 of 290 modelsMIR 2025

Net score

6.66 pts

Accuracy

14.5%

Correct / Incorrect

29 / 67

Total Cost

$0.03

Overall Performance

(vs. average)
Accuracy

14.5%

avg: 75.9%

Net score

6.66 pts

avg: 138.99 pts

Correct

29

avg: 152

Incorrect

67

avg: 38

Total Cost

$0.03

avg: $3.59

Average response time

4.9s

avg: 18.1s

Output Tokens

121K

avg: 443K

Reasoning Tokens

0

avg: 320K

Average confidence

47.1%

avg: 94.7%

Subject Breakdown

Allergology
Correct
0
Incorrect
0
Unanswered
4
Accuracy
0.0%
Average
86.9%
Anesthesiology and Resuscitation
Correct
2
Incorrect
2
Unanswered
2
Accuracy
33.3%
Average
81.3%
Cardiology
Correct
1
Incorrect
8
Unanswered
13
Accuracy
4.5%
Average
77.4%
Dermatology
Correct
1
Incorrect
5
Unanswered
7
Accuracy
7.7%
Average
62.8%
Endocrinology and Nutrition
Correct
2
Incorrect
7
Unanswered
7
Accuracy
12.5%
Average
82.5%
ENT
Correct
2
Incorrect
4
Unanswered
2
Accuracy
25.0%
Average
73.8%
Epidemiology
Correct
1
Incorrect
4
Unanswered
2
Accuracy
14.3%
Average
67.1%
Gastroenterology
Correct
4
Incorrect
10
Unanswered
7
Accuracy
19.0%
Average
72.9%
Genetics
Correct
3
Incorrect
1
Unanswered
2
Accuracy
50.0%
Average
68.2%
Geriatrics
Correct
1
Incorrect
3
Unanswered
7
Accuracy
9.1%
Average
71.2%
Gynecology and Obstetrics
Correct
3
Incorrect
7
Unanswered
9
Accuracy
15.8%
Average
85.9%
Health Planning and Management
Correct
0
Incorrect
2
Unanswered
0
Accuracy
0.0%
Average
81.6%
Hematology
Correct
3
Incorrect
2
Unanswered
6
Accuracy
27.3%
Average
81.8%
Immunology
Correct
3
Incorrect
2
Unanswered
4
Accuracy
33.3%
Average
82.5%
Infectious Diseases
Correct
4
Incorrect
11
Unanswered
13
Accuracy
14.3%
Average
71.1%
Legal Medicine and Bioethics
Correct
1
Incorrect
2
Unanswered
2
Accuracy
20.0%
Average
67.2%
Medical Oncology
Correct
7
Incorrect
8
Unanswered
10
Accuracy
28.0%
Average
86.3%
Nephrology
Correct
0
Incorrect
4
Unanswered
11
Accuracy
0.0%
Average
78.2%
Neurology
Correct
4
Incorrect
6
Unanswered
10
Accuracy
20.0%
Average
76.2%
Ophthalmology
Correct
2
Incorrect
1
Unanswered
2
Accuracy
40.0%
Average
72.6%
Palliative Care
Correct
0
Incorrect
1
Unanswered
3
Accuracy
0.0%
Average
77.2%
Pediatrics
Correct
4
Incorrect
10
Unanswered
11
Accuracy
16.0%
Average
72.7%
Pharmacology
Correct
1
Incorrect
5
Unanswered
11
Accuracy
5.9%
Average
73.1%
Psychiatry
Correct
4
Incorrect
2
Unanswered
2
Accuracy
50.0%
Average
82.0%
Pulmonology
Correct
0
Incorrect
5
Unanswered
9
Accuracy
0.0%
Average
73.0%
Radiology-Emergency
Correct
1
Incorrect
5
Unanswered
8
Accuracy
7.1%
Average
67.9%
Rheumatology
Correct
1
Incorrect
2
Unanswered
11
Accuracy
7.1%
Average
74.6%
Statistics
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
74.9%
Traumatology
Correct
2
Incorrect
6
Unanswered
10
Accuracy
11.1%
Average
78.2%
Urology
Correct
0
Incorrect
4
Unanswered
3
Accuracy
0.0%
Average
79.5%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
2
Unanswered
3
Accuracy
28.6%
Average
77.1%
Biostatistics
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
78.4%
Diagnosis
Correct
15
Incorrect
25
Unanswered
49
Accuracy
16.9%
Average
77.9%
Epidemiology
Correct
1
Incorrect
3
Unanswered
1
Accuracy
20.0%
Average
75.0%
Ethics
Correct
0
Incorrect
1
Unanswered
2
Accuracy
0.0%
Average
72.0%
Interpretation
Correct
5
Incorrect
12
Unanswered
25
Accuracy
11.9%
Average
69.3%
Legal
Correct
1
Incorrect
2
Unanswered
1
Accuracy
25.0%
Average
63.6%
Pathophysiology
Correct
4
Incorrect
15
Unanswered
8
Accuracy
14.8%
Average
72.6%
Pharmacology
Correct
3
Incorrect
4
Unanswered
6
Accuracy
23.1%
Average
82.4%
Prevention
Correct
2
Incorrect
4
Unanswered
6
Accuracy
16.7%
Average
74.5%
Prognosis
Correct
1
Incorrect
3
Unanswered
2
Accuracy
16.7%
Average
77.8%
Risk
Correct
0
Incorrect
4
Unanswered
1
Accuracy
0.0%
Average
84.3%
Tests
Correct
5
Incorrect
10
Unanswered
11
Accuracy
19.2%
Average
76.3%
Treatment
Correct
8
Incorrect
24
Unanswered
50
Accuracy
9.8%
Average
75.2%
#AnswerCorrectStatus
1BB
2AA
3AC
4DB
5A
6CC
7BC
8BA
9A
10D
11D
12D
13B
14D
15B
16B
17B
18A
19C
20A
21B
22D
23C
24AD
25C
26BAnnulled
27C
28CAnnulled
29AD
30DB
31AD
32A
33DD
34CD
35BB
36AD
37CC
38C
39D
40AA
41CD
42AC
43B
44CD
45AD
46AA
47A
48AA
49DD
50BB
51BC
52B
53BD
54B
55CA
56Annulled
57C
58BB
59D
60A
61A
62AD
63B
64BD
65A
66A
67B
68B
69B
70CA
71D
72AA
73DD
74BC
75BA
76B
77BB
78AB
79C
80AC
81CC
82D
83BB
84AD
85C
86DC
87A
88D
89B
90BA
91B
92AC
93B
94C
95A
96C
97D
98C
99CA
100C
101B
102CD
103BA
104C
105A
106C
107CB
108D
109B
110C
111DA
112C
113B
114BD
115D
116AC
117BA
118CD
119CC
120B
121DD
122C
123C
124CC
125BD
126CD
127BB
128BD
129BA
130D
131D
132BA
133B
134C
135B
136C
137BA
138D
139AD
140DB
141BA
142AA
143B
144BB
145D
146C
147CB
148A
149A
150A
151A
152A
153BB
154CB
155B
156C
157A
158C
159CC
160A
161BA
162B
163D
164BC
165A
166B
167AC
168D
169B
170CB
171CC
172BA
173A
174B
175B
176CC
177C
178BA
179CD
180AA
181AB
182C
183DB
184AB
185B
186Annulled
187AC
188AD
189D
190CA
191B
192A
193C
194AA
195A
196BA
197B
198C
199D
200BC
201CB
202A
203AD
204C
205AB
206CD
207A
208C
209CC
210B