MedicalBenchmark
Meta: Llama 3.2 3B Instruct provider

Llama 3.2 3B Instruct

#277 of 290 models, MIR 2025

Net score: 27.66 pts
Accuracy: 31.5%
Correct / Incorrect: 63 / 106
Total Cost: $0.01

Overall Performance (vs. average)

| Metric | This model | Average |
| --- | --- | --- |
| Accuracy | 31.5% | 75.9% |
| Net score | 27.66 pts | 138.99 pts |
| Correct | 63 | 152 |
| Incorrect | 106 | 38 |
| Total Cost | $0.01 | $3.59 |
| Average response time | 19.5s | 18.1s |
| Output Tokens | 113K | 443K |
| Reasoning Tokens | 0 | 320K |
| Average confidence | 82.5% | 94.7% |
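
The page does not state how the net score or the accuracy figures are derived. The sketch below is one reading that is consistent with the published numbers, not a confirmed formula. It assumes the usual MIR-style penalty of one third of a point per incorrect answer, and that accuracy is the share of correct answers over all questions counted (correct, incorrect, and unanswered). The function names `net_score` and `accuracy_pct` are hypothetical, and the figure of 31 unanswered questions overall is inferred from an assumed total of 200 scored questions.

```python
def net_score(correct: int, incorrect: int, penalty: float = 1 / 3) -> float:
    """Net score under an assumed penalty of 1/3 point per incorrect answer."""
    return correct - incorrect * penalty


def accuracy_pct(correct: int, incorrect: int, unanswered: int) -> float:
    """Accuracy as a percentage of all questions counted (assumed definition)."""
    total = correct + incorrect + unanswered
    return 100 * correct / total if total else 0.0


# Headline figures: 63 correct, 106 incorrect.
print(round(net_score(63, 106), 2))         # 27.67 (the page shows 27.66)

# Cardiology row of the Subject Breakdown: 3 correct, 14 incorrect, 5 unanswered.
print(round(accuracy_pct(3, 14, 5), 1))     # 13.6

# Overall accuracy, assuming 200 scored questions (so 31 unanswered).
print(round(accuracy_pct(63, 106, 31), 1))  # 31.5
```

The small gap between 27.67 and the reported 27.66 would be explained by truncation rather than rounding, though the page does not confirm this.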

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Allergology | 1 | 3 | 0 | 25.0% | 86.9% |
| Anesthesiology and Resuscitation | 1 | 3 | 2 | 16.7% | 81.3% |
| Cardiology | 3 | 14 | 5 | 13.6% | 77.4% |
| Dermatology | 4 | 7 | 2 | 30.8% | 62.8% |
| Endocrinology and Nutrition | 9 | 4 | 3 | 56.3% | 82.5% |
| ENT | 0 | 7 | 1 | 0.0% | 73.8% |
| Epidemiology | 2 | 4 | 1 | 28.6% | 67.1% |
| Gastroenterology | 3 | 15 | 3 | 14.3% | 72.9% |
| Genetics | 1 | 3 | 2 | 16.7% | 68.2% |
| Geriatrics | 4 | 6 | 1 | 36.4% | 71.2% |
| Gynecology and Obstetrics | 8 | 5 | 6 | 42.1% | 85.9% |
| Health Planning and Management | 0 | 2 | 0 | 0.0% | 81.6% |
| Hematology | 4 | 5 | 2 | 36.4% | 81.8% |
| Immunology | 5 | 3 | 1 | 55.6% | 82.5% |
| Infectious Diseases | 11 | 12 | 5 | 39.3% | 71.1% |
| Legal Medicine and Bioethics | 3 | 2 | 0 | 60.0% | 67.2% |
| Medical Oncology | 11 | 11 | 3 | 44.0% | 86.3% |
| Nephrology | 7 | 7 | 1 | 46.7% | 78.2% |
| Neurology | 7 | 7 | 6 | 35.0% | 76.2% |
| Ophthalmology | 1 | 3 | 1 | 20.0% | 72.6% |
| Palliative Care | 2 | 1 | 1 | 50.0% | 77.2% |
| Pediatrics | 9 | 13 | 3 | 36.0% | 72.7% |
| Pharmacology | 6 | 9 | 2 | 35.3% | 73.1% |
| Psychiatry | 3 | 5 | 0 | 37.5% | 82.0% |
| Pulmonology | 3 | 10 | 1 | 21.4% | 73.0% |
| Radiology-Emergency | 2 | 11 | 1 | 14.3% | 67.9% |
| Rheumatology | 6 | 7 | 1 | 42.9% | 74.6% |
| Statistics | 0 | 2 | 1 | 0.0% | 74.9% |
| Traumatology | 10 | 7 | 1 | 55.6% | 78.2% |
| Urology | 3 | 3 | 1 | 42.9% | 79.5% |

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Anatomy | 3 | 3 | 1 | 42.9% | 77.1% |
| Biostatistics | 0 | 3 | 1 | 0.0% | 78.4% |
| Diagnosis | 31 | 44 | 14 | 34.8% | 77.9% |
| Epidemiology | 2 | 3 | 0 | 40.0% | 75.0% |
| Ethics | 2 | 1 | 0 | 66.7% | 72.0% |
| Interpretation | 9 | 26 | 7 | 21.4% | 69.3% |
| Legal | 2 | 2 | 0 | 50.0% | 63.6% |
| Pathophysiology | 10 | 14 | 3 | 37.0% | 72.6% |
| Pharmacology | 4 | 7 | 2 | 30.8% | 82.4% |
| Prevention | 5 | 5 | 2 | 41.7% | 74.5% |
| Prognosis | 0 | 6 | 0 | 0.0% | 77.8% |
| Risk | 2 | 1 | 2 | 40.0% | 84.3% |
| Tests | 8 | 15 | 3 | 30.8% | 76.3% |
| Treatment | 24 | 45 | 13 | 29.3% | 75.2% |

No. | Answer | Correct | Status
1DB
2A
3DC
4AB
5A
6CC
7AC
8AA
9AA
10BD
11CD
12BD
13AB
14CD
15D
16AB
17AB
18CA
19BC
20CA
21CB
22AD
23C
24DD
25CC
26BAnnulled
27AC
28DAnnulled
29CD
30BB
31CD
32DA
33CD
34D
35BB
36BD
37CC
38BC
39DD
40BA
41D
42BC
43BB
44D
45CD
46A
47CA
48AA
49AD
50BB
51DC
52AB
53BD
54DB
55CA
56Annulled
57AC
58BB
59DD
60A
61A
62DD
63CB
64DD
65A
66AA
67BB
68AB
69DB
70BA
71DD
72CA
73CD
74DC
75AA
76CB
77BB
78BB
79C
80C
81C
82D
83AB
84D
85BC
86BC
87A
88BD
89BB
90AA
91BB
92CC
93BB
94CC
95AA
96CC
97D
98BC
99CA
100DC
101AB
102CD
103BA
104BC
105DA
106C
107BB
108BD
109AB
110CC
111CA
112C
113AB
114AD
115CD
116AC
117DA
118D
119AC
120B
121CD
122AC
123CC
124DC
125CD
126AD
127AB
128D
129CA
130BD
131DD
132AA
133AB
134CC
135B
136CC
137BA
138AD
139BD
140AB
141AA
142AA
143BB
144BB
145AD
146CC
147BB
148A
149DA
150DA
151AA
152AA
153BB
154BB
155BB
156C
157AA
158C
159AC
160BA
161AA
162
163BD
164CC
165DA
166CB
167C
168BD
169DB
170DB
171BC
172BA
173CA
174CB
175DB
176CC
177DC
178AA
179CD
180AA
181CB
182CC
183AB
184BB
185B
186Annulled
187CC
188CD
189DD
190AA
191B
192AA
193AC
194BA
195CA
196AA
197AB
198DC
199CD
200CC
201CB
202A
203DD
204DC
205AB
206CD
207BA
208AC
209CC
210B