MedicalBenchmark
Meta: Llama 3.2 3B Instruct

#305 of 319 models, MIR 2025

Net score: 28.66 pts
Accuracy: 32.0%
Correct / Incorrect: 64 / 106
Total Cost: $0.01

Overall Performance (vs. average)

| Metric | This model | Average |
| --- | --- | --- |
| Accuracy | 32.0% | 77.9% |
| Net score | 28.66 pts | 143.96 pts |
| Correct | 64 | 156 |
| Incorrect | 106 | 35 |
| Total Cost | $0.01 | $3.36 |
| Average response time | 19.5s | 19.0s |
| Output Tokens | 113K | 430K |
| Reasoning Tokens | 0 | 306K |
| Average confidence | 82.5% | 95.2% |
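
The headline figures above agree with one another under two assumptions that the page itself does not state: accuracy is the number of correct answers divided by all scored questions (answered or not), and the net score subtracts one third of a point for each incorrect answer, as the MIR exam's penalty rule does. The following is a minimal sketch under those assumptions. The function names are illustrative, and the count of 30 unanswered questions is inferred from 64 correct at 32.0% accuracy (200 scored questions), not reported on the page.

```python
from fractions import Fraction


def accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Share of correct answers over all scored questions (assumed definition)."""
    total = correct + incorrect + unanswered
    return correct / total


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> float:
    """Correct answers minus a per-error penalty (assumed to be 1/3 point)."""
    return float(correct - incorrect * penalty)


CORRECT, INCORRECT = 64, 106
UNANSWERED = 30  # inferred: 64 / 0.32 = 200 scored questions, minus 64 and 106

print(f"accuracy:  {accuracy(CORRECT, INCORRECT, UNANSWERED):.1%}")
print(f"net score: {net_score(CORRECT, INCORRECT):.4f}")
```

Under these assumptions the net score comes out at roughly 28.67, which the page appears to display truncated as 28.66. The same accuracy definition also matches the per-subject rows, for example Cardiology at 3 / (3 + 14 + 5), or about 13.6%.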

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Allergology | 1 | 3 | 0 | 25.0% | 87.9% |
| Anesthesiology and Resuscitation | 1 | 3 | 2 | 16.7% | 82.3% |
| Cardiology | 3 | 14 | 5 | 13.6% | 78.6% |
| Dermatology | 4 | 7 | 1 | 33.3% | 69.4% |
| Endocrinology and Nutrition | 9 | 4 | 3 | 56.3% | 83.5% |
| ENT | 0 | 7 | 1 | 0.0% | 74.8% |
| Epidemiology | 2 | 4 | 1 | 28.6% | 69.1% |
| Gastroenterology | 3 | 15 | 3 | 14.3% | 74.1% |
| Genetics | 1 | 3 | 2 | 16.7% | 69.5% |
| Geriatrics | 5 | 5 | 1 | 45.5% | 77.5% |
| Gynecology and Obstetrics | 8 | 5 | 6 | 42.1% | 86.7% |
| Health Planning and Management | 0 | 2 | 0 | 0.0% | 82.6% |
| Hematology | 4 | 5 | 2 | 36.4% | 82.7% |
| Immunology | 5 | 3 | 1 | 55.6% | 83.3% |
| Infectious Diseases | 11 | 12 | 4 | 40.7% | 74.9% |
| Legal Medicine and Bioethics | 3 | 2 | 0 | 60.0% | 68.4% |
| Medical Oncology | 11 | 11 | 3 | 44.0% | 87.2% |
| Nephrology | 7 | 6 | 1 | 50.0% | 84.8% |
| Neurology | 7 | 7 | 6 | 35.0% | 77.3% |
| Ophthalmology | 1 | 3 | 1 | 20.0% | 74.2% |
| Palliative Care | 2 | 1 | 1 | 50.0% | 78.6% |
| Pediatrics | 9 | 14 | 3 | 34.6% | 71.9% |
| Pharmacology | 6 | 9 | 2 | 35.3% | 74.1% |
| Psychiatry | 3 | 5 | 0 | 37.5% | 83.0% |
| Pulmonology | 3 | 10 | 1 | 21.4% | 80.4% |
| Radiology-Emergency | 2 | 11 | 1 | 14.3% | 69.4% |
| Rheumatology | 6 | 8 | 1 | 40.0% | 76.6% |
| Statistics | 0 | 2 | 1 | 0.0% | 76.6% |
| Traumatology | 10 | 7 | 1 | 55.6% | 79.3% |
| Urology | 3 | 3 | 1 | 42.9% | 80.7% |

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Anatomy | 3 | 3 | 1 | 42.9% | 78.6% |
| Biostatistics | 0 | 3 | 1 | 0.0% | 79.8% |
| Diagnosis | 31 | 43 | 14 | 35.2% | 79.9% |
| Epidemiology | 2 | 3 | 0 | 40.0% | 76.7% |
| Ethics | 2 | 1 | 0 | 66.7% | 74.1% |
| Interpretation | 9 | 26 | 7 | 21.4% | 70.7% |
| Legal | 2 | 2 | 0 | 50.0% | 64.6% |
| Pathophysiology | 11 | 13 | 3 | 40.7% | 76.1% |
| Pharmacology | 4 | 7 | 2 | 30.8% | 83.3% |
| Prevention | 5 | 5 | 2 | 41.7% | 75.6% |
| Prognosis | 0 | 7 | 0 | 0.0% | 80.8% |
| Risk | 2 | 1 | 2 | 40.0% | 85.2% |
| Tests | 8 | 16 | 3 | 29.6% | 77.9% |
| Treatment | 24 | 45 | 12 | 29.6% | 77.3% |

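| # | Answer | Correct | Status |
| --- | --- | --- | --- |
| 1 | D | B |  |
| 2 |  | A |  |
| 3 | D | C |  |
| 4 | A | B |  |
| 5 |  | A |  |
| 6 | C | C |  |
| 7 | A | C |  |
| 8 | A | A |  |
| 9 | A | A |  |
| 10 | B | D |  |
| 11 | C | D |  |
| 12 | B | D |  |
| 13 | A | B |  |
| 14 | C | D |  |
| 15 | D |  | Annulled |
| 16 | A | B |  |
| 17 | A | B |  |
| 18 | C | A |  |
| 19 | B | C |  |
| 20 | C | A |  |
| 21 | C | B |  |
| 22 | A | D |  |
| 23 |  | C |  |
| 24 | D | D |  |
| 25 | C | C |  |
| 26 | B |  | Annulled |
| 27 | A | C |  |
| 28 | D |  | Annulled |
| 29 | C | D |  |
| 30 | B | B |  |
| 31 | C | D |  |
| 32 | D | A |  |
| 33 | C | D |  |
| 34 |  | D |  |
| 35 | B | B |  |
| 36 | B | D |  |
| 37 | C | C |  |
| 38 | B | C |  |
| 39 | D | D |  |
| 40 | B | A |  |
| 41 |  | D |  |
| 42 | B | C |  |
| 43 | B | B |  |
| 44 |  | D |  |
| 45 | C | D |  |
| 46 |  | A |  |
| 47 | C | A |  |
| 48 | A | A |  |
| 49 | A | D |  |
| 50 | B | B |  |
| 51 | D | C |  |
| 52 | A | B |  |
| 53 | B | D |  |
| 54 | D | B |  |
| 55 | C | A |  |
| 56 |  |  | Annulled |
| 57 | A | C |  |
| 58 | B | B |  |
| 59 | D | D |  |
| 60 |  | A |  |
| 61 |  | A |  |
| 62 | D | D |  |
| 63 | C | B |  |
| 64 | D | D |  |
| 65 |  | A |  |
| 66 | A | A |  |
| 67 | B | B |  |
| 68 | A | B |  |
| 69 | D | B |  |
| 70 | B | A |  |
| 71 | D | D |  |
| 72 | C | A |  |
| 73 | C | D |  |
| 74 | D | C |  |
| 75 | A | A |  |
| 76 | C | B |  |
| 77 | B | B |  |
| 78 | B | B |  |
| 79 |  | C |  |
| 80 |  | C |  |
| 81 |  | C |  |
| 82 |  | D |  |
| 83 | A | B |  |
| 84 |  | D |  |
| 85 | B | C |  |
| 86 | B | C |  |
| 87 |  | A |  |
| 88 | B | D |  |
| 89 | B | B |  |
| 90 | A | A |  |
| 91 | B | B |  |
| 92 | C | C |  |
| 93 | B | B |  |
| 94 | C | C |  |
| 95 | A | A |  |
| 96 | C | C |  |
| 97 |  | D |  |
| 98 | B | C |  |
| 99 | C | A |  |
| 100 | D | C |  |
| 101 | A | B |  |
| 102 | C | D |  |
| 103 | B | A |  |
| 104 | B | C |  |
| 105 | D | A |  |
| 106 |  | C |  |
| 107 | B | B |  |
| 108 | B | D |  |
| 109 | A | B |  |
| 110 | C | C |  |
| 111 | C | A |  |
| 112 |  | C |  |
| 113 | A | B |  |
| 114 | A | D |  |
| 115 | C | D |  |
| 116 | A | C |  |
| 117 | D | A |  |
| 118 |  | D |  |
| 119 | A | C |  |
| 120 |  | B |  |
| 121 | C | D |  |
| 122 | A | C |  |
| 123 | C | C |  |
| 124 | D | C |  |
| 125 | C | D |  |
| 126 | A | D |  |
| 127 | A | B |  |
| 128 |  | D |  |
| 129 | C | A |  |
| 130 | B | D |  |
| 131 | D | D |  |
| 132 | A | A |  |
| 133 | A | B |  |
| 134 | C | C |  |
| 135 |  | B |  |
| 136 | C | C |  |
| 137 | B | A |  |
| 138 | A | D |  |
| 139 | B | D |  |
| 140 | A | B |  |
| 141 | A | A |  |
| 142 | A | A |  |
| 143 | B | B |  |
| 144 | B | B |  |
| 145 | A | D |  |
| 146 | C | C |  |
| 147 | B | B |  |
| 148 |  | A |  |
| 149 | D | A |  |
| 150 | D | D |  |
| 151 | A | A |  |
| 152 | A | A |  |
| 153 | B | B |  |
| 154 | B | B |  |
| 155 | B | B |  |
| 156 |  | C |  |
| 157 | A | A |  |
| 158 |  | C |  |
| 159 | A | C |  |
| 160 | B | A |  |
| 161 | A | A |  |
| 162 |  |  | Annulled |
| 163 | B | D |  |
| 164 | C | C |  |
| 165 | D | A |  |
| 166 | C | B |  |
| 167 |  | C |  |
| 168 | B | D |  |
| 169 | D | B |  |
| 170 | D | B |  |
| 171 | B | C |  |
| 172 | B | A |  |
| 173 | C | A |  |
| 174 | C | B |  |
| 175 | D | B |  |
| 176 | C | C |  |
| 177 | D | C |  |
| 178 | A | A |  |
| 179 | C | D |  |
| 180 | A | A |  |
| 181 | C | B |  |
| 182 | C | C |  |
| 183 | A | B |  |
| 184 | B | B |  |
| 185 |  | B |  |
| 186 |  |  | Annulled |
| 187 | C | C |  |
| 188 | C | D |  |
| 189 | D | D |  |
| 190 | A | A |  |
| 191 |  | B |  |
| 192 | A | A |  |
| 193 | A | C |  |
| 194 | B | A |  |
| 195 | C | A |  |
| 196 | A | A |  |
| 197 | A | B |  |
| 198 | D | C |  |
| 199 | C | D |  |
| 200 | C | C |  |
| 201 | C | B |  |
| 202 |  | A |  |
| 203 | D | D |  |
| 204 | D | C |  |
| 205 | A | B |  |
| 206 | C | D |  |
| 207 | B | A |  |
| 208 | A | C |  |
| 209 | C | C |  |
| 210 |  | B |  |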