MedicalBenchmark
Llama 3.2 11B Vision Instruct
Provider: Meta

#304 of 319 models, MIR 2025

Net score: 30.33 pts
Accuracy: 23.0%
Correct / Incorrect: 46 / 47
Total Cost: $0.04

Overall Performance (vs. average)

| Metric | Model | Average |
|---|---|---|
| Accuracy | 23.0% | 77.9% |
| Net score | 30.33 pts | 143.96 pts |
| Correct | 46 | 156 |
| Incorrect | 47 | 35 |
| Total Cost | $0.04 | $3.36 |
| Average response time | 50.8s | 19.0s |
| Output Tokens | 171K | 430K |
| Reasoning Tokens | 0 | 306K |
| Average confidence | 45.4% | 95.2% |
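The headline figures above are consistent with negative marking of one third of a point per wrong answer (46 - 47/3 ≈ 30.33) and with accuracy computed over 200 scored questions (46/200 = 23.0%). The page does not state either rule, so both the penalty and the 200-question denominator are assumptions inferred from the numbers. A minimal sketch under those assumptions (the function names are hypothetical):

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> float:
    """Net score with a fractional penalty per wrong answer.

    Assumption: unanswered questions score zero and each wrong
    answer costs `penalty` points (1/3 fits the figures shown).
    """
    return float(correct - incorrect * penalty)


def accuracy_pct(correct: int, scored_questions: int) -> float:
    """Accuracy as a percentage of all scored questions."""
    return 100 * correct / scored_questions


# Figures from the summary; 200 scored questions is an assumption.
score = net_score(correct=46, incorrect=47)
acc = accuracy_pct(correct=46, scored_questions=200)
print(f"{score:.2f} pts, {acc:.1f}%")
```

Under this reading, a model that leaves many questions unanswered keeps its penalty low but cannot raise its accuracy, which fits the gap between this model and the average.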

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 0 | 2 | 2 | 0.0% | 87.9% |
| Anesthesiology and Resuscitation | 3 | 1 | 2 | 50.0% | 82.3% |
| Cardiology | 1 | 4 | 17 | 4.5% | 78.6% |
| Dermatology | 5 | 4 | 3 | 41.7% | 69.4% |
| Endocrinology and Nutrition | 4 | 1 | 11 | 25.0% | 83.5% |
| ENT | 0 | 2 | 6 | 0.0% | 74.8% |
| Epidemiology | 0 | 2 | 5 | 0.0% | 69.1% |
| Gastroenterology | 8 | 4 | 9 | 38.1% | 74.1% |
| Genetics | 0 | 2 | 4 | 0.0% | 69.5% |
| Geriatrics | 2 | 4 | 5 | 18.2% | 77.5% |
| Gynecology and Obstetrics | 5 | 4 | 10 | 26.3% | 86.7% |
| Health Planning and Management | 0 | 0 | 2 | 0.0% | 82.6% |
| Hematology | 1 | 3 | 7 | 9.1% | 82.7% |
| Immunology | 3 | 2 | 4 | 33.3% | 83.3% |
| Infectious Diseases | 8 | 5 | 14 | 29.6% | 74.9% |
| Legal Medicine and Bioethics | 1 | 2 | 2 | 20.0% | 68.4% |
| Medical Oncology | 4 | 7 | 14 | 16.0% | 87.2% |
| Nephrology | 3 | 3 | 8 | 21.4% | 84.8% |
| Neurology | 4 | 5 | 11 | 20.0% | 77.3% |
| Ophthalmology | 0 | 1 | 4 | 0.0% | 74.2% |
| Palliative Care | 1 | 1 | 2 | 25.0% | 78.6% |
| Pediatrics | 7 | 6 | 13 | 26.9% | 71.9% |
| Pharmacology | 9 | 2 | 6 | 52.9% | 74.1% |
| Psychiatry | 4 | 0 | 4 | 50.0% | 83.0% |
| Pulmonology | 4 | 5 | 5 | 28.6% | 80.4% |
| Radiology-Emergency | 4 | 4 | 6 | 28.6% | 69.4% |
| Rheumatology | 3 | 5 | 7 | 20.0% | 76.6% |
| Statistics | 0 | 2 | 1 | 0.0% | 76.6% |
| Traumatology | 2 | 5 | 11 | 11.1% | 79.3% |
| Urology | 1 | 2 | 4 | 14.3% | 80.7% |

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 1 | 0 | 6 | 14.3% | 78.6% |
| Biostatistics | 0 | 2 | 2 | 0.0% | 79.8% |
| Diagnosis | 19 | 22 | 47 | 21.6% | 79.9% |
| Epidemiology | 1 | 1 | 3 | 20.0% | 76.7% |
| Ethics | 0 | 1 | 2 | 0.0% | 74.1% |
| Interpretation | 5 | 11 | 26 | 11.9% | 70.7% |
| Legal | 1 | 2 | 1 | 25.0% | 64.6% |
| Pathophysiology | 4 | 10 | 13 | 14.8% | 76.1% |
| Pharmacology | 8 | 0 | 5 | 61.5% | 83.3% |
| Prevention | 4 | 2 | 6 | 33.3% | 75.6% |
| Prognosis | 3 | 1 | 3 | 42.9% | 80.8% |
| Risk | 1 | 2 | 2 | 20.0% | 85.2% |
| Tests | 6 | 8 | 13 | 22.2% | 77.9% |
| Treatment | 20 | 17 | 44 | 24.7% | 77.3% |
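
The per-category accuracies in the breakdowns above match correct answers divided by all questions in the category, with unanswered questions counted in the denominator. That rule is inferred from the listed counts rather than stated on the page. A minimal sketch (the helper name is hypothetical), checked against three of the question-type rows:

```python
def category_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Percentage of a category's questions answered correctly.

    Assumption: unanswered questions count toward the denominator,
    which is what the listed figures imply.
    """
    total = correct + incorrect + unanswered
    return 0.0 if total == 0 else 100 * correct / total


# (correct, incorrect, unanswered) copied from the question-type rows.
rows = {
    "Diagnosis": (19, 22, 47),
    "Pharmacology": (8, 0, 5),
    "Treatment": (20, 17, 44),
}

for name, counts in rows.items():
    print(f"{name}: {category_accuracy(*counts):.1f}%")
```

Counting unanswered questions this way explains why a category such as Pharmacology, with no incorrect answers at all, still sits well below 100%.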

# Answer Correct Status
1 B
2 A
3 D C
4 A B
5 A
6 C
7 C
8 B A
9 A
10 D
11 D
12 D
13 A B
14 B D
15 D Annulled
16 B
17 B
18 A A
19 C C
20 D A
21 C B
22 D
23 A C
24 B D
25 C C
26 Annulled
27 C
28 C Annulled
29 D
30 C B
31 A D
32 A
33 D
34 A D
35 B B
36 B D
37 C
38 C
39 C D
40 B A
41 D
42 C
43 B
44 D
45 B D
46 A A
47 A
48 A A
49 D
50 B
51 D C
52 B
53 D
54 D B
55 A
56 Annulled
57 C
58 B
59 D
60 B A
61 A A
62 C D
63 B B
64 D
65 A
66 A A
67 B B
68 B
69 B
70 A A
71 D D
72 A A
73 D
74 C
75 A A
76 B B
77 B
78 B
79 B C
80 C
81 C
82 D
83 B B
84 B D
85 C
86 C
87 A
88 A D
89 B B
90 D A
91 B
92 C
93 B
94 C
95 C A
96 C C
97 D
98 C
99 A
100 C
101 B
102 D
103 A
104 C C
105 A
106 C
107 B
108 D
109 B
110 C C
111 A
112 B C
113 A B
114 D D
115 C D
116 C
117 A A
118 D D
119 C
120 B B
121 D
122 C C
123 C
124 A C
125 B D
126 D
127 B
128 D D
129 B A
130 D
131 D D
132 A
133 B
134 C
135 B
136 A C
137 A A
138 D
139 D D
140 C B
141 A A
142 A
143 B
144 B
145 D
146 C
147 C B
148 B A
149 C A
150 A D
151 A
152 A A
153 B
154 B
155 B
156 C
157 A A
158 C C
159 D C
160 A
161 A
162 B Annulled
163 D D
164 C
165 A
166 B
167 C
168 D
169 B
170 B
171 C C
172 A A
173 A
174 B B
175 A B
176 C
177 C
178 B A
179 D
180 A A
181 A B
182 C
183 D B
184 B
185 B B
186 Annulled
187 C
188 D D
189 B D
190 B A
191 C B
192 A
193 C C
194 A
195 A
196 A A
197 B
198 D C
199 D D
200 C
201 C B
202 A
203 D
204 C C
205 B B
206 D D
207 A
208 C
209 C C
210 B
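
The per-question rows above can be tallied with a small parser. This is a sketch under loud assumptions: a row with two letters is read as the model's answer followed by the answer key; a row with one letter is read as an unanswered question where only the key is shown; any row marked "Annulled" is excluded from scoring. The page does not confirm this column layout, and the function name is hypothetical.

```python
import re
from collections import Counter

# Question number, up to two option letters, optional "Annulled" marker.
# Whitespace between fields is optional so fused rows like "3DC" also parse.
ROW = re.compile(r"(\d+)\s*([A-D]?)\s*([A-D]?)\s*(Annulled)?")


def classify(row: str) -> str:
    """Classify one row as correct, incorrect, unanswered, or annulled."""
    match = ROW.fullmatch(row.strip())
    if match is None:
        raise ValueError(f"unrecognized row: {row!r}")
    _, first, second, annulled = match.groups()
    if annulled:
        return "annulled"
    if first and second:
        # Assumption: first letter is the model's answer, second is the key.
        return "correct" if first == second else "incorrect"
    # Assumption: a lone letter is the key and the model gave no answer.
    return "unanswered"


sample = ["1B", "3DC", "18AA", "15DAnnulled", "26Annulled"]
tally = Counter(classify(row) for row in sample)
print(dict(tally))
```

Running the same function over every row gives a quick cross-check of the summary counts, provided the column assumptions hold.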