MedicalBenchmark
Provider: Meta

Llama 3.2 11B Vision Instruct

#274 of 291 models, MIR 2024

Net score: 44.66 pts
Accuracy: 30.5%
Correct / Incorrect: 61 / 49
Total Cost: $0.04
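
The headline figures above appear to be linked by simple arithmetic. The sketch below is a hedged reconstruction, not the benchmark's documented method: it assumes 200 scored questions (implied by 61 correct answers at 30.5% accuracy) and a penalty of one third of a point per incorrect answer. Neither assumption is stated on this page, and the function and parameter names are illustrative.

```python
def headline_scores(correct, incorrect, scored_questions=200, penalty=1 / 3):
    """Return (accuracy_percent, net_score) under the assumed scoring rule.

    scored_questions and penalty are assumptions inferred from the
    reported figures; they are not given on the page.
    """
    accuracy = 100 * correct / scored_questions
    net = correct - penalty * incorrect
    return accuracy, net


accuracy, net = headline_scores(correct=61, incorrect=49)
print(f"accuracy  = {accuracy:.1f}%")
print(f"net score = {net:.4f} pts")
```

Under these assumptions the accuracy matches the reported 30.5%, and the net score comes out just under 44.67, which agrees with the reported 44.66 pts if the page truncates rather than rounds.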

Overall Performance (vs. average)

| Metric | This model | Average |
| --- | --- | --- |
| Accuracy | 30.5% | 80.5% |
| Net score | 44.66 pts | 150.85 pts |
| Correct | 61 | 161 |
| Incorrect | 49 | 30 |
| Total Cost | $0.04 | $3.32 |
| Average response time | 53.9s | 16.4s |
| Output Tokens | 207K | 427K |
| Reasoning Tokens | 0 | 310K |
| Average confidence | 57.2% | 95.4% |

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Allergology | 0 | 0 | 3 | 0.0% | 90.5% |
| Anesthesiology and Resuscitation | 3 | 1 | 0 | 75.0% | 87.1% |
| Cardiology | 6 | 5 | 10 | 28.6% | 79.7% |
| Dermatology | 7 | 4 | 3 | 50.0% | 80.2% |
| Endocrinology and Nutrition | 8 | 5 | 6 | 42.1% | 84.2% |
| ENT | 2 | 2 | 3 | 28.6% | 74.4% |
| Epidemiology | 3 | 2 | 3 | 37.5% | 89.3% |
| Gastroenterology | 3 | 5 | 14 | 13.6% | 70.5% |
| Genetics | 3 | 1 | 3 | 42.9% | 86.5% |
| Geriatrics | 5 | 0 | 5 | 50.0% | 86.9% |
| Gynecology and Obstetrics | 2 | 5 | 7 | 14.3% | 81.2% |
| Health Planning and Management | 1 | 0 | 1 | 50.0% | 73.2% |
| Hematology | 3 | 2 | 8 | 23.1% | 81.5% |
| Immunology | 3 | 2 | 3 | 37.5% | 89.1% |
| Infectious Diseases | 6 | 7 | 10 | 26.1% | 81.8% |
| Legal Medicine and Bioethics | 1 | 0 | 1 | 50.0% | 91.7% |
| Medical Oncology | 4 | 7 | 10 | 19.0% | 80.2% |
| Nephrology | 4 | 2 | 7 | 30.8% | 80.8% |
| Neurology | 9 | 4 | 9 | 40.9% | 83.7% |
| Ophthalmology | 3 | 0 | 2 | 60.0% | 80.0% |
| Palliative Care | 0 | 0 | 4 | 0.0% | 88.2% |
| Pediatrics | 1 | 7 | 9 | 5.9% | 82.0% |
| Pharmacology | 9 | 7 | 7 | 39.1% | 85.4% |
| Psychiatry | 5 | 1 | 4 | 50.0% | 89.5% |
| Pulmonology | 4 | 0 | 15 | 21.1% | 80.6% |
| Radiology-Emergency | 4 | 6 | 4 | 28.6% | 64.9% |
| Rheumatology | 7 | 0 | 7 | 50.0% | 81.4% |
| Statistics | 1 | 1 | 1 | 33.3% | 91.1% |
| Traumatology | 2 | 4 | 9 | 13.3% | 74.5% |
| Urology | 0 | 3 | 3 | 0.0% | 78.2% |
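
The per-subject accuracy figures above are consistent with dividing the correct count by all questions in the row, unanswered ones included. The following is a minimal sketch of that relationship, checked against three rows from the breakdown; the function name is illustrative and the formula is inferred from the numbers rather than documented on the page.

```python
def row_accuracy(correct, incorrect, unanswered):
    """Accuracy in percent, counting unanswered questions in the denominator.

    This formula is an inference from the reported figures, not a
    documented definition.
    """
    total = correct + incorrect + unanswered
    return 100 * correct / total if total else 0.0


# (correct, incorrect, unanswered) copied from the subject breakdown
rows = {
    "Cardiology": (6, 5, 10),
    "Pediatrics": (1, 7, 9),
    "Ophthalmology": (3, 0, 2),
}

for subject, counts in rows.items():
    print(f"{subject}: {row_accuracy(*counts):.1f}%")
```

Because unanswered questions count against accuracy, a subject such as Pulmonology (4 correct, 0 incorrect, 15 unanswered) scores 21.1% despite having no wrong answers.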

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Anatomy | 2 | 1 | 3 | 33.3% | 79.8% |
| Biostatistics | 3 | 1 | 1 | 60.0% | 90.7% |
| Diagnosis | 24 | 17 | 32 | 32.9% | 79.2% |
| Epidemiology | 3 | 3 | 6 | 25.0% | 81.2% |
| Ethics | 0 | 0 | 1 | 0.0% | 94.5% |
| Interpretation | 12 | 12 | 13 | 32.4% | 69.6% |
| Pathophysiology | 11 | 9 | 13 | 33.3% | 85.4% |
| Pharmacology | 9 | 5 | 11 | 36.0% | 84.0% |
| Prevention | 6 | 1 | 5 | 50.0% | 89.8% |
| Prognosis | 2 | 2 | 3 | 28.6% | 83.9% |
| Risk | 5 | 1 | 7 | 38.5% | 83.6% |
| Tests | 5 | 8 | 8 | 23.8% | 73.9% |
| Treatment | 17 | 19 | 35 | 23.9% | 81.3% |
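
| # | Answer | Correct | Status |
| --- | --- | --- | --- |
| 1 | B | B | |
| 2 | | D | |
| 3 | B | B | |
| 4 | A | C | |
| 5 | B | C | |
| 6 | B | B | |
| 7 | D | D | |
| 8 | C | C | |
| 9 | C | A | |
| 10 | | D | |
| 11 | A | D | |
| 12 | B | A | |
| 13 | B | C | |
| 14 | | A | |
| 15 | B | B | |
| 16 | B | A | |
| 17 | C | C | |
| 18 | | A | |
| 19 | A | B | |
| 20 | C | C | |
| 21 | A | D | |
| 22 | | B | |
| 23 | A | A | |
| 24 | D | A | |
| 25 | C | C | |
| 26 | | B | |
| 27 | D | C | |
| 28 | | A | |
| 29 | | B | |
| 30 | | C | |
| 31 | D | D | |
| 32 | | A | |
| 33 | | C | |
| 34 | C | B | |
| 35 | D | D | |
| 36 | D | D | |
| 37 | C | A | |
| 38 | D | A | |
| 39 | | C | |
| 40 | | B | |
| 41 | B | C | |
| 42 | | D | |
| 43 | A | A | |
| 44 | A | D | |
| 45 | | D | |
| 46 | B | B | |
| 47 | C | C | |
| 48 | C | C | |
| 49 | C | B | |
| 50 | C | C | |
| 51 | C | A | |
| 52 | D | D | |
| 53 | B | C | |
| 54 | B | B | |
| 55 | A | C | |
| 56 | | D | |
| 57 | C | A | |
| 58 | | A | |
| 59 | | A | |
| 60 | A | A | |
| 61 | | A | |
| 62 | | D | |
| 63 | D | D | |
| 64 | B | | Annulled |
| 65 | | D | |
| 66 | | C | |
| 67 | | B | |
| 68 | B | | Annulled |
| 69 | A | A | |
| 70 | C | B | |
| 71 | C | B | |
| 72 | D | D | |
| 73 | C | B | |
| 74 | | C | |
| 75 | | B | |
| 76 | | A | |
| 77 | B | D | |
| 78 | D | C | |
| 79 | C | B | |
| 80 | A | A | |
| 81 | D | C | |
| 82 | C | C | |
| 83 | | B | |
| 84 | C | C | |
| 85 | | A | |
| 86 | | A | |
| 87 | D | B | |
| 88 | | D | |
| 89 | B | B | |
| 90 | A | A | |
| 91 | | D | |
| 92 | D | A | |
| 93 | | C | |
| 94 | | B | |
| 95 | C | D | |
| 96 | B | B | |
| 97 | | B | |
| 98 | | B | |
| 99 | B | A | |
| 100 | | B | |
| 101 | | A | |
| 102 | D | D | |
| 103 | | B | |
| 104 | C | D | |
| 105 | | B | |
| 106 | B | C | |
| 107 | C | C | |
| 108 | B | B | |
| 109 | | D | |
| 110 | | D | |
| 111 | B | B | |
| 112 | | C | |
| 113 | | | Annulled |
| 114 | D | D | |
| 115 | D | D | |
| 116 | | A | |
| 117 | C | D | |
| 118 | | D | |
| 119 | | A | |
| 120 | | C | |
| 121 | | A | |
| 122 | | B | |
| 123 | | D | |
| 124 | | D | |
| 125 | B | B | |
| 126 | | D | |
| 127 | | A | |
| 128 | | B | |
| 129 | D | D | |
| 130 | | C | |
| 131 | A | C | |
| 132 | | D | |
| 133 | | A | |
| 134 | | C | |
| 135 | D | A | |
| 136 | | D | |
| 137 | | A | |
| 138 | | C | |
| 139 | A | A | |
| 140 | C | C | |
| 141 | | B | |
| 142 | C | C | |
| 143 | | A | |
| 144 | | D | |
| 145 | | C | |
| 146 | | C | |
| 147 | D | C | |
| 148 | | A | |
| 149 | | C | |
| 150 | | D | |
| 151 | | A | |
| 152 | | A | |
| 153 | B | C | |
| 154 | B | B | |
| 155 | | D | |
| 156 | C | C | |
| 157 | C | C | |
| 158 | D | D | |
| 159 | | D | |
| 160 | | B | |
| 161 | B | B | |
| 162 | | B | |
| 163 | | B | |
| 164 | D | B | |
| 165 | C | A | |
| 166 | C | C | |
| 167 | | A | |
| 168 | B | B | |
| 169 | C | C | |
| 170 | C | A | |
| 171 | C | D | |
| 172 | B | B | |
| 173 | | A | |
| 174 | | B | |
| 175 | A | A | |
| 176 | C | C | |
| 177 | | C | |
| 178 | | B | |
| 179 | C | C | |
| 180 | | | Annulled |
| 181 | | B | |
| 182 | | D | |
| 183 | | C | |
| 184 | | A | |
| 185 | | C | |
| 186 | D | D | |
| 187 | D | A | |
| 188 | | C | |
| 189 | B | D | |
| 190 | D | D | |
| 191 | | B | |
| 192 | B | B | |
| 193 | C | C | |
| 194 | A | C | |
| 195 | C | C | |
| 196 | | B | |
| 197 | D | A | |
| 198 | | B | |
| 199 | C | D | |
| 200 | A | A | |
| 201 | B | B | |
| 202 | | D | |
| 203 | C | B | |
| 204 | D | D | |
| 205 | A | D | |
| 206 | | | Annulled |
| 207 | | A | |
| 208 | B | A | |
| 209 | A | B | |
| 210 | A | D | |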