MedicalBenchmark
Meta: Llama 3.2 1B Instruct provider

Llama 3.2 1B Instruct

289

#289 of 291 modelsMIR 2024

Net score

7.33 pts

Accuracy

14.5%

Correct / Incorrect

29 / 65

Total Cost

$0.02

Overall Performance

(vs. average)
Accuracy

14.5%

avg: 80.5%

Net score

7.33 pts

avg: 150.85 pts

Correct

29

avg: 161

Incorrect

65

avg: 30

Total Cost

$0.02

avg: $3.32

Average response time

3.6s

avg: 16.4s

Output Tokens

101K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

47.8%

avg: 95.4%

Subject Breakdown

Allergology
Correct
0
Incorrect
1
Unanswered
2
Accuracy
0.0%
Average
90.5%
Anesthesiology and Resuscitation
Correct
0
Incorrect
0
Unanswered
4
Accuracy
0.0%
Average
87.1%
Cardiology
Correct
1
Incorrect
11
Unanswered
9
Accuracy
4.8%
Average
79.7%
Dermatology
Correct
2
Incorrect
5
Unanswered
7
Accuracy
14.3%
Average
80.2%
Endocrinology and Nutrition
Correct
5
Incorrect
9
Unanswered
5
Accuracy
26.3%
Average
84.2%
ENT
Correct
3
Incorrect
2
Unanswered
2
Accuracy
42.9%
Average
74.4%
Epidemiology
Correct
3
Incorrect
1
Unanswered
4
Accuracy
37.5%
Average
89.3%
Gastroenterology
Correct
1
Incorrect
6
Unanswered
15
Accuracy
4.5%
Average
70.5%
Genetics
Correct
1
Incorrect
2
Unanswered
4
Accuracy
14.3%
Average
86.5%
Geriatrics
Correct
1
Incorrect
5
Unanswered
4
Accuracy
10.0%
Average
86.9%
Gynecology and Obstetrics
Correct
3
Incorrect
4
Unanswered
7
Accuracy
21.4%
Average
81.2%
Health Planning and Management
Correct
0
Incorrect
1
Unanswered
1
Accuracy
0.0%
Average
73.2%
Hematology
Correct
2
Incorrect
4
Unanswered
7
Accuracy
15.4%
Average
81.5%
Immunology
Correct
2
Incorrect
3
Unanswered
3
Accuracy
25.0%
Average
89.1%
Infectious Diseases
Correct
2
Incorrect
10
Unanswered
11
Accuracy
8.7%
Average
81.8%
Legal Medicine and Bioethics
Correct
0
Incorrect
0
Unanswered
2
Accuracy
0.0%
Average
91.7%
Medical Oncology
Correct
6
Incorrect
5
Unanswered
10
Accuracy
28.6%
Average
80.2%
Nephrology
Correct
1
Incorrect
6
Unanswered
6
Accuracy
7.7%
Average
80.8%
Neurology
Correct
3
Incorrect
6
Unanswered
13
Accuracy
13.6%
Average
83.7%
Ophthalmology
Correct
1
Incorrect
1
Unanswered
3
Accuracy
20.0%
Average
80.0%
Palliative Care
Correct
1
Incorrect
1
Unanswered
2
Accuracy
25.0%
Average
88.2%
Pediatrics
Correct
3
Incorrect
1
Unanswered
13
Accuracy
17.6%
Average
82.0%
Pharmacology
Correct
4
Incorrect
10
Unanswered
9
Accuracy
17.4%
Average
85.4%
Psychiatry
Correct
2
Incorrect
0
Unanswered
8
Accuracy
20.0%
Average
89.5%
Pulmonology
Correct
2
Incorrect
6
Unanswered
11
Accuracy
10.5%
Average
80.6%
Radiology-Emergency
Correct
0
Incorrect
3
Unanswered
11
Accuracy
0.0%
Average
64.9%
Rheumatology
Correct
1
Incorrect
6
Unanswered
7
Accuracy
7.1%
Average
81.4%
Statistics
Correct
1
Incorrect
0
Unanswered
2
Accuracy
33.3%
Average
91.1%
Traumatology
Correct
2
Incorrect
7
Unanswered
6
Accuracy
13.3%
Average
74.5%
Urology
Correct
0
Incorrect
2
Unanswered
4
Accuracy
0.0%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
79.8%
Biostatistics
Correct
1
Incorrect
0
Unanswered
4
Accuracy
20.0%
Average
90.7%
Diagnosis
Correct
10
Incorrect
19
Unanswered
44
Accuracy
13.7%
Average
79.2%
Epidemiology
Correct
3
Incorrect
3
Unanswered
6
Accuracy
25.0%
Average
81.2%
Ethics
Correct
0
Incorrect
0
Unanswered
1
Accuracy
0.0%
Average
94.5%
Interpretation
Correct
3
Incorrect
9
Unanswered
25
Accuracy
8.1%
Average
69.6%
Pathophysiology
Correct
8
Incorrect
12
Unanswered
13
Accuracy
24.2%
Average
85.4%
Pharmacology
Correct
3
Incorrect
11
Unanswered
11
Accuracy
12.0%
Average
84.0%
Prevention
Correct
3
Incorrect
5
Unanswered
4
Accuracy
25.0%
Average
89.8%
Prognosis
Correct
0
Incorrect
1
Unanswered
6
Accuracy
0.0%
Average
83.9%
Risk
Correct
0
Incorrect
4
Unanswered
9
Accuracy
0.0%
Average
83.6%
Tests
Correct
4
Incorrect
4
Unanswered
13
Accuracy
19.0%
Average
73.9%
Treatment
Correct
10
Incorrect
27
Unanswered
34
Accuracy
14.1%
Average
81.3%
#AnswerCorrectStatus
1AB
2D
3B
4AC
5C
6B
7BD
8C
9A
10D
11D
12A
13BC
14A
15B
16AA
17C
18A
19CB
20C
21AD
22B
23A
24AA
25C
26B
27AC
28CA
29AB
30AC
31AD
32AA
33C
34AB
35BD
36CD
37AA
38AA
39C
40B
41BC
42AD
43CA
44D
45D
46BB
47C
48CC
49CB
50C
51AA
52D
53C
54B
55AC
56DD
57DA
58CA
59A
60AA
61AA
62D
63D
64CAnnulled
65D
66BC
67B
68Annulled
69A
70DB
71BB
72D
73B
74C
75AB
76A
77D
78CC
79B
80A
81CC
82C
83B
84C
85AA
86CA
87B
88D
89B
90AA
91D
92A
93C
94B
95CD
96B
97B
98B
99AA
100CB
101BA
102AD
103BB
104D
105B
106C
107C
108B
109D
110BD
111AB
112DC
113BAnnulled
114D
115D
116DA
117D
118D
119A
120C
121A
122AB
123BD
124DD
125B
126AD
127A
128B
129D
130C
131AC
132D
133A
134AC
135A
136D
137BA
138AC
139A
140C
141B
142BC
143BA
144BD
145AC
146AC
147C
148A
149C
150CD
151A
152A
153BC
154B
155AD
156C
157CC
158D
159CD
160CB
161B
162AB
163BB
164DB
165A
166C
167AA
168AB
169AC
170A
171D
172B
173A
174B
175CA
176DC
177DC
178DB
179DC
180CAnnulled
181AB
182D
183C
184BA
185CC
186D
187A
188DC
189CD
190DD
191BB
192B
193C
194C
195AC
196B
197AA
198B
199CD
200AA
201BB
202D
203B
204DD
205D
206BAnnulled
207CA
208BA
209AB
210D