MedicalBenchmark
Qwen2.5 Coder 32B Instruct provider

Qwen2.5 Coder 32B Instruct

249

#249 of 291 modelsMIR 2024

Net score

92.33 pts

Accuracy

56.5%

Correct / Incorrect

113 / 62

Total Cost

$0.08

Overall Performance

(vs. average)
Accuracy

56.5%

avg: 80.5%

Net score

92.33 pts

avg: 150.85 pts

Correct

113

avg: 161

Incorrect

62

avg: 30

Total Cost

$0.08

avg: $3.32

Average response time

15.1s

avg: 16.4s

Output Tokens

121K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

85.8%

avg: 95.4%

Subject Breakdown

Allergology
Correct
2
Incorrect
0
Unanswered
1
Accuracy
66.7%
Average
90.5%
Anesthesiology and Resuscitation
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
87.1%
Cardiology
Correct
11
Incorrect
7
Unanswered
3
Accuracy
52.4%
Average
79.7%
Dermatology
Correct
8
Incorrect
6
Unanswered
0
Accuracy
57.1%
Average
80.2%
Endocrinology and Nutrition
Correct
9
Incorrect
6
Unanswered
4
Accuracy
47.4%
Average
84.2%
ENT
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
74.4%
Epidemiology
Correct
6
Incorrect
0
Unanswered
2
Accuracy
75.0%
Average
89.3%
Gastroenterology
Correct
10
Incorrect
7
Unanswered
5
Accuracy
45.5%
Average
70.5%
Genetics
Correct
2
Incorrect
4
Unanswered
1
Accuracy
28.6%
Average
86.5%
Geriatrics
Correct
7
Incorrect
1
Unanswered
2
Accuracy
70.0%
Average
86.9%
Gynecology and Obstetrics
Correct
7
Incorrect
5
Unanswered
2
Accuracy
50.0%
Average
81.2%
Health Planning and Management
Correct
0
Incorrect
1
Unanswered
1
Accuracy
0.0%
Average
73.2%
Hematology
Correct
7
Incorrect
3
Unanswered
3
Accuracy
53.8%
Average
81.5%
Immunology
Correct
6
Incorrect
1
Unanswered
1
Accuracy
75.0%
Average
89.1%
Infectious Diseases
Correct
15
Incorrect
6
Unanswered
2
Accuracy
65.2%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
7
Incorrect
10
Unanswered
4
Accuracy
33.3%
Average
80.2%
Nephrology
Correct
6
Incorrect
5
Unanswered
2
Accuracy
46.2%
Average
80.8%
Neurology
Correct
17
Incorrect
4
Unanswered
1
Accuracy
77.3%
Average
83.7%
Ophthalmology
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
80.0%
Palliative Care
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
88.2%
Pediatrics
Correct
10
Incorrect
7
Unanswered
0
Accuracy
58.8%
Average
82.0%
Pharmacology
Correct
17
Incorrect
4
Unanswered
2
Accuracy
73.9%
Average
85.4%
Psychiatry
Correct
7
Incorrect
2
Unanswered
1
Accuracy
70.0%
Average
89.5%
Pulmonology
Correct
11
Incorrect
5
Unanswered
3
Accuracy
57.9%
Average
80.6%
Radiology-Emergency
Correct
4
Incorrect
7
Unanswered
3
Accuracy
28.6%
Average
64.9%
Rheumatology
Correct
8
Incorrect
5
Unanswered
1
Accuracy
57.1%
Average
81.4%
Statistics
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.1%
Traumatology
Correct
8
Incorrect
4
Unanswered
3
Accuracy
53.3%
Average
74.5%
Urology
Correct
1
Incorrect
4
Unanswered
1
Accuracy
16.7%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
79.8%
Biostatistics
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
90.7%
Diagnosis
Correct
42
Incorrect
23
Unanswered
8
Accuracy
57.5%
Average
79.2%
Epidemiology
Correct
8
Incorrect
1
Unanswered
3
Accuracy
66.7%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
18
Incorrect
16
Unanswered
3
Accuracy
48.6%
Average
69.6%
Pathophysiology
Correct
18
Incorrect
10
Unanswered
5
Accuracy
54.5%
Average
85.4%
Pharmacology
Correct
17
Incorrect
4
Unanswered
4
Accuracy
68.0%
Average
84.0%
Prevention
Correct
9
Incorrect
2
Unanswered
1
Accuracy
75.0%
Average
89.8%
Prognosis
Correct
3
Incorrect
2
Unanswered
2
Accuracy
42.9%
Average
83.9%
Risk
Correct
8
Incorrect
3
Unanswered
2
Accuracy
61.5%
Average
83.6%
Tests
Correct
6
Incorrect
13
Unanswered
2
Accuracy
28.6%
Average
73.9%
Treatment
Correct
42
Incorrect
21
Unanswered
8
Accuracy
59.2%
Average
81.3%
#AnswerCorrectStatus
1AB
2CD
3BB
4BC
5BC
6BB
7DD
8CC
9BA
10DD
11DD
12AA
13DC
14BA
15CB
16BA
17CC
18AA
19AB
20DC
21D
22BB
23AA
24CA
25DC
26BB
27AC
28AA
29BB
30DC
31AD
32A
33DC
34CB
35DD
36CD
37AA
38AA
39CC
40BB
41CC
42D
43AA
44DD
45DD
46BB
47CC
48CC
49BB
50CC
51A
52DD
53CC
54DB
55AC
56DD
57DA
58AA
59CA
60AA
61AA
62BD
63DD
64AAnnulled
65DD
66CC
67BB
68BAnnulled
69AA
70BB
71BB
72D
73DB
74CC
75BB
76AA
77DD
78CC
79DB
80AA
81CC
82CC
83BB
84CC
85AA
86AA
87DB
88DD
89CB
90AA
91DD
92AA
93BC
94BB
95DD
96BB
97B
98AB
99AA
100BB
101AA
102DD
103B
104DD
105DB
106AC
107CC
108BB
109DD
110CD
111CB
112CC
113DAnnulled
114DD
115DD
116DA
117DD
118DD
119CA
120CC
121AA
122BB
123DD
124DD
125CB
126DD
127AA
128CB
129DD
130CC
131BC
132DD
133A
134C
135A
136DD
137AA
138CC
139A
140CC
141BB
142C
143BA
144AD
145CC
146BC
147C
148BA
149AC
150CD
151A
152A
153DC
154B
155DD
156C
157CC
158D
159DD
160B
161BB
162BB
163DB
164DB
165CA
166C
167CA
168BB
169CC
170CA
171BD
172BB
173CA
174BB
175AA
176C
177C
178BB
179BC
180Annulled
181BB
182DD
183CC
184AA
185CC
186DD
187AA
188BC
189AD
190DD
191B
192DB
193CC
194AC
195CC
196BB
197BA
198BB
199DD
200BA
201B
202D
203AB
204CD
205BD
206CAnnulled
207DA
208AA
209B
210D