MedicalBenchmark

Qwen2.5 7B Instruct

#257 of 291 models
MIR 2024

Net score: 73.00 pts
Accuracy: 50.0%
Correct / Incorrect: 100 / 81
Total Cost: $0.02
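
The reported net score is consistent with a negative-marking rule in which each correct answer earns one point and each incorrect answer costs one third of a point: 100 - 81/3 = 73.00. The page does not state this rule, so the one-third penalty below is an assumption inferred from the figures, and `net_score` and `penalty` are illustrative names rather than anything the benchmark defines. A minimal sketch:

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> float:
    """Net score under negative marking.

    Assumption (inferred from the reported numbers, not stated on the page):
    +1 per correct answer, -1/3 per incorrect answer, 0 per unanswered one.
    """
    # Fraction keeps the 1/3 penalty exact until the final conversion.
    return float(correct - incorrect * penalty)


# Figures reported for Qwen2.5 7B Instruct on MIR 2024.
print(f"{net_score(100, 81):.2f} pts")  # 73.00 pts
```

Under the same assumption, leaving a question unanswered neither adds to nor subtracts from the net score.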

Overall Performance (vs. average)

| Metric | Qwen2.5 7B Instruct | Average |
| --- | --- | --- |
| Accuracy | 50.0% | 80.5% |
| Net score | 73.00 pts | 150.85 pts |
| Correct | 100 | 161 |
| Incorrect | 81 | 30 |
| Total Cost | $0.02 | $3.32 |
| Average response time | 10.5s | 16.4s |
| Output Tokens | 121K | 427K |
| Reasoning Tokens | 0 | 310K |
| Average confidence | 90.2% | 95.4% |

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Allergology | 2 | 1 | 0 | 66.7% | 90.5% |
| Anesthesiology and Resuscitation | 2 | 1 | 1 | 50.0% | 87.1% |
| Cardiology | 10 | 7 | 4 | 47.6% | 79.7% |
| Dermatology | 9 | 4 | 1 | 64.3% | 80.2% |
| Endocrinology and Nutrition | 7 | 8 | 4 | 36.8% | 84.2% |
| ENT | 3 | 3 | 1 | 42.9% | 74.4% |
| Epidemiology | 3 | 2 | 3 | 37.5% | 89.3% |
| Gastroenterology | 10 | 11 | 1 | 45.5% | 70.5% |
| Genetics | 3 | 3 | 1 | 42.9% | 86.5% |
| Geriatrics | 7 | 2 | 1 | 70.0% | 86.9% |
| Gynecology and Obstetrics | 7 | 7 | 0 | 50.0% | 81.2% |
| Health Planning and Management | 0 | 0 | 2 | 0.0% | 73.2% |
| Hematology | 9 | 4 | 0 | 69.2% | 81.5% |
| Immunology | 5 | 3 | 0 | 62.5% | 89.1% |
| Infectious Diseases | 16 | 6 | 1 | 69.6% | 81.8% |
| Legal Medicine and Bioethics | 1 | 0 | 1 | 50.0% | 91.7% |
| Medical Oncology | 8 | 11 | 2 | 38.1% | 80.2% |
| Nephrology | 2 | 10 | 1 | 15.4% | 80.8% |
| Neurology | 11 | 8 | 3 | 50.0% | 83.7% |
| Ophthalmology | 2 | 3 | 0 | 40.0% | 80.0% |
| Palliative Care | 2 | 1 | 1 | 50.0% | 88.2% |
| Pediatrics | 10 | 7 | 0 | 58.8% | 82.0% |
| Pharmacology | 12 | 9 | 2 | 52.2% | 85.4% |
| Psychiatry | 6 | 3 | 1 | 60.0% | 89.5% |
| Pulmonology | 12 | 6 | 1 | 63.2% | 80.6% |
| Radiology-Emergency | 7 | 5 | 2 | 50.0% | 64.9% |
| Rheumatology | 7 | 6 | 1 | 50.0% | 81.4% |
| Statistics | 2 | 1 | 0 | 66.7% | 91.1% |
| Traumatology | 6 | 9 | 0 | 40.0% | 74.5% |
| Urology | 2 | 3 | 1 | 33.3% | 78.2% |
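
The per-subject Accuracy values match the number of correct answers divided by all questions in the subject, with unanswered questions counted in the denominator; for Cardiology, 10 / (10 + 7 + 4) = 47.6%. The page does not spell this out, so the formula is inferred from the rows, and `accuracy_pct` is a hypothetical helper name. A short sketch under that assumption:

```python
def accuracy_pct(correct: int, incorrect: int, unanswered: int) -> float:
    """Accuracy as a percentage, rounded to one decimal place.

    Assumption (inferred from the table rows): unanswered questions count
    toward the total, so they lower accuracy just as incorrect ones do.
    """
    total = correct + incorrect + unanswered
    if total == 0:
        return 0.0  # no questions in this category
    return round(100 * correct / total, 1)


# (correct, incorrect, unanswered) taken from the Subject Breakdown.
rows = {
    "Cardiology": (10, 7, 4),
    "Nephrology": (2, 10, 1),
    "Health Planning and Management": (0, 0, 2),
}
for subject, counts in rows.items():
    print(f"{subject}: {accuracy_pct(*counts)}%")
```

This reading also explains the 0.0% for Health Planning and Management, where both questions went unanswered.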

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
| --- | --- | --- | --- | --- | --- |
| Anatomy | 2 | 4 | 0 | 33.3% | 79.8% |
| Biostatistics | 2 | 1 | 2 | 40.0% | 90.7% |
| Diagnosis | 38 | 31 | 4 | 52.1% | 79.2% |
| Epidemiology | 1 | 6 | 5 | 8.3% | 81.2% |
| Ethics | 1 | 0 | 0 | 100.0% | 94.5% |
| Interpretation | 16 | 17 | 4 | 43.2% | 69.6% |
| Pathophysiology | 13 | 16 | 4 | 39.4% | 85.4% |
| Pharmacology | 16 | 6 | 3 | 64.0% | 84.0% |
| Prevention | 8 | 1 | 3 | 66.7% | 89.8% |
| Prognosis | 5 | 2 | 0 | 71.4% | 83.9% |
| Risk | 5 | 4 | 4 | 38.5% | 83.6% |
| Tests | 10 | 9 | 2 | 47.6% | 73.9% |
| Treatment | 43 | 25 | 3 | 60.6% | 81.3% |

# Answer Correct Status
1 A B
2 C D
3 D B
4 B C
5 B C
6 B B
7 D
8 C C
9 A A
10 D D
11 D D
12 B A
13 C
14 D A
15 D B
16 B A
17 A C
18 A A
19 B B
20 C
21 C D
22 A B
23 A A
24 D A
25 A C
26 B B
27 D C
28 A
29 A B
30 D C
31 A D
32 D A
33 D C
34 B B
35 D D
36 B D
37 B A
38 D A
39 C C
40 B B
41 B C
42 A D
43 A
44 A D
45 D D
46 B B
47 C
48 C C
49 B B
50 B C
51 C A
52 D D
53 C
54 C B
55 C C
56 C D
57 B A
58 D A
59 C A
60 C A
61 A A
62 D D
63 B D
64 A Annulled
65 D D
66 C C
67 B B
68 B Annulled
69 A A
70 B B
71 B B
72 D D
73 C B
74 C C
75 B B
76 D A
77 B D
78 C C
79 A B
80 A A
81 C C
82 C C
83 B
84 C C
85 A A
86 A A
87 D B
88 D D
89 C B
90 A A
91 D D
92 A A
93 C C
94 B B
95 A D
96 B B
97 B B
98 C B
99 A A
100 B B
101 A A
102 D D
103 B B
104 C D
105 D B
106 D C
107 C C
108 B B
109 A D
110 C D
111 B B
112 C C
113 D Annulled
114 D D
115 D D
116 A A
117 A D
118 B D
119 A A
120 C C
121 A A
122 B B
123 D D
124 D D
125 B B
126 B D
127 D A
128 D B
129 D D
130 C
131 C C
132 D D
133 C A
134 C
135 B A
136 D D
137 A A
138 B C
139 A A
140 C C
141 D B
142 D C
143 C A
144 C D
145 C C
146 B C
147 C C
148 B A
149 C C
150 C D
151 C A
152 D A
153 A C
154 B B
155 A D
156 C C
157 C C
158 D D
159 D D
160 C B
161 B B
162 B B
163 B B
164 B
165 C A
166 D C
167 D A
168 B B
169 C C
170 C A
171 D D
172 B B
173 C A
174 B B
175 A
176 D C
177 C C
178 B
179 C C
180 A Annulled
181 B B
182 D D
183 C
184 C A
185 C C
186 D
187 C A
188 C C
189 D
190 D D
191 B B
192 B
193 C
194 A C
195 C C
196 D B
197 D A
198 B B
199 D D
200 A A
201 A B
202 D D
203 C B
204 D
205 B D
206 C Annulled
207 A A
208 A A
209 A B
210 D D