MedicalBenchmark
Qwen: Qwen2.5-VL 7B Instruct provider

Qwen2.5-VL 7B Instruct

264

#264 of 291 modelsMIR 2024

Net score

60.66 pts

Accuracy

45.0%

Correct / Incorrect

90 / 88

Total Cost

$0.04

Overall Performance

(vs. average)
Accuracy

45.0%

avg: 80.5%

Net score

60.66 pts

avg: 150.85 pts

Correct

90

avg: 161

Incorrect

88

avg: 30

Total Cost

$0.04

avg: $3.32

Average response time

7.3s

avg: 16.4s

Output Tokens

104K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

87.0%

avg: 95.4%

Subject Breakdown

Allergology
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
90.5%
Anesthesiology and Resuscitation
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
87.1%
Cardiology
Correct
6
Incorrect
12
Unanswered
3
Accuracy
28.6%
Average
79.7%
Dermatology
Correct
8
Incorrect
5
Unanswered
1
Accuracy
57.1%
Average
80.2%
Endocrinology and Nutrition
Correct
10
Incorrect
9
Unanswered
0
Accuracy
52.6%
Average
84.2%
ENT
Correct
4
Incorrect
2
Unanswered
1
Accuracy
57.1%
Average
74.4%
Epidemiology
Correct
5
Incorrect
2
Unanswered
1
Accuracy
62.5%
Average
89.3%
Gastroenterology
Correct
9
Incorrect
12
Unanswered
1
Accuracy
40.9%
Average
70.5%
Genetics
Correct
3
Incorrect
3
Unanswered
1
Accuracy
42.9%
Average
86.5%
Geriatrics
Correct
6
Incorrect
3
Unanswered
1
Accuracy
60.0%
Average
86.9%
Gynecology and Obstetrics
Correct
8
Incorrect
4
Unanswered
2
Accuracy
57.1%
Average
81.2%
Health Planning and Management
Correct
0
Incorrect
2
Unanswered
0
Accuracy
0.0%
Average
73.2%
Hematology
Correct
5
Incorrect
4
Unanswered
4
Accuracy
38.5%
Average
81.5%
Immunology
Correct
5
Incorrect
2
Unanswered
1
Accuracy
62.5%
Average
89.1%
Infectious Diseases
Correct
12
Incorrect
10
Unanswered
1
Accuracy
52.2%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
14
Incorrect
6
Unanswered
1
Accuracy
66.7%
Average
80.2%
Nephrology
Correct
2
Incorrect
9
Unanswered
2
Accuracy
15.4%
Average
80.8%
Neurology
Correct
11
Incorrect
7
Unanswered
4
Accuracy
50.0%
Average
83.7%
Ophthalmology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
80.0%
Palliative Care
Correct
2
Incorrect
0
Unanswered
2
Accuracy
50.0%
Average
88.2%
Pediatrics
Correct
6
Incorrect
8
Unanswered
3
Accuracy
35.3%
Average
82.0%
Pharmacology
Correct
12
Incorrect
7
Unanswered
4
Accuracy
52.2%
Average
85.4%
Psychiatry
Correct
5
Incorrect
3
Unanswered
2
Accuracy
50.0%
Average
89.5%
Pulmonology
Correct
7
Incorrect
11
Unanswered
1
Accuracy
36.8%
Average
80.6%
Radiology-Emergency
Correct
8
Incorrect
6
Unanswered
0
Accuracy
57.1%
Average
64.9%
Rheumatology
Correct
7
Incorrect
6
Unanswered
1
Accuracy
50.0%
Average
81.4%
Statistics
Correct
1
Incorrect
1
Unanswered
1
Accuracy
33.3%
Average
91.1%
Traumatology
Correct
9
Incorrect
5
Unanswered
1
Accuracy
60.0%
Average
74.5%
Urology
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
79.8%
Biostatistics
Correct
2
Incorrect
2
Unanswered
1
Accuracy
40.0%
Average
90.7%
Diagnosis
Correct
34
Incorrect
31
Unanswered
8
Accuracy
46.6%
Average
79.2%
Epidemiology
Correct
6
Incorrect
4
Unanswered
2
Accuracy
50.0%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
15
Incorrect
18
Unanswered
4
Accuracy
40.5%
Average
69.6%
Pathophysiology
Correct
14
Incorrect
16
Unanswered
3
Accuracy
42.4%
Average
85.4%
Pharmacology
Correct
16
Incorrect
5
Unanswered
4
Accuracy
64.0%
Average
84.0%
Prevention
Correct
7
Incorrect
5
Unanswered
0
Accuracy
58.3%
Average
89.8%
Prognosis
Correct
2
Incorrect
4
Unanswered
1
Accuracy
28.6%
Average
83.9%
Risk
Correct
8
Incorrect
5
Unanswered
0
Accuracy
61.5%
Average
83.6%
Tests
Correct
10
Incorrect
10
Unanswered
1
Accuracy
47.6%
Average
73.9%
Treatment
Correct
33
Incorrect
31
Unanswered
7
Accuracy
46.5%
Average
81.3%
#AnswerCorrectStatus
1BB
2DD
3DB
4AC
5CC
6DB
7DD
8CC
9AA
10DD
11BD
12AA
13CC
14DA
15CB
16DA
17C
18AA
19AB
20AC
21DD
22AB
23AA
24CA
25AC
26BB
27CC
28DA
29AB
30DC
31AD
32BA
33CC
34CB
35DD
36BD
37AA
38BA
39CC
40BB
41BC
42BD
43DA
44D
45AD
46BB
47CC
48CC
49AB
50C
51AA
52DD
53AC
54BB
55CC
56DD
57CA
58BA
59BA
60AA
61AA
62D
63CD
64DAnnulled
65DD
66CC
67BB
68BAnnulled
69AA
70AB
71AB
72BD
73CB
74AC
75BB
76AA
77D
78DC
79CB
80AA
81DC
82CC
83BB
84DC
85AA
86AA
87AB
88D
89CB
90AA
91D
92DA
93AC
94BB
95DD
96BB
97BB
98B
99AA
100BB
101AA
102DD
103BB
104AD
105DB
106C
107CC
108BB
109AD
110CD
111B
112BC
113BAnnulled
114BD
115DD
116DA
117BD
118D
119CA
120C
121AA
122AB
123BD
124DD
125AB
126AD
127CA
128DB
129CD
130CC
131BC
132DD
133CA
134BC
135DA
136DD
137AA
138BC
139BA
140CC
141DB
142BC
143A
144DD
145BC
146BC
147CC
148BA
149BC
150DD
151AA
152A
153DC
154BB
155D
156C
157C
158DD
159DD
160BB
161DB
162BB
163BB
164CB
165CA
166DC
167AA
168BB
169CC
170CA
171DD
172AB
173CA
174B
175AA
176CC
177C
178BB
179CC
180AAnnulled
181AB
182DD
183CC
184A
185C
186DD
187CA
188CC
189BD
190DD
191BB
192DB
193AC
194DC
195CC
196BB
197AA
198BB
199DD
200AA
201AB
202BD
203B
204AD
205DD
206BAnnulled
207BA
208AA
209B
210AD