MedicalBenchmark
Baidu: ERNIE 4.5 VL 28B A3B provider

ERNIE 4.5 VL 28B A3B

233

#233 of 290 modelsMIR 2025

Net score

105.66 pts

Accuracy

64.0%

Correct / Incorrect

128 / 67

Total Cost

$0.14

Overall Performance

(vs. average)
Accuracy

64.0%

avg: 75.9%

Net score

105.66 pts

avg: 138.99 pts

Correct

128

avg: 152

Incorrect

67

avg: 38

Total Cost

$0.14

avg: $3.59

Average response time

16.1s

avg: 18.1s

Output Tokens

224K

avg: 443K

Reasoning Tokens

0

avg: 320K

Average confidence

96.6%

avg: 94.7%

Subject Breakdown

Allergology
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
86.9%
Anesthesiology and Resuscitation
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
81.3%
Cardiology
Correct
16
Incorrect
6
Unanswered
0
Accuracy
72.7%
Average
77.4%
Dermatology
Correct
7
Incorrect
6
Unanswered
0
Accuracy
53.8%
Average
62.8%
Endocrinology and Nutrition
Correct
12
Incorrect
4
Unanswered
0
Accuracy
75.0%
Average
82.5%
ENT
Correct
4
Incorrect
4
Unanswered
0
Accuracy
50.0%
Average
73.8%
Epidemiology
Correct
3
Incorrect
4
Unanswered
0
Accuracy
42.9%
Average
67.1%
Gastroenterology
Correct
13
Incorrect
8
Unanswered
0
Accuracy
61.9%
Average
72.9%
Genetics
Correct
1
Incorrect
4
Unanswered
1
Accuracy
16.7%
Average
68.2%
Geriatrics
Correct
5
Incorrect
5
Unanswered
1
Accuracy
45.5%
Average
71.2%
Gynecology and Obstetrics
Correct
14
Incorrect
5
Unanswered
0
Accuracy
73.7%
Average
85.9%
Health Planning and Management
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
81.6%
Hematology
Correct
7
Incorrect
4
Unanswered
0
Accuracy
63.6%
Average
81.8%
Immunology
Correct
6
Incorrect
1
Unanswered
2
Accuracy
66.7%
Average
82.5%
Infectious Diseases
Correct
15
Incorrect
13
Unanswered
0
Accuracy
53.6%
Average
71.1%
Legal Medicine and Bioethics
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
67.2%
Medical Oncology
Correct
20
Incorrect
4
Unanswered
1
Accuracy
80.0%
Average
86.3%
Nephrology
Correct
10
Incorrect
4
Unanswered
1
Accuracy
66.7%
Average
78.2%
Neurology
Correct
13
Incorrect
6
Unanswered
1
Accuracy
65.0%
Average
76.2%
Ophthalmology
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
72.6%
Palliative Care
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
77.2%
Pediatrics
Correct
17
Incorrect
7
Unanswered
1
Accuracy
68.0%
Average
72.7%
Pharmacology
Correct
10
Incorrect
6
Unanswered
1
Accuracy
58.8%
Average
73.1%
Psychiatry
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
82.0%
Pulmonology
Correct
10
Incorrect
3
Unanswered
1
Accuracy
71.4%
Average
73.0%
Radiology-Emergency
Correct
9
Incorrect
5
Unanswered
0
Accuracy
64.3%
Average
67.9%
Rheumatology
Correct
9
Incorrect
5
Unanswered
0
Accuracy
64.3%
Average
74.6%
Statistics
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
74.9%
Traumatology
Correct
12
Incorrect
6
Unanswered
0
Accuracy
66.7%
Average
78.2%
Urology
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
79.5%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
5
Unanswered
0
Accuracy
28.6%
Average
77.1%
Biostatistics
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
78.4%
Diagnosis
Correct
58
Incorrect
29
Unanswered
2
Accuracy
65.2%
Average
77.9%
Epidemiology
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
75.0%
Ethics
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
72.0%
Interpretation
Correct
19
Incorrect
22
Unanswered
1
Accuracy
45.2%
Average
69.3%
Legal
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
63.6%
Pathophysiology
Correct
15
Incorrect
11
Unanswered
1
Accuracy
55.6%
Average
72.6%
Pharmacology
Correct
9
Incorrect
3
Unanswered
1
Accuracy
69.2%
Average
82.4%
Prevention
Correct
6
Incorrect
6
Unanswered
0
Accuracy
50.0%
Average
74.5%
Prognosis
Correct
5
Incorrect
1
Unanswered
0
Accuracy
83.3%
Average
77.8%
Risk
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
84.3%
Tests
Correct
20
Incorrect
5
Unanswered
1
Accuracy
76.9%
Average
76.3%
Treatment
Correct
55
Incorrect
24
Unanswered
3
Accuracy
67.1%
Average
75.2%
#AnswerCorrectStatus
1BB
2CA
3BC
4AB
5DA
6CC
7CC
8CA
9CA
10DD
11BD
12DD
13AB
14DD
15C
16BB
17CB
18CA
19CC
20BA
21CB
22CD
23AC
24DD
25BC
26DAnnulled
27AC
28DAnnulled
29AD
30CB
31AD
32AA
33AD
34BD
35BB
36DD
37C
38CC
39DD
40AA
41DD
42CC
43CB
44DD
45CD
46AA
47AA
48AA
49DD
50BB
51CC
52BB
53DD
54DB
55AA
56BAnnulled
57CC
58BB
59DD
60AA
61BA
62DD
63BB
64DD
65AA
66AA
67BB
68BB
69AB
70AA
71DD
72CA
73DD
74CC
75AA
76CB
77BB
78BB
79CC
80AC
81CC
82DD
83BB
84AD
85C
86CC
87BA
88DD
89CB
90AA
91AB
92CC
93BB
94CC
95DA
96CC
97CD
98CC
99AA
100CC
101BB
102BD
103BA
104CC
105AA
106CC
107BB
108DD
109AB
110CC
111AA
112CC
113BB
114DD
115D
116CC
117AA
118DD
119CC
120BB
121CD
122AC
123CC
124CC
125DD
126BD
127BB
128DD
129BA
130DD
131BD
132AA
133BB
134CC
135BB
136CC
137AA
138DD
139DD
140CB
141AA
142DA
143BB
144BB
145DD
146CC
147BB
148BA
149DA
150DA
151AA
152A
153BB
154BB
155BB
156CC
157AA
158AC
159CC
160CA
161CA
162B
163DD
164CC
165AA
166BB
167CC
168DD
169CB
170AB
171DC
172BA
173DA
174BB
175CB
176CC
177DC
178BA
179CD
180AA
181DB
182CC
183BB
184BB
185BB
186DAnnulled
187CC
188DD
189BD
190AA
191CB
192AA
193CC
194AA
195A
196AA
197BB
198CC
199DD
200CC
201BB
202BA
203DD
204DC
205BB
206CD
207BA
208BC
209CC
210DB