MedicalBenchmark
Provider: Baidu

ERNIE 4.5 21B A3B Thinking

#256 of 319 models (MIR 2025)

Net score: 112.00 pts
Accuracy: 67.0%
Correct / Incorrect: 134 / 66
Total Cost: $0.17

Overall Performance (vs. average)

| Metric | This model | Average |
|---|---|---|
| Accuracy | 67.0% | 77.9% |
| Net score | 112.00 pts | 143.96 pts |
| Correct | 134 | 156 |
| Incorrect | 66 | 35 |
| Total Cost | $0.17 | $3.36 |
| Average response time | 30.9s | 19.0s |
| Output Tokens | 585K | 430K |
| Reasoning Tokens | 441K | 306K |
| Average confidence | 98.9% | 95.2% |
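
The page does not say how the net score is derived from the correct and incorrect counts. The reported figures are consistent with a rule that deducts one point for every three incorrect answers (134 - 66/3 = 112), so the sketch below assumes that rule; it is an inference from the numbers, not something the page confirms. The function names `net_score` and `accuracy` are hypothetical.

```python
def net_score(correct: int, incorrect: int, wrong_per_point: int = 3) -> float:
    """Net score under an ASSUMED penalty: one point lost per `wrong_per_point` wrong answers."""
    return correct - incorrect / wrong_per_point


def accuracy(correct: int, incorrect: int, unanswered: int = 0) -> float:
    """Percentage of scored questions answered correctly."""
    total = correct + incorrect + unanswered
    return 100 * correct / total


# Figures reported for this model: 134 correct, 66 incorrect, 0 unanswered.
print(f"{net_score(134, 66):.2f} pts")
print(f"{accuracy(134, 66):.1f}%")
```

With the reported counts this reproduces the 112.00 pts and 67.0% shown above, which is what suggests the one-third penalty.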

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average accuracy |
|---|---|---|---|---|---|
| Allergology | 4 | 0 | 0 | 100.0% | 87.9% |
| Anesthesiology and Resuscitation | 4 | 2 | 0 | 66.7% | 82.3% |
| Cardiology | 17 | 5 | 0 | 77.3% | 78.6% |
| Dermatology | 8 | 4 | 0 | 66.7% | 69.4% |
| Endocrinology and Nutrition | 13 | 3 | 0 | 81.3% | 83.5% |
| ENT | 5 | 3 | 0 | 62.5% | 74.8% |
| Epidemiology | 5 | 2 | 0 | 71.4% | 69.1% |
| Gastroenterology | 13 | 8 | 0 | 61.9% | 74.1% |
| Genetics | 2 | 4 | 0 | 33.3% | 69.5% |
| Geriatrics | 5 | 6 | 0 | 45.5% | 77.5% |
| Gynecology and Obstetrics | 16 | 3 | 0 | 84.2% | 86.7% |
| Health Planning and Management | 1 | 1 | 0 | 50.0% | 82.6% |
| Hematology | 8 | 3 | 0 | 72.7% | 82.7% |
| Immunology | 6 | 3 | 0 | 66.7% | 83.3% |
| Infectious Diseases | 20 | 7 | 0 | 74.1% | 74.9% |
| Legal Medicine and Bioethics | 0 | 5 | 0 | 0.0% | 68.4% |
| Medical Oncology | 22 | 3 | 0 | 88.0% | 87.2% |
| Nephrology | 11 | 3 | 0 | 78.6% | 84.8% |
| Neurology | 13 | 7 | 0 | 65.0% | 77.3% |
| Ophthalmology | 4 | 1 | 0 | 80.0% | 74.2% |
| Palliative Care | 0 | 4 | 0 | 0.0% | 78.6% |
| Pediatrics | 18 | 8 | 0 | 69.2% | 71.9% |
| Pharmacology | 11 | 6 | 0 | 64.7% | 74.1% |
| Psychiatry | 5 | 3 | 0 | 62.5% | 83.0% |
| Pulmonology | 8 | 6 | 0 | 57.1% | 80.4% |
| Radiology-Emergency | 7 | 7 | 0 | 50.0% | 69.4% |
| Rheumatology | 10 | 5 | 0 | 66.7% | 76.6% |
| Statistics | 2 | 1 | 0 | 66.7% | 76.6% |
| Traumatology | 13 | 5 | 0 | 72.2% | 79.3% |
| Urology | 4 | 3 | 0 | 57.1% | 80.7% |

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average accuracy |
|---|---|---|---|---|---|
| Anatomy | 3 | 4 | 0 | 42.9% | 78.6% |
| Biostatistics | 3 | 1 | 0 | 75.0% | 79.8% |
| Diagnosis | 61 | 27 | 0 | 69.3% | 79.9% |
| Epidemiology | 4 | 1 | 0 | 80.0% | 76.7% |
| Ethics | 0 | 3 | 0 | 0.0% | 74.1% |
| Interpretation | 27 | 15 | 0 | 64.3% | 70.7% |
| Legal | 0 | 4 | 0 | 0.0% | 64.6% |
| Pathophysiology | 12 | 15 | 0 | 44.4% | 76.1% |
| Pharmacology | 10 | 3 | 0 | 76.9% | 83.3% |
| Prevention | 10 | 2 | 0 | 83.3% | 75.6% |
| Prognosis | 4 | 3 | 0 | 57.1% | 80.8% |
| Risk | 4 | 1 | 0 | 80.0% | 85.2% |
| Tests | 16 | 11 | 0 | 59.3% | 77.9% |
| Treatment | 61 | 20 | 0 | 75.3% | 77.3% |

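| # | Answer | Correct | Status |
|---|---|---|---|
| 1 | B | B | |
| 2 | A | A | |
| 3 | C | C | |
| 4 | C | B | |
| 5 | A | A | |
| 6 | C | C | |
| 7 | C | C | |
| 8 | A | A | |
| 9 | A | A | |
| 10 | D | D | |
| 11 | D | D | |
| 12 | D | D | |
| 13 | A | B | |
| 14 | B | D | |
| 15 | B | | Annulled |
| 16 | C | B | |
| 17 | D | B | |
| 18 | C | A | |
| 19 | C | C | |
| 20 | D | A | |
| 21 | C | B | |
| 22 | C | D | |
| 23 | A | C | |
| 24 | D | D | |
| 25 | C | C | |
| 26 | D | | Annulled |
| 27 | D | C | |
| 28 | A | | Annulled |
| 29 | B | D | |
| 30 | C | B | |
| 31 | A | D | |
| 32 | A | A | |
| 33 | D | D | |
| 34 | A | D | |
| 35 | B | B | |
| 36 | D | D | |
| 37 | A | C | |
| 38 | C | C | |
| 39 | D | D | |
| 40 | D | A | |
| 41 | D | D | |
| 42 | C | C | |
| 43 | B | B | |
| 44 | D | D | |
| 45 | D | D | |
| 46 | A | A | |
| 47 | A | A | |
| 48 | A | A | |
| 49 | D | D | |
| 50 | B | B | |
| 51 | D | C | |
| 52 | B | B | |
| 53 | D | D | |
| 54 | D | B | |
| 55 | C | A | |
| 56 | B | | Annulled |
| 57 | C | C | |
| 58 | B | B | |
| 59 | D | D | |
| 60 | A | A | |
| 61 | C | A | |
| 62 | D | D | |
| 63 | B | B | |
| 64 | D | D | |
| 65 | D | A | |
| 66 | A | A | |
| 67 | B | B | |
| 68 | B | B | |
| 69 | A | B | |
| 70 | A | A | |
| 71 | D | D | |
| 72 | C | A | |
| 73 | C | D | |
| 74 | C | C | |
| 75 | A | A | |
| 76 | C | B | |
| 77 | B | B | |
| 78 | B | B | |
| 79 | A | C | |
| 80 | A | C | |
| 81 | C | C | |
| 82 | D | D | |
| 83 | B | B | |
| 84 | D | D | |
| 85 | D | C | |
| 86 | A | C | |
| 87 | A | A | |
| 88 | D | D | |
| 89 | D | B | |
| 90 | A | A | |
| 91 | C | B | |
| 92 | C | C | |
| 93 | B | B | |
| 94 | C | C | |
| 95 | B | A | |
| 96 | C | C | |
| 97 | C | D | |
| 98 | C | C | |
| 99 | A | A | |
| 100 | B | C | |
| 101 | B | B | |
| 102 | B | D | |
| 103 | A | A | |
| 104 | C | C | |
| 105 | A | A | |
| 106 | C | C | |
| 107 | B | B | |
| 108 | D | D | |
| 109 | A | B | |
| 110 | C | C | |
| 111 | A | A | |
| 112 | C | C | |
| 113 | B | B | |
| 114 | C | D | |
| 115 | D | D | |
| 116 | C | C | |
| 117 | A | A | |
| 118 | D | D | |
| 119 | C | C | |
| 120 | B | B | |
| 121 | D | D | |
| 122 | A | C | |
| 123 | C | C | |
| 124 | C | C | |
| 125 | D | D | |
| 126 | B | D | |
| 127 | A | B | |
| 128 | D | D | |
| 129 | A | A | |
| 130 | D | D | |
| 131 | D | D | |
| 132 | A | A | |
| 133 | B | B | |
| 134 | C | C | |
| 135 | B | B | |
| 136 | D | C | |
| 137 | C | A | |
| 138 | D | D | |
| 139 | D | D | |
| 140 | C | B | |
| 141 | A | A | |
| 142 | A | A | |
| 143 | B | B | |
| 144 | B | B | |
| 145 | D | D | |
| 146 | C | C | |
| 147 | B | B | |
| 148 | B | A | |
| 149 | D | A | |
| 150 | A | D | |
| 151 | A | A | |
| 152 | C | A | |
| 153 | B | B | |
| 154 | B | B | |
| 155 | B | B | |
| 156 | C | C | |
| 157 | A | A | |
| 158 | C | C | |
| 159 | B | C | |
| 160 | C | A | |
| 161 | A | A | |
| 162 | A | | Annulled |
| 163 | D | D | |
| 164 | C | C | |
| 165 | D | A | |
| 166 | B | B | |
| 167 | C | C | |
| 168 | B | D | |
| 169 | C | B | |
| 170 | A | B | |
| 171 | C | C | |
| 172 | A | A | |
| 173 | D | A | |
| 174 | B | B | |
| 175 | B | B | |
| 176 | C | C | |
| 177 | D | C | |
| 178 | B | A | |
| 179 | C | D | |
| 180 | C | A | |
| 181 | D | B | |
| 182 | B | C | |
| 183 | B | B | |
| 184 | B | B | |
| 185 | D | B | |
| 186 | D | | Annulled |
| 187 | C | C | |
| 188 | D | D | |
| 189 | B | D | |
| 190 | B | A | |
| 191 | B | B | |
| 192 | A | A | |
| 193 | C | C | |
| 194 | A | A | |
| 195 | A | A | |
| 196 | A | A | |
| 197 | B | B | |
| 198 | B | C | |
| 199 | D | D | |
| 200 | C | C | |
| 201 | B | B | |
| 202 | D | A | |
| 203 | D | D | |
| 204 | C | C | |
| 205 | B | B | |
| 206 | D | D | |
| 207 | A | A | |
| 208 | B | C | |
| 209 | A | C | |
| 210 | C | B | |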