MedicalBenchmark
Baidu: ERNIE 4.5 21B A3B Thinking provider

ERNIE 4.5 21B A3B Thinking

239

#239 of 291 modelsMIR 2024

Net score

107.66 pts

Accuracy

65.0%

Correct / Incorrect

130 / 67

Total Cost

$0.16

Overall Performance

(vs. average)
Accuracy

65.0%

avg: 80.5%

Net score

107.66 pts

avg: 150.85 pts

Correct

130

avg: 161

Incorrect

67

avg: 30

Total Cost

$0.16

avg: $3.32

Average response time

29.0s

avg: 16.4s

Output Tokens

559K

avg: 427K

Reasoning Tokens

417K

avg: 310K

Average confidence

98.4%

avg: 95.4%

Subject Breakdown

Allergology
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
90.5%
Anesthesiology and Resuscitation
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
87.1%
Cardiology
Correct
12
Incorrect
7
Unanswered
2
Accuracy
57.1%
Average
79.7%
Dermatology
Correct
10
Incorrect
4
Unanswered
0
Accuracy
71.4%
Average
80.2%
Endocrinology and Nutrition
Correct
18
Incorrect
1
Unanswered
0
Accuracy
94.7%
Average
84.2%
ENT
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
74.4%
Epidemiology
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
89.3%
Gastroenterology
Correct
11
Incorrect
11
Unanswered
0
Accuracy
50.0%
Average
70.5%
Genetics
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
86.5%
Geriatrics
Correct
7
Incorrect
3
Unanswered
0
Accuracy
70.0%
Average
86.9%
Gynecology and Obstetrics
Correct
9
Incorrect
4
Unanswered
1
Accuracy
64.3%
Average
81.2%
Health Planning and Management
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
73.2%
Hematology
Correct
7
Incorrect
6
Unanswered
0
Accuracy
53.8%
Average
81.5%
Immunology
Correct
8
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
89.1%
Infectious Diseases
Correct
14
Incorrect
9
Unanswered
0
Accuracy
60.9%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
15
Incorrect
6
Unanswered
0
Accuracy
71.4%
Average
80.2%
Nephrology
Correct
8
Incorrect
5
Unanswered
0
Accuracy
61.5%
Average
80.8%
Neurology
Correct
18
Incorrect
4
Unanswered
0
Accuracy
81.8%
Average
83.7%
Ophthalmology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
80.0%
Palliative Care
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
88.2%
Pediatrics
Correct
10
Incorrect
6
Unanswered
1
Accuracy
58.8%
Average
82.0%
Pharmacology
Correct
15
Incorrect
8
Unanswered
0
Accuracy
65.2%
Average
85.4%
Psychiatry
Correct
7
Incorrect
3
Unanswered
0
Accuracy
70.0%
Average
89.5%
Pulmonology
Correct
14
Incorrect
4
Unanswered
1
Accuracy
73.7%
Average
80.6%
Radiology-Emergency
Correct
7
Incorrect
7
Unanswered
0
Accuracy
50.0%
Average
64.9%
Rheumatology
Correct
10
Incorrect
4
Unanswered
0
Accuracy
71.4%
Average
81.4%
Statistics
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.1%
Traumatology
Correct
8
Incorrect
7
Unanswered
0
Accuracy
53.3%
Average
74.5%
Urology
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
79.8%
Biostatistics
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
90.7%
Diagnosis
Correct
53
Incorrect
20
Unanswered
0
Accuracy
72.6%
Average
79.2%
Epidemiology
Correct
7
Incorrect
5
Unanswered
0
Accuracy
58.3%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
18
Incorrect
17
Unanswered
2
Accuracy
48.6%
Average
69.6%
Pathophysiology
Correct
27
Incorrect
6
Unanswered
0
Accuracy
81.8%
Average
85.4%
Pharmacology
Correct
16
Incorrect
9
Unanswered
0
Accuracy
64.0%
Average
84.0%
Prevention
Correct
9
Incorrect
3
Unanswered
0
Accuracy
75.0%
Average
89.8%
Prognosis
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
83.9%
Risk
Correct
9
Incorrect
4
Unanswered
0
Accuracy
69.2%
Average
83.6%
Tests
Correct
9
Incorrect
10
Unanswered
2
Accuracy
42.9%
Average
73.9%
Treatment
Correct
42
Incorrect
28
Unanswered
1
Accuracy
59.2%
Average
81.3%
#AnswerCorrectStatus
1BB
2BD
3DB
4DC
5AC
6DB
7DD
8CC
9DA
10DD
11DD
12AA
13C
14DA
15DB
16DA
17CC
18AA
19CB
20CC
21DD
22DB
23BA
24AA
25CC
26BB
27CC
28AA
29BB
30AC
31AD
32BA
33CC
34DB
35DD
36DD
37AA
38AA
39CC
40BB
41CC
42DD
43AA
44DD
45DD
46BB
47CC
48CC
49BB
50BC
51AA
52DD
53CC
54BB
55CC
56DD
57DA
58AA
59BA
60AA
61CA
62BD
63CD
64DAnnulled
65DD
66CC
67DB
68CAnnulled
69AA
70BB
71BB
72BD
73CB
74CC
75BB
76DA
77DD
78CC
79B
80AA
81DC
82CC
83AB
84CC
85BA
86AA
87BB
88DD
89AB
90AA
91DD
92AA
93BC
94BB
95DD
96BB
97BB
98BB
99AA
100BB
101DA
102DD
103BB
104CD
105DB
106AC
107CC
108BB
109CD
110BD
111AB
112CC
113CAnnulled
114BD
115DD
116AA
117CD
118DD
119AA
120CC
121AA
122B
123DD
124DD
125CB
126CD
127AA
128BB
129DD
130AC
131CC
132CD
133CA
134BC
135DA
136DD
137AA
138CC
139AA
140BC
141BB
142CC
143BA
144DD
145DC
146BC
147CC
148BA
149AC
150DD
151AA
152AA
153DC
154BB
155DD
156CC
157AC
158DD
159DD
160CB
161BB
162BB
163BB
164DB
165AA
166CC
167BA
168BB
169CC
170CA
171DD
172CB
173BA
174BB
175AA
176CC
177DC
178BB
179CC
180AAnnulled
181CB
182DD
183CC
184CA
185CC
186DD
187AA
188CC
189BD
190DD
191BB
192BB
193CC
194DC
195CC
196BB
197AA
198BB
199DD
200AA
201BB
202DD
203AB
204DD
205DD
206DAnnulled
207DA
208AA
209CB
210DD