MedicalBenchmark
OpenAI: GPT-4o Search Preview provider

GPT-4o Search Preview

162

#162 of 319 modelsMIR 2025

Net score

161.33 pts

Accuracy

84.5%

Correct / Incorrect

169 / 23

Total Cost

$0.00

Overall Performance

(vs. average)
Accuracy

84.5%

avg: 77.9%

Net score

161.33 pts

avg: 143.96 pts

Correct

169

avg: 156

Incorrect

23

avg: 35

Total Cost

$0.00

avg: $3.36

Average response time

7.4s

avg: 19.0s

Output Tokens

97K

avg: 430K

Reasoning Tokens

0

avg: 306K

Average confidence

95.6%

avg: 95.2%

Subject Breakdown

Allergology
Correct
3
Incorrect
0
Unanswered
1
Accuracy
75.0%
Average
87.9%
Anesthesiology and Resuscitation
Correct
6
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
82.3%
Cardiology
Correct
20
Incorrect
2
Unanswered
0
Accuracy
90.9%
Average
78.6%
Dermatology
Correct
9
Incorrect
1
Unanswered
2
Accuracy
75.0%
Average
69.4%
Endocrinology and Nutrition
Correct
16
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
83.5%
ENT
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
74.8%
Epidemiology
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
69.1%
Gastroenterology
Correct
17
Incorrect
4
Unanswered
0
Accuracy
81.0%
Average
74.1%
Genetics
Correct
5
Incorrect
1
Unanswered
0
Accuracy
83.3%
Average
69.5%
Geriatrics
Correct
10
Incorrect
0
Unanswered
1
Accuracy
90.9%
Average
77.5%
Gynecology and Obstetrics
Correct
17
Incorrect
2
Unanswered
0
Accuracy
89.5%
Average
86.7%
Health Planning and Management
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
82.6%
Hematology
Correct
10
Incorrect
1
Unanswered
0
Accuracy
90.9%
Average
82.7%
Immunology
Correct
9
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
83.3%
Infectious Diseases
Correct
21
Incorrect
5
Unanswered
1
Accuracy
77.8%
Average
74.9%
Legal Medicine and Bioethics
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
68.4%
Medical Oncology
Correct
24
Incorrect
1
Unanswered
0
Accuracy
96.0%
Average
87.2%
Nephrology
Correct
13
Incorrect
1
Unanswered
0
Accuracy
92.9%
Average
84.8%
Neurology
Correct
16
Incorrect
3
Unanswered
1
Accuracy
80.0%
Average
77.3%
Ophthalmology
Correct
3
Incorrect
1
Unanswered
1
Accuracy
60.0%
Average
74.2%
Palliative Care
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
78.6%
Pediatrics
Correct
20
Incorrect
4
Unanswered
2
Accuracy
76.9%
Average
71.9%
Pharmacology
Correct
13
Incorrect
2
Unanswered
2
Accuracy
76.5%
Average
74.1%
Psychiatry
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
83.0%
Pulmonology
Correct
12
Incorrect
2
Unanswered
0
Accuracy
85.7%
Average
80.4%
Radiology-Emergency
Correct
10
Incorrect
4
Unanswered
0
Accuracy
71.4%
Average
69.4%
Rheumatology
Correct
11
Incorrect
2
Unanswered
2
Accuracy
73.3%
Average
76.6%
Statistics
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
76.6%
Traumatology
Correct
13
Incorrect
2
Unanswered
3
Accuracy
72.2%
Average
79.3%
Urology
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
80.7%

Question Type Breakdown

Anatomy
Correct
5
Incorrect
0
Unanswered
2
Accuracy
71.4%
Average
78.6%
Biostatistics
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
79.8%
Diagnosis
Correct
79
Incorrect
8
Unanswered
1
Accuracy
89.8%
Average
79.9%
Epidemiology
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
76.7%
Ethics
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
74.1%
Interpretation
Correct
34
Incorrect
7
Unanswered
1
Accuracy
81.0%
Average
70.7%
Legal
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
64.6%
Pathophysiology
Correct
24
Incorrect
1
Unanswered
2
Accuracy
88.9%
Average
76.1%
Pharmacology
Correct
12
Incorrect
1
Unanswered
0
Accuracy
92.3%
Average
83.3%
Prevention
Correct
9
Incorrect
2
Unanswered
1
Accuracy
75.0%
Average
75.6%
Prognosis
Correct
5
Incorrect
1
Unanswered
1
Accuracy
71.4%
Average
80.8%
Risk
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
85.2%
Tests
Correct
22
Incorrect
4
Unanswered
1
Accuracy
81.5%
Average
77.9%
Treatment
Correct
64
Incorrect
14
Unanswered
3
Accuracy
79.0%
Average
77.3%
#AnswerCorrectStatus
1BB
2BA
3CC
4AB
5AA
6CC
7CC
8AA
9AA
10DD
11DD
12DD
13BB
14DD
15BAnnulled
16CB
17AB
18CA
19CC
20AA
21BB
22CD
23CC
24DD
25CC
26DAnnulled
27CC
28Annulled
29DD
30BB
31DD
32AA
33DD
34DD
35BB
36DD
37CC
38CC
39DD
40AA
41DD
42CC
43BB
44DD
45DD
46AA
47AA
48AA
49DD
50BB
51DC
52B
53DD
54BB
55AA
56AAnnulled
57CC
58BB
59DD
60AA
61BA
62DD
63BB
64DD
65AA
66AA
67BB
68B
69AB
70AA
71DD
72AA
73CD
74CC
75AA
76BB
77BB
78BB
79CC
80CC
81CC
82DD
83BB
84DD
85BC
86CC
87AA
88DD
89B
90AA
91B
92C
93BB
94CC
95AA
96CC
97DD
98CC
99AA
100CC
101BB
102DD
103AA
104CC
105AA
106CC
107BB
108DD
109BB
110CC
111AA
112CC
113BB
114DD
115DD
116CC
117AA
118DD
119CC
120AB
121DD
122CC
123CC
124CC
125DD
126DD
127BB
128DD
129AA
130DD
131DD
132AA
133BB
134CC
135DB
136CC
137AA
138DD
139DD
140BB
141AA
142AA
143BB
144BB
145DD
146CC
147BB
148AA
149A
150DD
151AA
152AA
153BB
154BB
155BB
156CC
157AA
158CC
159CC
160A
161AA
162CAnnulled
163DD
164AC
165CA
166BB
167CC
168DD
169CB
170BB
171CC
172CA
173DA
174BB
175BB
176CC
177CC
178AA
179DD
180AA
181BB
182CC
183BB
184BB
185BB
186DAnnulled
187CC
188DD
189DD
190DA
191AB
192AA
193CC
194AA
195AA
196AA
197BB
198C
199DD
200CC
201AB
202AA
203DD
204DC
205BB
206CD
207AA
208BC
209CC
210BB