MedicalBenchmark
Provider: OpenAI

GPT-3.5 Turbo 16k

#267 of 319 models, MIR 2025

Net score: 96.00 pts
Accuracy: 60.5%
Correct / Incorrect: 121 / 75
Total Cost: $0.00

Overall Performance (vs. average)

| Metric | This model | Average |
|---|---|---|
| Accuracy | 60.5% | 77.9% |
| Net score | 96.00 pts | 143.96 pts |
| Correct | 121 | 156 |
| Incorrect | 75 | 35 |
| Total Cost | $0.00 | $3.36 |
| Average response time | 3.0s | 19.0s |
| Output Tokens | 63K | 430K |
| Reasoning Tokens | 0 | 306K |
| Average confidence | 97.9% | 95.2% |
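
The page does not say how the net score or accuracy is computed. The headline figures above are, however, consistent with a rule in which each incorrect answer costs one third of a point (121 − 75/3 = 96) and with accuracy taken over 200 scored questions (121/200 = 60.5%). The sketch below only reproduces that arithmetic: the penalty of 1/3, the denominator of 200, and the function names `net_score` and `accuracy` are assumptions for illustration, not something the benchmark documents here.

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> Fraction:
    """Correct answers minus a fractional penalty per incorrect answer.

    The 1/3 penalty is an assumption inferred from the reported figures.
    """
    return correct - penalty * incorrect


def accuracy(correct: int, scored_questions: int) -> float:
    """Percentage of scored questions answered correctly."""
    return 100 * correct / scored_questions


correct, incorrect = 121, 75  # values reported on the page

print(f"{float(net_score(correct, incorrect)):.2f} pts")  # 96.00 pts
# Assumed denominator: 200 scored questions.
print(f"{accuracy(correct, 200):.1f}%")  # 60.5%
```

Using `Fraction` keeps the one-third penalty exact, so the result matches the reported 96.00 without floating-point rounding.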

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 3 | 1 | 0 | 75.0% | 87.9% |
| Anesthesiology and Resuscitation | 4 | 2 | 0 | 66.7% | 82.3% |
| Cardiology | 14 | 8 | 0 | 63.6% | 78.6% |
| Dermatology | 8 | 4 | 0 | 66.7% | 69.4% |
| Endocrinology and Nutrition | 9 | 6 | 1 | 56.3% | 83.5% |
| ENT | 5 | 3 | 0 | 62.5% | 74.8% |
| Epidemiology | 0 | 6 | 1 | 0.0% | 69.1% |
| Gastroenterology | 12 | 9 | 0 | 57.1% | 74.1% |
| Genetics | 3 | 3 | 0 | 50.0% | 69.5% |
| Geriatrics | 8 | 3 | 0 | 72.7% | 77.5% |
| Gynecology and Obstetrics | 14 | 4 | 1 | 73.7% | 86.7% |
| Health Planning and Management | 1 | 1 | 0 | 50.0% | 82.6% |
| Hematology | 5 | 6 | 0 | 45.5% | 82.7% |
| Immunology | 7 | 2 | 0 | 77.8% | 83.3% |
| Infectious Diseases | 17 | 10 | 0 | 63.0% | 74.9% |
| Legal Medicine and Bioethics | 3 | 2 | 0 | 60.0% | 68.4% |
| Medical Oncology | 18 | 6 | 1 | 72.0% | 87.2% |
| Nephrology | 8 | 5 | 1 | 57.1% | 84.8% |
| Neurology | 15 | 5 | 0 | 75.0% | 77.3% |
| Ophthalmology | 3 | 1 | 1 | 60.0% | 74.2% |
| Palliative Care | 2 | 2 | 0 | 50.0% | 78.6% |
| Pediatrics | 15 | 11 | 0 | 57.7% | 71.9% |
| Pharmacology | 11 | 5 | 1 | 64.7% | 74.1% |
| Psychiatry | 6 | 2 | 0 | 75.0% | 83.0% |
| Pulmonology | 7 | 7 | 0 | 50.0% | 80.4% |
| Radiology-Emergency | 8 | 6 | 0 | 57.1% | 69.4% |
| Rheumatology | 10 | 5 | 0 | 66.7% | 76.6% |
| Statistics | 0 | 2 | 1 | 0.0% | 76.6% |
| Traumatology | 11 | 7 | 0 | 61.1% | 79.3% |
| Urology | 5 | 2 | 0 | 71.4% | 80.7% |

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 3 | 4 | 0 | 42.9% | 78.6% |
| Biostatistics | 0 | 3 | 1 | 0.0% | 79.8% |
| Diagnosis | 57 | 30 | 1 | 64.8% | 79.9% |
| Epidemiology | 1 | 4 | 0 | 20.0% | 76.7% |
| Ethics | 2 | 1 | 0 | 66.7% | 74.1% |
| Interpretation | 22 | 19 | 1 | 52.4% | 70.7% |
| Legal | 2 | 2 | 0 | 50.0% | 64.6% |
| Pathophysiology | 12 | 14 | 1 | 44.4% | 76.1% |
| Pharmacology | 9 | 3 | 1 | 69.2% | 83.3% |
| Prevention | 8 | 4 | 0 | 66.7% | 75.6% |
| Prognosis | 7 | 0 | 0 | 100.0% | 80.8% |
| Risk | 3 | 2 | 0 | 60.0% | 85.2% |
| Tests | 15 | 12 | 0 | 55.6% | 77.9% |
| Treatment | 51 | 29 | 1 | 63.0% | 77.3% |
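
| # | Answer | Correct | Status |
|---|---|---|---|
| 1 | B | B | |
| 2 | | A | |
| 3 | C | C | |
| 4 | C | B | |
| 5 | A | A | |
| 6 | C | C | |
| 7 | C | C | |
| 8 | A | A | |
| 9 | C | A | |
| 10 | D | D | |
| 11 | C | D | |
| 12 | D | D | |
| 13 | A | B | |
| 14 | D | D | |
| 15 | D | | Annulled |
| 16 | C | B | |
| 17 | C | B | |
| 18 | A | A | |
| 19 | C | C | |
| 20 | A | A | |
| 21 | C | B | |
| 22 | C | D | |
| 23 | C | C | |
| 24 | D | D | |
| 25 | C | C | |
| 26 | D | | Annulled |
| 27 | D | C | |
| 28 | C | | Annulled |
| 29 | C | D | |
| 30 | D | B | |
| 31 | D | D | |
| 32 | A | A | |
| 33 | D | D | |
| 34 | B | D | |
| 35 | B | B | |
| 36 | D | D | |
| 37 | C | C | |
| 38 | B | C | |
| 39 | D | D | |
| 40 | C | A | |
| 41 | | D | |
| 42 | B | C | |
| 43 | C | B | |
| 44 | D | D | |
| 45 | C | D | |
| 46 | A | A | |
| 47 | A | A | |
| 48 | A | A | |
| 49 | D | D | |
| 50 | D | B | |
| 51 | D | C | |
| 52 | B | B | |
| 53 | D | D | |
| 54 | D | B | |
| 55 | C | A | |
| 56 | B | | Annulled |
| 57 | C | C | |
| 58 | B | B | |
| 59 | D | D | |
| 60 | B | A | |
| 61 | A | A | |
| 62 | | D | |
| 63 | B | B | |
| 64 | B | D | |
| 65 | A | A | |
| 66 | C | A | |
| 67 | A | B | |
| 68 | A | B | |
| 69 | D | B | |
| 70 | A | A | |
| 71 | D | D | |
| 72 | C | A | |
| 73 | C | D | |
| 74 | C | C | |
| 75 | A | A | |
| 76 | B | B | |
| 77 | B | B | |
| 78 | B | B | |
| 79 | C | C | |
| 80 | C | C | |
| 81 | C | C | |
| 82 | D | D | |
| 83 | B | B | |
| 84 | D | D | |
| 85 | D | C | |
| 86 | C | C | |
| 87 | C | A | |
| 88 | D | D | |
| 89 | B | B | |
| 90 | B | A | |
| 91 | C | B | |
| 92 | D | C | |
| 93 | B | B | |
| 94 | C | C | |
| 95 | B | A | |
| 96 | C | C | |
| 97 | D | D | |
| 98 | C | C | |
| 99 | B | A | |
| 100 | C | C | |
| 101 | C | B | |
| 102 | D | D | |
| 103 | D | A | |
| 104 | C | C | |
| 105 | A | A | |
| 106 | C | C | |
| 107 | B | B | |
| 108 | D | D | |
| 109 | A | B | |
| 110 | C | C | |
| 111 | C | A | |
| 112 | B | C | |
| 113 | B | B | |
| 114 | C | D | |
| 115 | B | D | |
| 116 | C | C | |
| 117 | A | A | |
| 118 | D | D | |
| 119 | A | C | |
| 120 | A | B | |
| 121 | A | D | |
| 122 | A | C | |
| 123 | C | C | |
| 124 | C | C | |
| 125 | D | D | |
| 126 | C | D | |
| 127 | B | B | |
| 128 | D | D | |
| 129 | A | A | |
| 130 | D | D | |
| 131 | D | D | |
| 132 | C | A | |
| 133 | B | B | |
| 134 | C | C | |
| 135 | D | B | |
| 136 | C | C | |
| 137 | A | A | |
| 138 | D | D | |
| 139 | D | D | |
| 140 | B | B | |
| 141 | A | A | |
| 142 | D | A | |
| 143 | B | B | |
| 144 | B | B | |
| 145 | B | D | |
| 146 | C | C | |
| 147 | B | B | |
| 148 | B | A | |
| 149 | D | A | |
| 150 | D | D | |
| 151 | A | A | |
| 152 | A | A | |
| 153 | C | B | |
| 154 | B | B | |
| 155 | B | B | |
| 156 | C | C | |
| 157 | | A | |
| 158 | C | C | |
| 159 | D | C | |
| 160 | B | A | |
| 161 | D | A | |
| 162 | B | | Annulled |
| 163 | D | D | |
| 164 | B | C | |
| 165 | D | A | |
| 166 | D | B | |
| 167 | C | C | |
| 168 | D | D | |
| 169 | C | B | |
| 170 | C | B | |
| 171 | C | C | |
| 172 | B | A | |
| 173 | D | A | |
| 174 | B | B | |
| 175 | B | B | |
| 176 | C | C | |
| 177 | C | C | |
| 178 | B | A | |
| 179 | C | D | |
| 180 | A | A | |
| 181 | D | B | |
| 182 | C | C | |
| 183 | D | B | |
| 184 | B | B | |
| 185 | D | B | |
| 186 | C | | Annulled |
| 187 | B | C | |
| 188 | D | D | |
| 189 | D | D | |
| 190 | B | A | |
| 191 | A | B | |
| 192 | A | A | |
| 193 | C | C | |
| 194 | A | A | |
| 195 | A | A | |
| 196 | A | A | |
| 197 | B | B | |
| 198 | C | C | |
| 199 | D | D | |
| 200 | C | C | |
| 201 | A | B | |
| 202 | A | A | |
| 203 | D | D | |
| 204 | C | C | |
| 205 | B | B | |
| 206 | B | D | |
| 207 | A | A | |
| 208 | B | C | |
| 209 | C | C | |
| 210 | A | B | |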