MedicalBenchmark
OpenAI: GPT-3.5 Turbo provider

GPT-3.5 Turbo

247

#247 of 290 modelsMIR 2025

Net score

80.33 pts

Accuracy

55.0%

Correct / Incorrect

110 / 89

Total Cost

$0.15

Overall Performance

(vs. average)
Accuracy

55.0%

avg: 75.9%

Net score

80.33 pts

avg: 138.99 pts

Correct

110

avg: 152

Incorrect

89

avg: 38

Total Cost

$0.15

avg: $3.59

Average response time

3.1s

avg: 18.1s

Output Tokens

66K

avg: 443K

Reasoning Tokens

0

avg: 320K

Average confidence

98.8%

avg: 94.7%

Subject Breakdown

Allergology
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
86.9%
Anesthesiology and Resuscitation
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
81.3%
Cardiology
Correct
12
Incorrect
9
Unanswered
1
Accuracy
54.5%
Average
77.4%
Dermatology
Correct
9
Incorrect
4
Unanswered
0
Accuracy
69.2%
Average
62.8%
Endocrinology and Nutrition
Correct
10
Incorrect
6
Unanswered
0
Accuracy
62.5%
Average
82.5%
ENT
Correct
2
Incorrect
6
Unanswered
0
Accuracy
25.0%
Average
73.8%
Epidemiology
Correct
1
Incorrect
6
Unanswered
0
Accuracy
14.3%
Average
67.1%
Gastroenterology
Correct
12
Incorrect
9
Unanswered
0
Accuracy
57.1%
Average
72.9%
Genetics
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
68.2%
Geriatrics
Correct
6
Incorrect
5
Unanswered
0
Accuracy
54.5%
Average
71.2%
Gynecology and Obstetrics
Correct
14
Incorrect
5
Unanswered
0
Accuracy
73.7%
Average
85.9%
Health Planning and Management
Correct
0
Incorrect
2
Unanswered
0
Accuracy
0.0%
Average
81.6%
Hematology
Correct
6
Incorrect
5
Unanswered
0
Accuracy
54.5%
Average
81.8%
Immunology
Correct
8
Incorrect
1
Unanswered
0
Accuracy
88.9%
Average
82.5%
Infectious Diseases
Correct
15
Incorrect
13
Unanswered
0
Accuracy
53.6%
Average
71.1%
Legal Medicine and Bioethics
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
67.2%
Medical Oncology
Correct
18
Incorrect
7
Unanswered
0
Accuracy
72.0%
Average
86.3%
Nephrology
Correct
8
Incorrect
7
Unanswered
0
Accuracy
53.3%
Average
78.2%
Neurology
Correct
12
Incorrect
8
Unanswered
0
Accuracy
60.0%
Average
76.2%
Ophthalmology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
72.6%
Palliative Care
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
77.2%
Pediatrics
Correct
14
Incorrect
11
Unanswered
0
Accuracy
56.0%
Average
72.7%
Pharmacology
Correct
9
Incorrect
8
Unanswered
0
Accuracy
52.9%
Average
73.1%
Psychiatry
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
82.0%
Pulmonology
Correct
7
Incorrect
7
Unanswered
0
Accuracy
50.0%
Average
73.0%
Radiology-Emergency
Correct
5
Incorrect
9
Unanswered
0
Accuracy
35.7%
Average
67.9%
Rheumatology
Correct
8
Incorrect
6
Unanswered
0
Accuracy
57.1%
Average
74.6%
Statistics
Correct
0
Incorrect
3
Unanswered
0
Accuracy
0.0%
Average
74.9%
Traumatology
Correct
10
Incorrect
8
Unanswered
0
Accuracy
55.6%
Average
78.2%
Urology
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
79.5%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
5
Unanswered
0
Accuracy
28.6%
Average
77.1%
Biostatistics
Correct
0
Incorrect
4
Unanswered
0
Accuracy
0.0%
Average
78.4%
Diagnosis
Correct
49
Incorrect
40
Unanswered
0
Accuracy
55.1%
Average
77.9%
Epidemiology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
75.0%
Ethics
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
72.0%
Interpretation
Correct
19
Incorrect
22
Unanswered
1
Accuracy
45.2%
Average
69.3%
Legal
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
63.6%
Pathophysiology
Correct
17
Incorrect
9
Unanswered
1
Accuracy
63.0%
Average
72.6%
Pharmacology
Correct
8
Incorrect
5
Unanswered
0
Accuracy
61.5%
Average
82.4%
Prevention
Correct
9
Incorrect
3
Unanswered
0
Accuracy
75.0%
Average
74.5%
Prognosis
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
77.8%
Risk
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
84.3%
Tests
Correct
12
Incorrect
14
Unanswered
0
Accuracy
46.2%
Average
76.3%
Treatment
Correct
46
Incorrect
36
Unanswered
0
Accuracy
56.1%
Average
75.2%
#AnswerCorrectStatus
1BB
2CA
3CC
4CB
5AA
6CC
7CC
8AA
9AA
10BD
11DD
12CD
13AB
14DD
15D
16CB
17CB
18CA
19DC
20CA
21CB
22CD
23AC
24DD
25CC
26AAnnulled
27CC
28DAnnulled
29AD
30BB
31DD
32CA
33DD
34BD
35BB
36DD
37CC
38BC
39DD
40CA
41CD
42BC
43BB
44CD
45BD
46AA
47CA
48BA
49CD
50BB
51DC
52BB
53BD
54DB
55CA
56BAnnulled
57CC
58BB
59DD
60BA
61AA
62DD
63CB
64DD
65DA
66DA
67BB
68DB
69DB
70AA
71DD
72CA
73DD
74CC
75AA
76BB
77BB
78BB
79CC
80AC
81CC
82DD
83AB
84DD
85CC
86BC
87AA
88DD
89BB
90BA
91AB
92DC
93BB
94CC
95BA
96CC
97DD
98CC
99A
100CC
101CB
102CD
103DA
104CC
105AA
106CC
107BB
108DD
109BB
110CC
111AA
112BC
113BB
114AD
115BD
116CC
117AA
118DD
119CC
120AB
121AD
122AC
123CC
124CC
125DD
126BD
127AB
128DD
129AA
130DD
131BD
132CA
133BB
134CC
135DB
136CC
137CA
138DD
139DD
140CB
141AA
142AA
143BB
144BB
145BD
146AC
147BB
148BA
149CA
150DA
151AA
152AA
153AB
154BB
155BB
156CC
157CA
158BC
159AC
160AA
161DA
162B
163CD
164DC
165CA
166BB
167CC
168DD
169DB
170BB
171CC
172BA
173DA
174BB
175BB
176CC
177BC
178BA
179CD
180AA
181BB
182CC
183DB
184BB
185AB
186CAnnulled
187CC
188DD
189CD
190AA
191AB
192BA
193CC
194AA
195AA
196AA
197BB
198DC
199DD
200CC
201AB
202DA
203DD
204DC
205BB
206CD
207AA
208CC
209CC
210AB