MedicalBenchmark
GPT-3.5 Turbo Instruct (provider: OpenAI)

#252 of 290 models, MIR 2025

Net score: 74.33 pts
Accuracy: 52.0%
Correct / Incorrect: 104 / 89
Total cost: $0.31

Overall Performance (vs. average)

Metric                 Model       Average
Accuracy               52.0%       75.9%
Net score              74.33 pts   138.99 pts
Correct                104         152
Incorrect              89          38
Total cost             $0.31       $3.59
Average response time  3.7s        18.1s
Output tokens          83K         443K
Reasoning tokens       0           320K
Average confidence     95.4%       94.7%
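
The page does not say how the net score is derived. The reported figures are consistent with a penalty of one third of a point per wrong answer (104 - 89/3 ≈ 74.33), so the sketch below assumes that rule. The function name `net_score` and the `penalty` parameter are illustrative and do not come from the benchmark itself.

```python
def net_score(correct: int, incorrect: int, penalty: float = 1 / 3) -> float:
    """Penalised score: each wrong answer subtracts `penalty` points.

    The 1/3 penalty is an assumption inferred from the reported numbers,
    not a rule stated on the page.
    """
    return correct - incorrect * penalty


# Counts reported for GPT-3.5 Turbo Instruct on MIR 2025.
print(f"{net_score(104, 89):.2f} pts")  # prints "74.33 pts"
```

Unanswered questions do not enter this calculation under the assumed rule; only correct and incorrect answers move the score.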

Subject Breakdown

Subject                           Correct  Incorrect  Unanswered  Accuracy  Average
Allergology                       2        2          0           50.0%     86.9%
Anesthesiology and Resuscitation  2        3          1           33.3%     81.3%
Cardiology                        11       10         1           50.0%     77.4%
Dermatology                       8        4          1           61.5%     62.8%
Endocrinology and Nutrition       9        6          1           56.3%     82.5%
ENT                               3        5          0           37.5%     73.8%
Epidemiology                      3        3          1           42.9%     67.1%
Gastroenterology                  13       7          1           61.9%     72.9%
Genetics                          2        4          0           33.3%     68.2%
Geriatrics                        6        5          0           54.5%     71.2%
Gynecology and Obstetrics         15       4          0           78.9%     85.9%
Health Planning and Management    1        1          0           50.0%     81.6%
Hematology                        4        7          0           36.4%     81.8%
Immunology                        6        3          0           66.7%     82.5%
Infectious Diseases               12       12         4           42.9%     71.1%
Legal Medicine and Bioethics      2        3          0           40.0%     67.2%
Medical Oncology                  16       8          1           64.0%     86.3%
Nephrology                        7        7          1           46.7%     78.2%
Neurology                         8        12         0           40.0%     76.2%
Ophthalmology                     2        3          0           40.0%     72.6%
Palliative Care                   2        2          0           50.0%     77.2%
Pediatrics                        14       10         1           56.0%     72.7%
Pharmacology                      8        8          1           47.1%     73.1%
Psychiatry                        6        2          0           75.0%     82.0%
Pulmonology                       8        5          1           57.1%     73.0%
Radiology-Emergency               6        8          0           42.9%     67.9%
Rheumatology                      8        6          0           57.1%     74.6%
Statistics                        1        2          0           33.3%     74.9%
Traumatology                      9        9          0           50.0%     78.2%
Urology                           3        4          0           42.9%     79.5%
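
The per-subject accuracy figures appear to count unanswered questions in the denominator: for Anesthesiology and Resuscitation, 2 correct out of 2 + 3 + 1 questions gives 33.3%. A minimal sketch of that calculation follows, assuming this is how the page computes accuracy. The helper name `accuracy` is illustrative.

```python
def accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Percentage of correct answers over all questions in the row.

    Assumption: unanswered questions count toward the total, which is
    what the reported percentages suggest.
    """
    total = correct + incorrect + unanswered
    return 100 * correct / total if total else 0.0


# (correct, incorrect, unanswered) taken from three rows of the breakdown.
rows = {
    "Anesthesiology and Resuscitation": (2, 3, 1),
    "Dermatology": (8, 4, 1),
    "Infectious Diseases": (12, 12, 4),
}
for subject, counts in rows.items():
    print(f"{subject}: {accuracy(*counts):.1f}%")
```

Run on these three rows, the sketch reproduces the listed values of 33.3%, 61.5% and 42.9%.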

Question Type Breakdown

Question type    Correct  Incorrect  Unanswered  Accuracy  Average
Anatomy          1        6          0           14.3%     77.1%
Biostatistics    2        2          0           50.0%     78.4%
Diagnosis        44       43         2           49.4%     77.9%
Epidemiology     3        1          1           60.0%     75.0%
Ethics           1        2          0           33.3%     72.0%
Interpretation   22       19         1           52.4%     69.3%
Legal            1        3          0           25.0%     63.6%
Pathophysiology  15       12         0           55.6%     72.6%
Pharmacology     6        4          3           46.2%     82.4%
Prevention       7        3          2           58.3%     74.5%
Prognosis        3        3          0           50.0%     77.8%
Risk             1        4          0           20.0%     84.3%
Tests            15       11         0           57.7%     76.3%
Treatment        44       34         4           53.7%     75.2%

# Answer Correct Status
1 D B
2 A A
3 C C
4 A B
5 A A
6 C C
7 C C
8 A A
9 C A
10 A D
11 D D
12 D D
13 B B
14 A D
15 D
16 B B
17 C B
18 D A
19 C C
20 C A
21 C B
22 C D
23 A C
24 D D
25 C C
26 Annulled
27 A C
28 D Annulled
29 A D
30 B B
31 D D
32 D A
33 D D
34 B D
35 B B
36 D D
37 C C
38 C C
39 D D
40 C A
41 D D
42 C C
43 B
44 D D
45 B D
46 A A
47 A A
48 A A
49 A D
50 D B
51 D C
52 D B
53 C D
54 D B
55 C A
56 B Annulled
57 C C
58 B B
59 D D
60 D A
61 A A
62 D D
63 B B
64 D D
65 C A
66 D A
67 A B
68 B B
69 C B
70 A A
71 D D
72 C A
73 D D
74 D C
75 A A
76 B B
77 B B
78 A B
79 D C
80 B C
81 C C
82 C D
83 B B
84 D D
85 D C
86 A C
87 A A
88 D D
89 C B
90 D A
91 D B
92 C C
93 B B
94 D C
95 B A
96 C C
97 D D
98 D C
99 D A
100 C C
101 C B
102 A D
103 D A
104 C C
105 A A
106 C C
107 C B
108 D
109 B B
110 C C
111 D A
112 D C
113 A B
114 D D
115 D D
116 C C
117 A A
118 D D
119 C
120 A B
121 D D
122 A C
123 C C
124 C C
125 A D
126 C D
127 B B
128 D D
129 B A
130 D D
131 B D
132 A A
133 B B
134 C C
135 D B
136 D C
137 C A
138 D D
139 D D
140 B B
141 B A
142 B A
143 B B
144 B B
145 B D
146 B C
147 A B
148 B A
149 D A
150 B A
151 A A
152 C A
153 C B
154 B B
155 B B
156 C C
157 A
158 A C
159 A C
160 C A
161 A A
162
163 D D
164 B C
165 D A
166 B B
167 D C
168 D D
169 C B
170 B B
171 C
172 B A
173 D A
174 B B
175 D B
176 C C
177 B C
178 B A
179 D D
180 A A
181 D B
182 C C
183 B B
184 B B
185 D B
186 D Annulled
187 C C
188 D D
189 C D
190 D A
191 A B
192 A
193 C C
194 A A
195 A A
196 A A
197 B B
198 C C
199 D D
200 C C
201 A B
202 A A
203 D D
204 C C
205 B B
206 D D
207 A A
208 A C
209 C C
210 A B