MedicalBenchmark
OpenAI: GPT-3.5 Turbo Instruct provider

GPT-3.5 Turbo Instruct

278

#278 of 319 modelsMIR 2025

Net score

76.66 pts

Accuracy

53.0%

Correct / Incorrect

106 / 88

Total Cost

$0.31

Overall Performance

(vs. average)
Accuracy

53.0%

avg: 77.9%

Net score

76.66 pts

avg: 143.96 pts

Correct

106

avg: 156

Incorrect

88

avg: 35

Total Cost

$0.31

avg: $3.36

Average response time

3.7s

avg: 19.0s

Output Tokens

83K

avg: 430K

Reasoning Tokens

0

avg: 306K

Average confidence

95.4%

avg: 95.2%

Subject Breakdown

Allergology
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
87.9%
Anesthesiology and Resuscitation
Correct
2
Incorrect
3
Unanswered
1
Accuracy
33.3%
Average
82.3%
Cardiology
Correct
11
Incorrect
10
Unanswered
1
Accuracy
50.0%
Average
78.6%
Dermatology
Correct
8
Incorrect
4
Unanswered
0
Accuracy
66.7%
Average
69.4%
Endocrinology and Nutrition
Correct
9
Incorrect
6
Unanswered
1
Accuracy
56.3%
Average
83.5%
ENT
Correct
3
Incorrect
5
Unanswered
0
Accuracy
37.5%
Average
74.8%
Epidemiology
Correct
3
Incorrect
3
Unanswered
1
Accuracy
42.9%
Average
69.1%
Gastroenterology
Correct
13
Incorrect
7
Unanswered
1
Accuracy
61.9%
Average
74.1%
Genetics
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
69.5%
Geriatrics
Correct
6
Incorrect
5
Unanswered
0
Accuracy
54.5%
Average
77.5%
Gynecology and Obstetrics
Correct
15
Incorrect
4
Unanswered
0
Accuracy
78.9%
Average
86.7%
Health Planning and Management
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
82.6%
Hematology
Correct
4
Incorrect
7
Unanswered
0
Accuracy
36.4%
Average
82.7%
Immunology
Correct
6
Incorrect
3
Unanswered
0
Accuracy
66.7%
Average
83.3%
Infectious Diseases
Correct
12
Incorrect
12
Unanswered
3
Accuracy
44.4%
Average
74.9%
Legal Medicine and Bioethics
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
68.4%
Medical Oncology
Correct
16
Incorrect
8
Unanswered
1
Accuracy
64.0%
Average
87.2%
Nephrology
Correct
7
Incorrect
6
Unanswered
1
Accuracy
50.0%
Average
84.8%
Neurology
Correct
8
Incorrect
12
Unanswered
0
Accuracy
40.0%
Average
77.3%
Ophthalmology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
74.2%
Palliative Care
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
78.6%
Pediatrics
Correct
15
Incorrect
10
Unanswered
1
Accuracy
57.7%
Average
71.9%
Pharmacology
Correct
8
Incorrect
8
Unanswered
1
Accuracy
47.1%
Average
74.1%
Psychiatry
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
83.0%
Pulmonology
Correct
9
Incorrect
4
Unanswered
1
Accuracy
64.3%
Average
80.4%
Radiology-Emergency
Correct
6
Incorrect
8
Unanswered
0
Accuracy
42.9%
Average
69.4%
Rheumatology
Correct
9
Incorrect
6
Unanswered
0
Accuracy
60.0%
Average
76.6%
Statistics
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
76.6%
Traumatology
Correct
9
Incorrect
9
Unanswered
0
Accuracy
50.0%
Average
79.3%
Urology
Correct
3
Incorrect
4
Unanswered
0
Accuracy
42.9%
Average
80.7%

Question Type Breakdown

Anatomy
Correct
1
Incorrect
6
Unanswered
0
Accuracy
14.3%
Average
78.6%
Biostatistics
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
79.8%
Diagnosis
Correct
44
Incorrect
42
Unanswered
2
Accuracy
50.0%
Average
79.9%
Epidemiology
Correct
3
Incorrect
1
Unanswered
1
Accuracy
60.0%
Average
76.7%
Ethics
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
74.1%
Interpretation
Correct
22
Incorrect
19
Unanswered
1
Accuracy
52.4%
Average
70.7%
Legal
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
64.6%
Pathophysiology
Correct
15
Incorrect
12
Unanswered
0
Accuracy
55.6%
Average
76.1%
Pharmacology
Correct
6
Incorrect
4
Unanswered
3
Accuracy
46.2%
Average
83.3%
Prevention
Correct
7
Incorrect
3
Unanswered
2
Accuracy
58.3%
Average
75.6%
Prognosis
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
80.8%
Risk
Correct
1
Incorrect
4
Unanswered
0
Accuracy
20.0%
Average
85.2%
Tests
Correct
16
Incorrect
11
Unanswered
0
Accuracy
59.3%
Average
77.9%
Treatment
Correct
45
Incorrect
33
Unanswered
3
Accuracy
55.6%
Average
77.3%
#AnswerCorrectStatus
1DB
2AA
3CC
4AB
5AA
6CC
7CC
8AA
9CA
10AD
11DD
12DD
13BB
14AD
15DAnnulled
16BB
17CB
18DA
19CC
20CA
21CB
22CD
23AC
24DD
25CC
26Annulled
27AC
28DAnnulled
29AD
30BB
31DD
32DA
33DD
34BD
35BB
36DD
37CC
38CC
39DD
40CA
41DD
42CC
43B
44DD
45BD
46AA
47AA
48AA
49AD
50DB
51DC
52DB
53CD
54DB
55CA
56BAnnulled
57CC
58BB
59DD
60DA
61AA
62DD
63BB
64DD
65CA
66DA
67AB
68BB
69CB
70AA
71DD
72CA
73DD
74DC
75AA
76BB
77BB
78AB
79DC
80BC
81CC
82CD
83BB
84DD
85DC
86AC
87AA
88DD
89CB
90DA
91DB
92CC
93BB
94DC
95BA
96CC
97DD
98DC
99DA
100CC
101CB
102AD
103DA
104CC
105AA
106CC
107CB
108D
109BB
110CC
111DA
112DC
113AB
114DD
115DD
116CC
117AA
118DD
119C
120AB
121DD
122AC
123CC
124CC
125AD
126CD
127BB
128DD
129BA
130DD
131BD
132AA
133BB
134CC
135DB
136DC
137CA
138DD
139DD
140BB
141BA
142BA
143BB
144BB
145BD
146BC
147AB
148BA
149DA
150BD
151AA
152CA
153CB
154BB
155BB
156CC
157A
158AC
159AC
160CA
161AA
162Annulled
163DD
164BC
165DA
166BB
167DC
168DD
169CB
170BB
171C
172BA
173DA
174BB
175DB
176CC
177BC
178BA
179DD
180AA
181DB
182CC
183BB
184BB
185DB
186DAnnulled
187CC
188DD
189CD
190DA
191AB
192A
193CC
194AA
195AA
196AA
197BB
198CC
199DD
200CC
201AB
202AA
203DD
204CC
205BB
206DD
207AA
208AC
209CC
210AB