MedicalBenchmark
OpenAI: GPT-3.5 Turbo 16k provider

GPT-3.5 Turbo 16k

252

#252 of 291 modelsMIR 2024

Net score

88.33 pts

Accuracy

58.0%

Correct / Incorrect

116 / 83

Total Cost

$0.00

Overall Performance

(vs. average)
Accuracy

58.0%

avg: 80.5%

Net score

88.33 pts

avg: 150.85 pts

Correct

116

avg: 161

Incorrect

83

avg: 30

Total Cost

$0.00

avg: $3.32

Average response time

2.3s

avg: 16.4s

Output Tokens

60K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

98.7%

avg: 95.4%

Subject Breakdown

Allergology
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
90.5%
Anesthesiology and Resuscitation
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
87.1%
Cardiology
Correct
7
Incorrect
14
Unanswered
0
Accuracy
33.3%
Average
79.7%
Dermatology
Correct
10
Incorrect
4
Unanswered
0
Accuracy
71.4%
Average
80.2%
Endocrinology and Nutrition
Correct
9
Incorrect
10
Unanswered
0
Accuracy
47.4%
Average
84.2%
ENT
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
74.4%
Epidemiology
Correct
4
Incorrect
4
Unanswered
0
Accuracy
50.0%
Average
89.3%
Gastroenterology
Correct
14
Incorrect
8
Unanswered
0
Accuracy
63.6%
Average
70.5%
Genetics
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
86.5%
Geriatrics
Correct
6
Incorrect
4
Unanswered
0
Accuracy
60.0%
Average
86.9%
Gynecology and Obstetrics
Correct
9
Incorrect
5
Unanswered
0
Accuracy
64.3%
Average
81.2%
Health Planning and Management
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
73.2%
Hematology
Correct
8
Incorrect
5
Unanswered
0
Accuracy
61.5%
Average
81.5%
Immunology
Correct
4
Incorrect
4
Unanswered
0
Accuracy
50.0%
Average
89.1%
Infectious Diseases
Correct
14
Incorrect
8
Unanswered
1
Accuracy
60.9%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
14
Incorrect
7
Unanswered
0
Accuracy
66.7%
Average
80.2%
Nephrology
Correct
7
Incorrect
6
Unanswered
0
Accuracy
53.8%
Average
80.8%
Neurology
Correct
14
Incorrect
8
Unanswered
0
Accuracy
63.6%
Average
83.7%
Ophthalmology
Correct
1
Incorrect
4
Unanswered
0
Accuracy
20.0%
Average
80.0%
Palliative Care
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
88.2%
Pediatrics
Correct
11
Incorrect
5
Unanswered
1
Accuracy
64.7%
Average
82.0%
Pharmacology
Correct
18
Incorrect
5
Unanswered
0
Accuracy
78.3%
Average
85.4%
Psychiatry
Correct
10
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
89.5%
Pulmonology
Correct
9
Incorrect
10
Unanswered
0
Accuracy
47.4%
Average
80.6%
Radiology-Emergency
Correct
8
Incorrect
6
Unanswered
0
Accuracy
57.1%
Average
64.9%
Rheumatology
Correct
7
Incorrect
7
Unanswered
0
Accuracy
50.0%
Average
81.4%
Statistics
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
91.1%
Traumatology
Correct
7
Incorrect
8
Unanswered
0
Accuracy
46.7%
Average
74.5%
Urology
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
79.8%
Biostatistics
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
90.7%
Diagnosis
Correct
45
Incorrect
28
Unanswered
0
Accuracy
61.6%
Average
79.2%
Epidemiology
Correct
7
Incorrect
5
Unanswered
0
Accuracy
58.3%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
17
Incorrect
20
Unanswered
0
Accuracy
45.9%
Average
69.6%
Pathophysiology
Correct
18
Incorrect
15
Unanswered
0
Accuracy
54.5%
Average
85.4%
Pharmacology
Correct
17
Incorrect
8
Unanswered
0
Accuracy
68.0%
Average
84.0%
Prevention
Correct
7
Incorrect
5
Unanswered
0
Accuracy
58.3%
Average
89.8%
Prognosis
Correct
5
Incorrect
2
Unanswered
0
Accuracy
71.4%
Average
83.9%
Risk
Correct
10
Incorrect
3
Unanswered
0
Accuracy
76.9%
Average
83.6%
Tests
Correct
8
Incorrect
13
Unanswered
0
Accuracy
38.1%
Average
73.9%
Treatment
Correct
42
Incorrect
28
Unanswered
1
Accuracy
59.2%
Average
81.3%
#AnswerCorrectStatus
1BB
2DD
3DB
4BC
5BC
6DB
7DD
8CC
9BA
10DD
11DD
12AA
13DC
14BA
15DB
16CA
17CC
18AA
19BB
20AC
21CD
22BB
23AA
24DA
25BC
26BB
27DC
28DA
29BB
30DC
31BD
32BA
33CC
34CB
35DD
36CD
37AA
38CA
39CC
40BB
41CC
42AD
43DA
44CD
45AD
46BB
47CC
48CC
49BB
50BC
51CA
52DD
53CC
54BB
55BC
56DD
57DA
58DA
59BA
60CA
61AA
62CD
63DD
64AAnnulled
65DD
66CC
67BB
68BAnnulled
69AA
70BB
71AB
72CD
73B
74CC
75AB
76AA
77DD
78CC
79CB
80AA
81DC
82CC
83BB
84CC
85AA
86AA
87BB
88DD
89BB
90AA
91CD
92AA
93CC
94BB
95AD
96BB
97BB
98AB
99AA
100AB
101DA
102BD
103BB
104CD
105BB
106CC
107CC
108BB
109CD
110CD
111BB
112BC
113DAnnulled
114DD
115AD
116DA
117AD
118DD
119CA
120CC
121BA
122AB
123CD
124BD
125CB
126DD
127AA
128CB
129DD
130CC
131CC
132CD
133AA
134CC
135BA
136DD
137AA
138CC
139AA
140BC
141BB
142BC
143AA
144DD
145CC
146BC
147CC
148AA
149AC
150DD
151AA
152AA
153CC
154AB
155DD
156BC
157AC
158DD
159DD
160AB
161BB
162DB
163BB
164DB
165CA
166DC
167CA
168BB
169CC
170AA
171DD
172BB
173BA
174BB
175CA
176DC
177CC
178DB
179CC
180DAnnulled
181CB
182DD
183AC
184CA
185CC
186DD
187AA
188CC
189CD
190DD
191BB
192BB
193CC
194CC
195BC
196BB
197AA
198BB
199DD
200AA
201BB
202DD
203CB
204DD
205BD
206CAnnulled
207BA
208AA
209B
210DD