MedicalBenchmark
OpenAI: GPT-3.5 Turbo provider

GPT-3.5 Turbo

240

#240 of 291 modelsMIR 2024

Net score

105.66 pts

Accuracy

64.5%

Correct / Incorrect

129 / 70

Total Cost

$0.14

Overall Performance

(vs. average)
Accuracy

64.5%

avg: 80.5%

Net score

105.66 pts

avg: 150.85 pts

Correct

129

avg: 161

Incorrect

70

avg: 30

Total Cost

$0.14

avg: $3.32

Average response time

2.4s

avg: 16.4s

Output Tokens

59K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

99.6%

avg: 95.4%

Subject Breakdown

Allergology
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
90.5%
Anesthesiology and Resuscitation
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
87.1%
Cardiology
Correct
13
Incorrect
8
Unanswered
0
Accuracy
61.9%
Average
79.7%
Dermatology
Correct
10
Incorrect
4
Unanswered
0
Accuracy
71.4%
Average
80.2%
Endocrinology and Nutrition
Correct
12
Incorrect
7
Unanswered
0
Accuracy
63.2%
Average
84.2%
ENT
Correct
3
Incorrect
4
Unanswered
0
Accuracy
42.9%
Average
74.4%
Epidemiology
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
89.3%
Gastroenterology
Correct
13
Incorrect
8
Unanswered
1
Accuracy
59.1%
Average
70.5%
Genetics
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
86.5%
Geriatrics
Correct
7
Incorrect
3
Unanswered
0
Accuracy
70.0%
Average
86.9%
Gynecology and Obstetrics
Correct
10
Incorrect
4
Unanswered
0
Accuracy
71.4%
Average
81.2%
Health Planning and Management
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
73.2%
Hematology
Correct
12
Incorrect
1
Unanswered
0
Accuracy
92.3%
Average
81.5%
Immunology
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
89.1%
Infectious Diseases
Correct
16
Incorrect
7
Unanswered
0
Accuracy
69.6%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
13
Incorrect
7
Unanswered
1
Accuracy
61.9%
Average
80.2%
Nephrology
Correct
7
Incorrect
6
Unanswered
0
Accuracy
53.8%
Average
80.8%
Neurology
Correct
11
Incorrect
11
Unanswered
0
Accuracy
50.0%
Average
83.7%
Ophthalmology
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
80.0%
Palliative Care
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
88.2%
Pediatrics
Correct
9
Incorrect
8
Unanswered
0
Accuracy
52.9%
Average
82.0%
Pharmacology
Correct
16
Incorrect
6
Unanswered
1
Accuracy
69.6%
Average
85.4%
Psychiatry
Correct
8
Incorrect
2
Unanswered
0
Accuracy
80.0%
Average
89.5%
Pulmonology
Correct
11
Incorrect
8
Unanswered
0
Accuracy
57.9%
Average
80.6%
Radiology-Emergency
Correct
8
Incorrect
6
Unanswered
0
Accuracy
57.1%
Average
64.9%
Rheumatology
Correct
11
Incorrect
3
Unanswered
0
Accuracy
78.6%
Average
81.4%
Statistics
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
91.1%
Traumatology
Correct
10
Incorrect
5
Unanswered
0
Accuracy
66.7%
Average
74.5%
Urology
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
79.8%
Biostatistics
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
90.7%
Diagnosis
Correct
41
Incorrect
32
Unanswered
0
Accuracy
56.2%
Average
79.2%
Epidemiology
Correct
8
Incorrect
4
Unanswered
0
Accuracy
66.7%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
22
Incorrect
15
Unanswered
0
Accuracy
59.5%
Average
69.6%
Pathophysiology
Correct
20
Incorrect
13
Unanswered
0
Accuracy
60.6%
Average
85.4%
Pharmacology
Correct
18
Incorrect
6
Unanswered
1
Accuracy
72.0%
Average
84.0%
Prevention
Correct
11
Incorrect
1
Unanswered
0
Accuracy
91.7%
Average
89.8%
Prognosis
Correct
5
Incorrect
2
Unanswered
0
Accuracy
71.4%
Average
83.9%
Risk
Correct
9
Incorrect
4
Unanswered
0
Accuracy
69.2%
Average
83.6%
Tests
Correct
14
Incorrect
7
Unanswered
0
Accuracy
66.7%
Average
73.9%
Treatment
Correct
50
Incorrect
20
Unanswered
1
Accuracy
70.4%
Average
81.3%
#AnswerCorrectStatus
1BB
2DD
3DB
4CC
5BC
6BB
7DD
8CC
9DA
10DD
11DD
12AA
13CC
14BA
15DB
16AA
17CC
18AA
19BB
20AC
21CD
22BB
23AA
24BA
25CC
26BB
27CC
28DA
29BB
30CC
31BD
32DA
33CC
34CB
35DD
36DD
37AA
38DA
39CC
40BB
41CC
42DD
43AA
44DD
45AD
46BB
47CC
48CC
49CB
50BC
51AA
52DD
53CC
54BB
55CC
56DD
57CA
58DA
59AA
60DA
61DA
62BD
63BD
64AAnnulled
65DD
66CC
67BB
68CAnnulled
69AA
70BB
71CB
72DD
73BB
74CC
75BB
76AA
77AD
78AC
79DB
80AA
81DC
82DC
83DB
84CC
85AA
86AA
87BB
88DD
89BB
90DA
91DD
92AA
93BC
94CB
95BD
96BB
97BB
98DB
99AA
100BB
101DA
102DD
103BB
104DD
105DB
106BC
107CC
108BB
109CD
110CD
111BB
112CC
113BAnnulled
114BD
115DD
116AA
117AD
118DD
119CA
120CC
121AA
122DB
123BD
124DD
125BB
126DD
127DA
128DB
129CD
130CC
131CC
132CD
133AA
134DC
135BA
136DD
137AA
138DC
139AA
140CC
141DB
142CC
143AA
144AD
145CC
146CC
147BC
148AA
149CC
150DD
151CA
152AA
153C
154BB
155DD
156CC
157AC
158DD
159DD
160BB
161BB
162BB
163BB
164BB
165CA
166AC
167BA
168BB
169CC
170AA
171BD
172BB
173BA
174BB
175AA
176DC
177CC
178BB
179CC
180BAnnulled
181CB
182DD
183AC
184CA
185DC
186DD
187AA
188CC
189AD
190DD
191BB
192BB
193DC
194CC
195BC
196BB
197AA
198BB
199DD
200BA
201BB
202DD
203BB
204DD
205DD
206DAnnulled
207BA
208AA
209CB
210DD