MedicalBenchmark
OpenAI: GPT-4o (2024-05-13) provider

GPT-4o (2024-05-13)

84

#84 of 319 models in the general ranking

Cumulative performance across 3 MIR exams

Net score

547.00 pts

Accuracy

93.3%

Correct / Incorrect

560 / 39

Total Cost

$5.67

Overall Performance

(vs. average)
Accuracy

93.3%

avg: 80.6%

Net score

547.00 pts

avg: 453.30 pts

Correct

560

avg: 483

Incorrect

39

avg: 90

Total Cost

$5.67

avg: $9.58

Average response time

3.4s

avg: 17.9s

Output Tokens

280K

avg: 1.3M

Reasoning Tokens

0

avg: 898K

Average confidence

99.4%

avg: 95.4%

Breakdown by Exam

MIR 2024
61
Correct
191
Incorrect
9
Accuracy
95.5%
Net score
188.00
MIR 2025
125
Correct
176
Incorrect
23
Accuracy
88.0%
Net score
168.33
MIR 2026
56
Correct
193
Incorrect
7
Accuracy
96.5%
Net score
190.66