MedicalBenchmark
OpenAI: GPT-4.1 Mini provider

GPT-4.1 Mini

103

#103 of 319 models in the general ranking

Cumulative performance across 3 MIR exams

Net score

539.33 pts

Accuracy

92.3%

Correct / Incorrect

554 / 44

Total Cost

$0.58

Overall Performance

(vs. average)
Accuracy

92.3%

avg: 80.6%

Net score

539.33 pts

avg: 453.30 pts

Correct

554

avg: 483

Incorrect

44

avg: 90

Total Cost

$0.58

avg: $9.58

Average response time

6.5s

avg: 17.9s

Output Tokens

288K

avg: 1.3M

Reasoning Tokens

0

avg: 898K

Average confidence

99.7%

avg: 95.4%

Breakdown by Exam

MIR 2024
97
Correct
188
Incorrect
11
Accuracy
94.0%
Net score
184.33
MIR 2025
105
Correct
179
Incorrect
20
Accuracy
89.5%
Net score
172.33
MIR 2026
107
Correct
187
Incorrect
13
Accuracy
93.5%
Net score
182.66