MedicalBenchmark
Meta: Llama 4 Maverick provider

Llama 4 Maverick

57

#57 of 319 models in the general ranking

Cumulative performance across 3 MIR exams

Net score

558.33 pts

Accuracy

94.7%

Correct / Incorrect

568 / 29

Total Cost

$0.35

Overall Performance

(vs. average)
Accuracy

94.7%

avg: 80.6%

Net score

558.33 pts

avg: 453.30 pts

Correct

568

avg: 483

Incorrect

29

avg: 90

Total Cost

$0.35

avg: $9.58

Average response time

7.9s

avg: 17.9s

Output Tokens

333K

avg: 1.3M

Reasoning Tokens

0

avg: 898K

Average confidence

99.1%

avg: 95.4%

Breakdown by Exam

MIR 2024
82
Correct
189
Incorrect
10
Accuracy
94.5%
Net score
185.66
MIR 2025
64
Correct
185
Incorrect
13
Accuracy
92.5%
Net score
180.66
MIR 2026
45
Correct
194
Incorrect
6
Accuracy
97.0%
Net score
192.00