MedicalBenchmark
Meta: Llama 3.2 1B Instruct provider

Llama 3.2 1B Instruct

316

#316 of 319 models in the general ranking

Cumulative performance across 3 MIR exams

Net score

38.66 pts

Accuracy

17.0%

Correct / Incorrect

102 / 190

Total Cost

$0.07

Overall Performance

(vs. average)
Accuracy

17.0%

avg: 80.6%

Net score

38.66 pts

avg: 453.30 pts

Correct

102

avg: 483

Incorrect

190

avg: 90

Total Cost

$0.07

avg: $9.58

Average response time

3.4s

avg: 17.9s

Output Tokens

309K

avg: 1.3M

Reasoning Tokens

0

avg: 898K

Average confidence

49.1%

avg: 95.4%

Breakdown by Exam

MIR 2024
317
Correct
29
Incorrect
65
Accuracy
14.5%
Net score
7.33
MIR 2025
315
Correct
29
Incorrect
67
Accuracy
14.5%
Net score
6.66
MIR 2026
315
Correct
44
Incorrect
58
Accuracy
22.0%
Net score
24.66