Llama 3.3 70B Instruct
#195 of 319 models in the general ranking
Cumulative performance across 3 MIR exams
Net score: 478.33 pts
Accuracy: 84.0%
Correct / Incorrect: 504 / 77
Total Cost: $0.28
Overall Performance (vs. average)
| Metric | Llama 3.3 70B Instruct | Average |
|---|---|---|
| Accuracy | 84.0% | 80.6% |
| Net score | 478.33 pts | 453.30 pts |
| Correct | 504 | 483 |
| Incorrect | 77 | 90 |
| Total Cost | $0.28 | $9.58 |
| Average response time | 14.1s | 17.9s |
| Output Tokens | 322K | 1.3M |
| Reasoning Tokens | 0 | 898K |
| Average confidence | 94.9% | 95.4% |
Breakdown by Exam
| Exam | Position | Correct | Incorrect | Accuracy | Net score | Total Cost |
|---|---|---|---|---|---|---|
| MIR 2024 | 201 | 171 | 27 | 85.5% | 162.00 | $0.09 |
| MIR 2025 | 203 | 161 | 32 | 80.5% | 150.33 | $0.10 |
| MIR 2026 | 177 | 172 | 18 | 86.0% | 166.00 | $0.09 |
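
The page does not say how the net score or the accuracy is computed. The per-exam figures are, however, consistent with two rules: net score equals correct answers minus one third of the incorrect ones, and accuracy is taken over 200 questions per exam. Both rules are inferred from the numbers, not stated in the source. A minimal Python sketch under those assumptions follows; the names `PENALTY`, `QUESTIONS_PER_EXAM`, `net_score`, `accuracy`, and `matches_table` are illustrative only.

```python
from fractions import Fraction

# Assumption: each incorrect answer costs one third of a point (inferred from the table).
PENALTY = Fraction(1, 3)
# Assumption: 200 scored questions per exam (inferred from the Accuracy column).
QUESTIONS_PER_EXAM = 200

# Rows copied from the table: exam -> (correct, incorrect, reported accuracy %, reported net score)
EXAMS = {
    "MIR 2024": (171, 27, 85.5, 162.00),
    "MIR 2025": (161, 32, 80.5, 150.33),
    "MIR 2026": (172, 18, 86.0, 166.00),
}


def net_score(correct: int, incorrect: int) -> float:
    """Correct answers minus the assumed penalty for incorrect ones, rounded to 2 decimals."""
    return round(float(correct - PENALTY * incorrect), 2)


def accuracy(correct: int, total: int = QUESTIONS_PER_EXAM) -> float:
    """Correct answers as a percentage of all questions, rounded to 1 decimal."""
    return round(100 * correct / total, 1)


def matches_table() -> bool:
    """True if every per-exam row is reproduced under the two assumptions."""
    return all(
        accuracy(c) == acc and net_score(c, i) == net
        for c, i, acc, net in EXAMS.values()
    )


if __name__ == "__main__":
    for name, (c, i, _acc, _net) in EXAMS.items():
        print(name, accuracy(c), net_score(c, i))
    print("table reproduced:", matches_table())
```

Calling `net_score(504, 77)` and `accuracy(504, 600)` applies the same rules to the cumulative totals. If the 200-question assumption holds, the questions counted as neither correct nor incorrect would be 2, 7, and 10 for the three exams respectively.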