gpt-oss-safeguard-20b
159
#159 of 319 models in the general ranking
Cumulative performance across 3 MIR exams
Net score
502.33 pts
Accuracy
87.7%
Correct / Incorrect
526 / 71
Total Cost
$0.20
Overall Performance
(vs. average)Accuracy
87.7%
avg: 80.6%
Net score
502.33 pts
avg: 453.30 pts
Correct
526
avg: 483
Incorrect
71
avg: 90
Total Cost
$0.20
avg: $9.58
Average response time
2.0s
avg: 17.9s
Output Tokens
608K
avg: 1.3M
Reasoning Tokens
356K
avg: 898K
Average confidence
99.3%
avg: 95.4%
Breakdown by Exam
| Exam | Position | Correct | Incorrect | Accuracy | Net score | Total Cost | |
|---|---|---|---|---|---|---|---|
| MIR 2024 | 171 | 177 | 22 | 88.5% | 169.66 | $0.07 | View detail |
| MIR 2025 | 152 | 172 | 28 | 86.0% | 162.66 | $0.07 | View detail |
| MIR 2026 | 162 | 177 | 21 | 88.5% | 170.00 | $0.06 | View detail |