Llama 3.1 Euryale 70B v2.2
#232 of 291 models — MIR 2024
Net score: 125.00 pts
Accuracy: 70.0%
Correct / Incorrect: 140 / 45
Total Cost: $0.18
Overall Performance (vs. average)
| Metric | This model | Average |
|---|---|---|
| Accuracy | 70.0% | 80.5% |
| Net score | 125.00 pts | 150.85 pts |
| Correct | 140 | 161 |
| Incorrect | 45 | 30 |
| Total Cost | $0.18 | $3.32 |
| Average response time | 21.1s | 16.4s |
| Output Tokens | 117K | 427K |
| Reasoning Tokens | 0 | 310K |
| Average confidence | 92.7% | 95.4% |
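The headline figures are consistent with a negative-marking rule in which each correct answer earns one point and each incorrect answer costs a third of a point: 140 - 45/3 = 125. The page does not state this rule, so the penalty in the sketch below is an assumption inferred from the numbers, and `net_score` and `WRONG_PENALTY` are hypothetical names.

```python
from fractions import Fraction

# Assumed penalty per incorrect answer; inferred from the reported figures,
# not stated anywhere on the page.
WRONG_PENALTY = Fraction(1, 3)


def net_score(correct: int, incorrect: int) -> float:
    """Points under the assumed rule: +1 per correct, -1/3 per incorrect.

    Unanswered questions are assumed to score zero, so they are not an input.
    """
    return float(correct - incorrect * WRONG_PENALTY)


print(f"{net_score(140, 45):.2f} pts")  # 125.00 pts
```

Using `Fraction` keeps the one-third penalty exact until the final conversion, which avoids floating-point drift when incorrect counts are large.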
Subject Breakdown
| Subject | Correct | Incorrect | Unanswered | Accuracy | Avg. accuracy |
|---|---|---|---|---|---|
| Allergology | 2 | 1 | 0 | 66.7% | 90.5% |
| Anesthesiology and Resuscitation | 4 | 0 | 0 | 100.0% | 87.1% |
| Cardiology | 15 | 4 | 2 | 71.4% | 79.7% |
| Dermatology | 10 | 3 | 1 | 71.4% | 80.2% |
| Endocrinology and Nutrition | 16 | 3 | 0 | 84.2% | 84.2% |
| ENT | 5 | 1 | 1 | 71.4% | 74.4% |
| Epidemiology | 6 | 0 | 2 | 75.0% | 89.3% |
| Gastroenterology | 12 | 6 | 4 | 54.5% | 70.5% |
| Genetics | 6 | 1 | 0 | 85.7% | 86.5% |
| Geriatrics | 8 | 2 | 0 | 80.0% | 86.9% |
| Gynecology and Obstetrics | 9 | 4 | 1 | 64.3% | 81.2% |
| Health Planning and Management | 1 | 0 | 1 | 50.0% | 73.2% |
| Hematology | 7 | 6 | 0 | 53.8% | 81.5% |
| Immunology | 8 | 0 | 0 | 100.0% | 89.1% |
| Infectious Diseases | 16 | 6 | 1 | 69.6% | 81.8% |
| Legal Medicine and Bioethics | 1 | 1 | 0 | 50.0% | 91.7% |
| Medical Oncology | 15 | 3 | 3 | 71.4% | 80.2% |
| Nephrology | 9 | 4 | 0 | 69.2% | 80.8% |
| Neurology | 17 | 5 | 0 | 77.3% | 83.7% |
| Ophthalmology | 5 | 0 | 0 | 100.0% | 80.0% |
| Palliative Care | 3 | 1 | 0 | 75.0% | 88.2% |
| Pediatrics | 11 | 4 | 2 | 64.7% | 82.0% |
| Pharmacology | 16 | 6 | 1 | 69.6% | 85.4% |
| Psychiatry | 10 | 0 | 0 | 100.0% | 89.5% |
| Pulmonology | 12 | 5 | 2 | 63.2% | 80.6% |
| Radiology-Emergency | 6 | 4 | 4 | 42.9% | 64.9% |
| Rheumatology | 12 | 2 | 0 | 85.7% | 81.4% |
| Statistics | 1 | 0 | 2 | 33.3% | 91.1% |
| Traumatology | 9 | 3 | 3 | 60.0% | 74.5% |
| Urology | 5 | 1 | 0 | 83.3% | 78.2% |
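The Accuracy column matches correct answers divided by all questions in the row, with unanswered questions counted in the denominator; for Cardiology, 15 / (15 + 4 + 2) = 71.4%. A minimal sketch of that calculation follows, where the function name `accuracy_pct` is illustrative rather than anything the page defines.

```python
def accuracy_pct(correct: int, incorrect: int, unanswered: int) -> float:
    """Correct answers as a percentage of all questions, rounded to one decimal."""
    total = correct + incorrect + unanswered
    if total == 0:
        raise ValueError("row has no questions")
    return round(100 * correct / total, 1)


# Rows copied from the subject table: (correct, incorrect, unanswered)
rows = {
    "Cardiology": (15, 4, 2),
    "Gastroenterology": (12, 6, 4),
    "Statistics": (1, 0, 2),
}

for subject, counts in rows.items():
    print(subject, accuracy_pct(*counts))
```

Because unanswered questions lower accuracy under this reading, a subject such as Statistics (1 correct, 0 incorrect, 2 unanswered) reports 33.3% even though no answer given was wrong.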
Question Type Breakdown
| Type | Correct | Incorrect | Unanswered | Accuracy | Avg. accuracy |
|---|---|---|---|---|---|
| Anatomy | 4 | 1 | 1 | 66.7% | 79.8% |
| Biostatistics | 2 | 0 | 3 | 40.0% | 90.7% |
| Diagnosis | 54 | 13 | 6 | 74.0% | 79.2% |
| Epidemiology | 10 | 1 | 1 | 83.3% | 81.2% |
| Ethics | 0 | 1 | 0 | 0.0% | 94.5% |
| Interpretation | 18 | 13 | 6 | 48.6% | 69.6% |
| Pathophysiology | 27 | 6 | 0 | 81.8% | 85.4% |
| Pharmacology | 18 | 7 | 0 | 72.0% | 84.0% |
| Prevention | 9 | 1 | 2 | 75.0% | 89.8% |
| Prognosis | 6 | 1 | 0 | 85.7% | 83.9% |
| Risk | 11 | 2 | 0 | 84.6% | 83.6% |
| Tests | 12 | 5 | 4 | 57.1% | 73.9% |
| Treatment | 47 | 20 | 4 | 66.2% | 81.3% |