CodeLLaMa 7B Instruct Solidity
#318 of 320 models — MIR 2024
Net score
2.33 pts
Accuracy
5.5%
Correct / Incorrect
11 / 26
Total Cost
$0.35
Overall Performance (vs. average)
Accuracy
5.5%
avg: 81.3%
Net score
2.33 pts
avg: 153.08 pts
Correct
11
avg: 163
Incorrect
26
avg: 29
Total Cost
$0.35
avg: $3.09
Average response time
31.1s
avg: 17.7s
Output Tokens
209K
avg: 414K
Reasoning Tokens
0
avg: 296K
Average confidence
24.0%
avg: 95.7%
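
The net score and accuracy figures above can be reproduced from the raw counts. The following is a minimal sketch, not the leaderboard's documented method: it assumes +1 point per correct answer and a 1/3-point penalty per incorrect answer (a rule not stated on this page, inferred because it yields 2.33 from 11 correct and 26 incorrect), and it assumes 200 scored questions (inferred from 11 correct at 5.5% accuracy). The function names `net_score` and `accuracy_pct` are hypothetical.

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> float:
    """Assumed rule: +1 per correct answer, minus `penalty` per incorrect one.

    Unanswered questions are assumed to contribute nothing.
    """
    return float(correct - penalty * incorrect)


def accuracy_pct(correct: int, total_questions: int) -> float:
    """Share of all scored questions answered correctly, as a percentage."""
    return 100 * correct / total_questions


# Counts taken from the summary above; 200 is an inferred total, not a stated one.
print(f"{net_score(11, 26):.2f} pts")    # 2.33 pts
print(f"{accuracy_pct(11, 200):.1f}%")   # 5.5%
```

Under these assumptions the 26 incorrect answers cost about 8.67 points, which is why 11 correct answers net only 2.33.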
Subject Breakdown
| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 0 | 1 | 2 | 0.0% | 90.8% |
| Anesthesiology and Resuscitation | 0 | 0 | 4 | 0.0% | 87.7% |
| Cardiology | 0 | 2 | 19 | 0.0% | 80.4% |
| Dermatology | 0 | 1 | 13 | 0.0% | 81.0% |
| Endocrinology and Nutrition | 0 | 3 | 16 | 0.0% | 85.1% |
| ENT | 0 | 3 | 4 | 0.0% | 75.1% |
| Epidemiology | 1 | 1 | 6 | 12.5% | 89.7% |
| Gastroenterology | 0 | 5 | 17 | 0.0% | 71.5% |
| Genetics | 0 | 1 | 6 | 0.0% | 87.1% |
| Geriatrics | 1 | 0 | 9 | 10.0% | 87.7% |
| Gynecology and Obstetrics | 1 | 2 | 11 | 7.1% | 82.0% |
| Health Planning and Management | 0 | 0 | 2 | 0.0% | 75.1% |
| Hematology | 2 | 0 | 11 | 15.4% | 82.4% |
| Immunology | 0 | 1 | 7 | 0.0% | 89.7% |
| Infectious Diseases | 1 | 2 | 20 | 4.3% | 82.5% |
| Legal Medicine and Bioethics | 0 | 0 | 2 | 0.0% | 91.8% |
| Medical Oncology | 2 | 3 | 16 | 9.5% | 80.9% |
| Nephrology | 1 | 0 | 12 | 7.7% | 81.8% |
| Neurology | 3 | 3 | 16 | 13.6% | 84.5% |
| Ophthalmology | 0 | 1 | 4 | 0.0% | 81.3% |
| Palliative Care | 0 | 1 | 3 | 0.0% | 88.6% |
| Pediatrics | 1 | 0 | 16 | 5.9% | 82.9% |
| Pharmacology | 3 | 0 | 20 | 13.0% | 85.8% |
| Psychiatry | 0 | 1 | 9 | 0.0% | 90.0% |
| Pulmonology | 2 | 2 | 15 | 10.5% | 81.6% |
| Radiology-Emergency | 0 | 2 | 12 | 0.0% | 66.0% |
| Rheumatology | 0 | 1 | 13 | 0.0% | 82.4% |
| Statistics | 1 | 0 | 2 | 33.3% | 91.6% |
| Traumatology | 0 | 3 | 12 | 0.0% | 75.4% |
| Urology | 1 | 1 | 4 | 16.7% | 79.0% |
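
The Accuracy column in the table above appears to count unanswered questions against the model: each value matches Correct divided by the sum of Correct, Incorrect, and Unanswered. A minimal sketch of that reading follows; the helper name `row_accuracy` is hypothetical, and the formula is inferred from the rows rather than stated by the page.

```python
def row_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Percentage of all questions in a row answered correctly.

    Assumption: unanswered questions stay in the denominator.
    """
    total = correct + incorrect + unanswered
    if total == 0:
        return 0.0
    return round(100 * correct / total, 1)


# (correct, incorrect, unanswered) copied from three rows of the table.
rows = {
    "Epidemiology": (1, 1, 6),
    "Hematology": (2, 0, 11),
    "Statistics": (1, 0, 2),
}
for subject, counts in rows.items():
    print(subject, row_accuracy(*counts))
# Epidemiology 12.5
# Hematology 15.4
# Statistics 33.3
```

Counting only answered questions would give different figures (for example 50.0% for Epidemiology), so the low per-subject accuracy mostly reflects the large Unanswered counts.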
Question Type Breakdown
| Type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 0 | 3 | 3 | 0.0% | 81.1% |
| Biostatistics | 1 | 0 | 4 | 20.0% | 91.3% |
| Diagnosis | 3 | 13 | 57 | 4.1% | 80.0% |
| Epidemiology | 1 | 1 | 10 | 8.3% | 82.1% |
| Ethics | 0 | 0 | 1 | 0.0% | 94.0% |
| Interpretation | 3 | 5 | 29 | 8.1% | 70.5% |
| Pathophysiology | 2 | 4 | 27 | 6.1% | 86.1% |
| Pharmacology | 2 | 1 | 22 | 8.0% | 84.7% |
| Prevention | 0 | 1 | 11 | 0.0% | 90.3% |
| Prognosis | 1 | 0 | 6 | 14.3% | 84.6% |
| Risk | 1 | 0 | 12 | 7.7% | 84.5% |
| Tests | 2 | 3 | 16 | 9.5% | 75.0% |
| Treatment | 2 | 10 | 59 | 2.8% | 82.1% |