MedicalBenchmark
Mistral: Mistral 7B Instruct v0.2 provider

Mistral 7B Instruct v0.2

268

#268 of 291 modelsMIR 2024

Net score

54.33 pts

Accuracy

43.5%

Correct / Incorrect

87 / 98

Total Cost

$0.04

Overall Performance

(vs. average)
Accuracy

43.5%

avg: 80.5%

Net score

54.33 pts

avg: 150.85 pts

Correct

87

avg: 161

Incorrect

98

avg: 30

Total Cost

$0.04

avg: $3.32

Average response time

2.3s

avg: 16.4s

Output Tokens

72K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

92.1%

avg: 95.4%

Subject Breakdown

Allergology
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
90.5%
Anesthesiology and Resuscitation
Correct
1
Incorrect
2
Unanswered
1
Accuracy
25.0%
Average
87.1%
Cardiology
Correct
10
Incorrect
10
Unanswered
1
Accuracy
47.6%
Average
79.7%
Dermatology
Correct
7
Incorrect
6
Unanswered
1
Accuracy
50.0%
Average
80.2%
Endocrinology and Nutrition
Correct
8
Incorrect
10
Unanswered
1
Accuracy
42.1%
Average
84.2%
ENT
Correct
5
Incorrect
1
Unanswered
1
Accuracy
71.4%
Average
74.4%
Epidemiology
Correct
5
Incorrect
2
Unanswered
1
Accuracy
62.5%
Average
89.3%
Gastroenterology
Correct
9
Incorrect
9
Unanswered
4
Accuracy
40.9%
Average
70.5%
Genetics
Correct
3
Incorrect
4
Unanswered
0
Accuracy
42.9%
Average
86.5%
Geriatrics
Correct
5
Incorrect
3
Unanswered
2
Accuracy
50.0%
Average
86.9%
Gynecology and Obstetrics
Correct
6
Incorrect
8
Unanswered
0
Accuracy
42.9%
Average
81.2%
Health Planning and Management
Correct
1
Incorrect
0
Unanswered
1
Accuracy
50.0%
Average
73.2%
Hematology
Correct
5
Incorrect
8
Unanswered
0
Accuracy
38.5%
Average
81.5%
Immunology
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
89.1%
Infectious Diseases
Correct
8
Incorrect
13
Unanswered
2
Accuracy
34.8%
Average
81.8%
Legal Medicine and Bioethics
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
91.7%
Medical Oncology
Correct
13
Incorrect
6
Unanswered
2
Accuracy
61.9%
Average
80.2%
Nephrology
Correct
3
Incorrect
9
Unanswered
1
Accuracy
23.1%
Average
80.8%
Neurology
Correct
8
Incorrect
13
Unanswered
1
Accuracy
36.4%
Average
83.7%
Ophthalmology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
80.0%
Palliative Care
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
88.2%
Pediatrics
Correct
4
Incorrect
13
Unanswered
0
Accuracy
23.5%
Average
82.0%
Pharmacology
Correct
13
Incorrect
7
Unanswered
3
Accuracy
56.5%
Average
85.4%
Psychiatry
Correct
5
Incorrect
4
Unanswered
1
Accuracy
50.0%
Average
89.5%
Pulmonology
Correct
8
Incorrect
10
Unanswered
1
Accuracy
42.1%
Average
80.6%
Radiology-Emergency
Correct
8
Incorrect
6
Unanswered
0
Accuracy
57.1%
Average
64.9%
Rheumatology
Correct
4
Incorrect
9
Unanswered
1
Accuracy
28.6%
Average
81.4%
Statistics
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
91.1%
Traumatology
Correct
6
Incorrect
8
Unanswered
1
Accuracy
40.0%
Average
74.5%
Urology
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
79.8%
Biostatistics
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
90.7%
Diagnosis
Correct
28
Incorrect
40
Unanswered
5
Accuracy
38.4%
Average
79.2%
Epidemiology
Correct
6
Incorrect
4
Unanswered
2
Accuracy
50.0%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
16
Incorrect
18
Unanswered
3
Accuracy
43.2%
Average
69.6%
Pathophysiology
Correct
17
Incorrect
15
Unanswered
1
Accuracy
51.5%
Average
85.4%
Pharmacology
Correct
10
Incorrect
11
Unanswered
4
Accuracy
40.0%
Average
84.0%
Prevention
Correct
7
Incorrect
5
Unanswered
0
Accuracy
58.3%
Average
89.8%
Prognosis
Correct
3
Incorrect
4
Unanswered
0
Accuracy
42.9%
Average
83.9%
Risk
Correct
4
Incorrect
7
Unanswered
2
Accuracy
30.8%
Average
83.6%
Tests
Correct
8
Incorrect
11
Unanswered
2
Accuracy
38.1%
Average
73.9%
Treatment
Correct
34
Incorrect
33
Unanswered
4
Accuracy
47.9%
Average
81.3%
#AnswerCorrectStatus
1AB
2AD
3BB
4BC
5CC
6AB
7DD
8CC
9CA
10DD
11DD
12BA
13AC
14AA
15AB
16CA
17CC
18CA
19BB
20AC
21CD
22BB
23AA
24A
25AC
26BB
27AC
28BA
29AB
30DC
31AD
32AA
33AC
34CB
35DD
36DD
37AA
38AA
39DC
40AB
41CC
42AD
43DA
44DD
45AD
46BB
47CC
48CC
49B
50BC
51AA
52AD
53AC
54B
55CC
56DD
57AA
58BA
59CA
60AA
61AA
62CD
63DD
64BAnnulled
65BD
66DC
67CB
68BAnnulled
69AA
70BB
71AB
72DD
73DB
74BC
75AB
76BA
77AD
78AC
79CB
80AA
81BC
82AC
83B
84CC
85AA
86A
87AB
88CD
89BB
90AA
91AD
92AA
93BC
94AB
95AD
96BB
97AB
98CB
99AA
100CB
101BA
102DD
103BB
104CD
105BB
106BC
107BC
108DB
109AD
110AD
111AB
112DC
113BAnnulled
114BD
115BD
116AA
117DD
118DD
119AA
120C
121AA
122AB
123CD
124AD
125BB
126DD
127CA
128B
129DD
130DC
131CC
132D
133A
134C
135A
136AD
137AA
138CC
139AA
140BC
141B
142DC
143CA
144AD
145AC
146BC
147CC
148AA
149AC
150DD
151AA
152DA
153AC
154BB
155BD
156CC
157DC
158AD
159DD
160B
161BB
162BB
163BB
164DB
165A
166DC
167AA
168B
169AC
170AA
171DD
172BB
173CA
174AB
175AA
176CC
177DC
178BB
179CC
180AAnnulled
181CB
182DD
183AC
184AA
185CC
186AD
187AA
188CC
189AD
190DD
191BB
192BB
193DC
194DC
195AC
196BB
197CA
198BB
199DD
200BA
201AB
202DD
203BB
204DD
205AD
206Annulled
207DA
208AA
209BB
210AD