MedicalBenchmark
Mistral: Mistral 7B Instruct provider

Mistral 7B Instruct

263

#263 of 291 modelsMIR 2024

Net score

61.33 pts

Accuracy

46.5%

Correct / Incorrect

93 / 95

Total Cost

$0.04

Overall Performance

(vs. average)
Accuracy

46.5%

avg: 80.5%

Net score

61.33 pts

avg: 150.85 pts

Correct

93

avg: 161

Incorrect

95

avg: 30

Total Cost

$0.04

avg: $3.32

Average response time

2.1s

avg: 16.4s

Output Tokens

75K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

93.7%

avg: 95.4%

Subject Breakdown

Allergology
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
90.5%
Anesthesiology and Resuscitation
Correct
1
Incorrect
2
Unanswered
1
Accuracy
25.0%
Average
87.1%
Cardiology
Correct
10
Incorrect
11
Unanswered
0
Accuracy
47.6%
Average
79.7%
Dermatology
Correct
6
Incorrect
7
Unanswered
1
Accuracy
42.9%
Average
80.2%
Endocrinology and Nutrition
Correct
9
Incorrect
10
Unanswered
0
Accuracy
47.4%
Average
84.2%
ENT
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
74.4%
Epidemiology
Correct
5
Incorrect
2
Unanswered
1
Accuracy
62.5%
Average
89.3%
Gastroenterology
Correct
9
Incorrect
13
Unanswered
0
Accuracy
40.9%
Average
70.5%
Genetics
Correct
3
Incorrect
4
Unanswered
0
Accuracy
42.9%
Average
86.5%
Geriatrics
Correct
6
Incorrect
3
Unanswered
1
Accuracy
60.0%
Average
86.9%
Gynecology and Obstetrics
Correct
6
Incorrect
7
Unanswered
1
Accuracy
42.9%
Average
81.2%
Health Planning and Management
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
73.2%
Hematology
Correct
7
Incorrect
5
Unanswered
1
Accuracy
53.8%
Average
81.5%
Immunology
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
89.1%
Infectious Diseases
Correct
11
Incorrect
8
Unanswered
4
Accuracy
47.8%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
14
Incorrect
7
Unanswered
0
Accuracy
66.7%
Average
80.2%
Nephrology
Correct
7
Incorrect
6
Unanswered
0
Accuracy
53.8%
Average
80.8%
Neurology
Correct
5
Incorrect
12
Unanswered
5
Accuracy
22.7%
Average
83.7%
Ophthalmology
Correct
1
Incorrect
3
Unanswered
1
Accuracy
20.0%
Average
80.0%
Palliative Care
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
88.2%
Pediatrics
Correct
7
Incorrect
9
Unanswered
1
Accuracy
41.2%
Average
82.0%
Pharmacology
Correct
13
Incorrect
8
Unanswered
2
Accuracy
56.5%
Average
85.4%
Psychiatry
Correct
6
Incorrect
3
Unanswered
1
Accuracy
60.0%
Average
89.5%
Pulmonology
Correct
8
Incorrect
9
Unanswered
2
Accuracy
42.1%
Average
80.6%
Radiology-Emergency
Correct
7
Incorrect
7
Unanswered
0
Accuracy
50.0%
Average
64.9%
Rheumatology
Correct
4
Incorrect
10
Unanswered
0
Accuracy
28.6%
Average
81.4%
Statistics
Correct
1
Incorrect
2
Unanswered
0
Accuracy
33.3%
Average
91.1%
Traumatology
Correct
5
Incorrect
10
Unanswered
0
Accuracy
33.3%
Average
74.5%
Urology
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
1
Incorrect
4
Unanswered
1
Accuracy
16.7%
Average
79.8%
Biostatistics
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
90.7%
Diagnosis
Correct
34
Incorrect
35
Unanswered
4
Accuracy
46.6%
Average
79.2%
Epidemiology
Correct
7
Incorrect
4
Unanswered
1
Accuracy
58.3%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
16
Incorrect
20
Unanswered
1
Accuracy
43.2%
Average
69.6%
Pathophysiology
Correct
19
Incorrect
12
Unanswered
2
Accuracy
57.6%
Average
85.4%
Pharmacology
Correct
9
Incorrect
12
Unanswered
4
Accuracy
36.0%
Average
84.0%
Prevention
Correct
9
Incorrect
2
Unanswered
1
Accuracy
75.0%
Average
89.8%
Prognosis
Correct
2
Incorrect
5
Unanswered
0
Accuracy
28.6%
Average
83.9%
Risk
Correct
3
Incorrect
8
Unanswered
2
Accuracy
23.1%
Average
83.6%
Tests
Correct
9
Incorrect
12
Unanswered
0
Accuracy
42.9%
Average
73.9%
Treatment
Correct
34
Incorrect
33
Unanswered
4
Accuracy
47.9%
Average
81.3%
#AnswerCorrectStatus
1AB
2AD
3BB
4BC
5CC
6DB
7DD
8CC
9CA
10DD
11DD
12AA
13DC
14BA
15DB
16AA
17CC
18BA
19BB
20AC
21CD
22DB
23AA
24AA
25AC
26BB
27AC
28AA
29AB
30AC
31AD
32AA
33AC
34BB
35DD
36BD
37AA
38AA
39DC
40DB
41CC
42AD
43AA
44AD
45AD
46BB
47CC
48CC
49B
50CC
51AA
52AD
53AC
54B
55AC
56DD
57DA
58CA
59CA
60A
61DA
62CD
63DD
64AAnnulled
65CD
66AC
67CB
68BAnnulled
69AA
70BB
71AB
72AD
73AB
74CC
75BB
76AA
77AD
78AC
79AB
80AA
81DC
82AC
83B
84CC
85AA
86DA
87BB
88DD
89CB
90AA
91AD
92A
93C
94AB
95BD
96BB
97B
98DB
99AA
100CB
101BA
102DD
103BB
104DD
105BB
106BC
107BC
108DB
109AD
110AD
111CB
112AC
113BAnnulled
114BD
115DD
116AA
117DD
118DD
119AA
120BC
121AA
122AB
123CD
124AD
125BB
126DD
127A
128AB
129BD
130DC
131CC
132BD
133CA
134BC
135BA
136AD
137AA
138CC
139AA
140BC
141BB
142DC
143AA
144DD
145BC
146BC
147CC
148AA
149AC
150DD
151AA
152DA
153BC
154BB
155D
156CC
157CC
158CD
159DD
160AB
161BB
162BB
163BB
164DB
165CA
166CC
167AA
168B
169AC
170AA
171DD
172BB
173CA
174AB
175AA
176CC
177DC
178DB
179CC
180BAnnulled
181CB
182DD
183CC
184DA
185CC
186DD
187AA
188CC
189BD
190DD
191BB
192BB
193C
194DC
195C
196BB
197AA
198BB
199CD
200BA
201AB
202BD
203BB
204DD
205AD
206BAnnulled
207DA
208AA
209CB
210DD