MedicalBenchmark
Mistral: Mixtral 8x7B Instruct provider

Mixtral 8x7B Instruct

238

#238 of 291 modelsMIR 2024

Net score

111.66 pts

Accuracy

65.5%

Correct / Incorrect

131 / 58

Total Cost

$0.11

Overall Performance

(vs. average)
Accuracy

65.5%

avg: 80.5%

Net score

111.66 pts

avg: 150.85 pts

Correct

131

avg: 161

Incorrect

58

avg: 30

Total Cost

$0.11

avg: $3.32

Average response time

9.9s

avg: 16.4s

Output Tokens

84K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

94.1%

avg: 95.4%

Subject Breakdown

Allergology
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
90.5%
Anesthesiology and Resuscitation
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
87.1%
Cardiology
Correct
14
Incorrect
7
Unanswered
0
Accuracy
66.7%
Average
79.7%
Dermatology
Correct
11
Incorrect
3
Unanswered
0
Accuracy
78.6%
Average
80.2%
Endocrinology and Nutrition
Correct
12
Incorrect
6
Unanswered
1
Accuracy
63.2%
Average
84.2%
ENT
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
74.4%
Epidemiology
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
89.3%
Gastroenterology
Correct
14
Incorrect
8
Unanswered
0
Accuracy
63.6%
Average
70.5%
Genetics
Correct
5
Incorrect
1
Unanswered
1
Accuracy
71.4%
Average
86.5%
Geriatrics
Correct
6
Incorrect
2
Unanswered
2
Accuracy
60.0%
Average
86.9%
Gynecology and Obstetrics
Correct
8
Incorrect
5
Unanswered
1
Accuracy
57.1%
Average
81.2%
Health Planning and Management
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
73.2%
Hematology
Correct
6
Incorrect
6
Unanswered
1
Accuracy
46.2%
Average
81.5%
Immunology
Correct
5
Incorrect
2
Unanswered
1
Accuracy
62.5%
Average
89.1%
Infectious Diseases
Correct
15
Incorrect
6
Unanswered
2
Accuracy
65.2%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
15
Incorrect
5
Unanswered
1
Accuracy
71.4%
Average
80.2%
Nephrology
Correct
6
Incorrect
7
Unanswered
0
Accuracy
46.2%
Average
80.8%
Neurology
Correct
15
Incorrect
4
Unanswered
3
Accuracy
68.2%
Average
83.7%
Ophthalmology
Correct
2
Incorrect
2
Unanswered
1
Accuracy
40.0%
Average
80.0%
Palliative Care
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
88.2%
Pediatrics
Correct
10
Incorrect
6
Unanswered
1
Accuracy
58.8%
Average
82.0%
Pharmacology
Correct
17
Incorrect
5
Unanswered
1
Accuracy
73.9%
Average
85.4%
Psychiatry
Correct
9
Incorrect
1
Unanswered
0
Accuracy
90.0%
Average
89.5%
Pulmonology
Correct
12
Incorrect
5
Unanswered
2
Accuracy
63.2%
Average
80.6%
Radiology-Emergency
Correct
8
Incorrect
6
Unanswered
0
Accuracy
57.1%
Average
64.9%
Rheumatology
Correct
9
Incorrect
3
Unanswered
2
Accuracy
64.3%
Average
81.4%
Statistics
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.1%
Traumatology
Correct
9
Incorrect
3
Unanswered
3
Accuracy
60.0%
Average
74.5%
Urology
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
79.8%
Biostatistics
Correct
4
Incorrect
1
Unanswered
0
Accuracy
80.0%
Average
90.7%
Diagnosis
Correct
51
Incorrect
18
Unanswered
4
Accuracy
69.9%
Average
79.2%
Epidemiology
Correct
7
Incorrect
4
Unanswered
1
Accuracy
58.3%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
20
Incorrect
16
Unanswered
1
Accuracy
54.1%
Average
69.6%
Pathophysiology
Correct
21
Incorrect
8
Unanswered
4
Accuracy
63.6%
Average
85.4%
Pharmacology
Correct
16
Incorrect
6
Unanswered
3
Accuracy
64.0%
Average
84.0%
Prevention
Correct
9
Incorrect
3
Unanswered
0
Accuracy
75.0%
Average
89.8%
Prognosis
Correct
3
Incorrect
4
Unanswered
0
Accuracy
42.9%
Average
83.9%
Risk
Correct
8
Incorrect
5
Unanswered
0
Accuracy
61.5%
Average
83.6%
Tests
Correct
11
Incorrect
7
Unanswered
3
Accuracy
52.4%
Average
73.9%
Treatment
Correct
47
Incorrect
21
Unanswered
3
Accuracy
66.2%
Average
81.3%
#AnswerCorrectStatus
1AB
2CD
3DB
4BC
5CC
6BB
7DD
8CC
9CA
10DD
11DD
12AA
13CC
14BA
15CB
16CA
17CC
18AA
19CB
20CC
21DD
22CB
23AA
24DA
25CC
26BB
27AC
28DA
29BB
30BC
31AD
32BA
33CC
34BB
35DD
36BD
37AA
38A
39CC
40BB
41CC
42DD
43DA
44DD
45DD
46BB
47CC
48CC
49BB
50CC
51CA
52DD
53CC
54BB
55CC
56DD
57AA
58BA
59A
60CA
61AA
62DD
63DD
64DAnnulled
65DD
66AC
67DB
68BAnnulled
69AA
70BB
71BB
72AD
73CB
74CC
75BB
76AA
77DD
78CC
79DB
80AA
81CC
82DC
83BB
84CC
85AA
86A
87BB
88DD
89BB
90AA
91DD
92AA
93AC
94BB
95AD
96BB
97B
98B
99A
100BB
101AA
102DD
103BB
104DD
105DB
106BC
107CC
108DB
109DD
110CD
111CB
112CC
113DAnnulled
114DD
115DD
116DA
117DD
118DD
119CA
120CC
121AA
122BB
123DD
124DD
125CB
126DD
127AA
128DB
129DD
130DC
131CC
132DD
133CA
134CC
135AA
136DD
137AA
138CC
139AA
140CC
141BB
142AC
143CA
144DD
145DC
146BC
147CC
148AA
149AC
150DD
151A
152AA
153AC
154BB
155D
156AC
157CC
158DD
159DD
160B
161BB
162BB
163BB
164DB
165AA
166CC
167CA
168BB
169CC
170CA
171DD
172BB
173CA
174BB
175AA
176CC
177CC
178BB
179C
180AAnnulled
181CB
182DD
183CC
184AA
185CC
186DD
187AA
188BC
189AD
190DD
191BB
192DB
193CC
194DC
195C
196BB
197AA
198BB
199CD
200AA
201BB
202DD
203CB
204DD
205DD
206BAnnulled
207BA
208AA
209AB
210AD