MedicalBenchmark

Mistral 7B Instruct v0.1

#273 of 291 models (MIR 2024)

Net score: 45.00 pts
Accuracy: 37.5%
Correct / Incorrect: 75 / 90
Total Cost: $0.03
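The headline figures are consistent with negative marking: 75 correct answers minus one third of a point for each of the 90 incorrect ones gives 45.00, and 75 correct out of 200 scored questions gives 37.5%. The sketch below reproduces that arithmetic. The one-third penalty and the 200-question denominator are assumptions inferred from the reported numbers, not something the page states, and `net_score` and `accuracy` are hypothetical helper names.

```python
from fractions import Fraction


def net_score(correct: int, incorrect: int,
              penalty: Fraction = Fraction(1, 3)) -> float:
    """Points after negative marking.

    The default penalty of 1/3 point per wrong answer is an assumption
    that happens to reproduce the reported net score.
    """
    return float(correct - penalty * incorrect)


def accuracy(correct: int, scored_questions: int) -> float:
    """Percentage of scored questions answered correctly."""
    return 100.0 * correct / scored_questions


# Figures taken from the summary card; 200 scored questions is assumed.
score = net_score(correct=75, incorrect=90)
acc = accuracy(correct=75, scored_questions=200)
print(f"{score:.2f} pts, {acc:.1f}%")  # prints "45.00 pts, 37.5%"
```

Using `Fraction` for the penalty keeps the subtraction exact, so the result does not depend on how 1/3 rounds in floating point.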

Overall Performance (vs. average)

| Metric | This model | Average |
|---|---|---|
| Accuracy | 37.5% | 80.5% |
| Net score | 45.00 pts | 150.85 pts |
| Correct | 75 | 161 |
| Incorrect | 90 | 30 |
| Total Cost | $0.03 | $3.32 |
| Average response time | 22.4s | 16.4s |
| Output Tokens | 72K | 427K |
| Reasoning Tokens | 0 | 310K |
| Average confidence | 84.0% | 95.4% |

Subject Breakdown

| Subject | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Allergology | 1 | 0 | 2 | 33.3% | 90.5% |
| Anesthesiology and Resuscitation | 0 | 3 | 1 | 0.0% | 87.1% |
| Cardiology | 9 | 12 | 0 | 42.9% | 79.7% |
| Dermatology | 4 | 9 | 1 | 28.6% | 80.2% |
| Endocrinology and Nutrition | 7 | 9 | 3 | 36.8% | 84.2% |
| ENT | 5 | 2 | 0 | 71.4% | 74.4% |
| Epidemiology | 3 | 4 | 1 | 37.5% | 89.3% |
| Gastroenterology | 7 | 12 | 3 | 31.8% | 70.5% |
| Genetics | 1 | 4 | 2 | 14.3% | 86.5% |
| Geriatrics | 4 | 5 | 1 | 40.0% | 86.9% |
| Gynecology and Obstetrics | 5 | 2 | 7 | 35.7% | 81.2% |
| Health Planning and Management | 0 | 1 | 1 | 0.0% | 73.2% |
| Hematology | 5 | 4 | 4 | 38.5% | 81.5% |
| Immunology | 5 | 1 | 2 | 62.5% | 89.1% |
| Infectious Diseases | 8 | 13 | 2 | 34.8% | 81.8% |
| Legal Medicine and Bioethics | 1 | 1 | 0 | 50.0% | 91.7% |
| Medical Oncology | 13 | 8 | 0 | 61.9% | 80.2% |
| Nephrology | 4 | 6 | 3 | 30.8% | 80.8% |
| Neurology | 10 | 5 | 7 | 45.5% | 83.7% |
| Ophthalmology | 2 | 3 | 0 | 40.0% | 80.0% |
| Palliative Care | 2 | 1 | 1 | 50.0% | 88.2% |
| Pediatrics | 4 | 10 | 3 | 23.5% | 82.0% |
| Pharmacology | 7 | 11 | 5 | 30.4% | 85.4% |
| Psychiatry | 5 | 2 | 3 | 50.0% | 89.5% |
| Pulmonology | 7 | 9 | 3 | 36.8% | 80.6% |
| Radiology-Emergency | 7 | 6 | 1 | 50.0% | 64.9% |
| Rheumatology | 5 | 7 | 2 | 35.7% | 81.4% |
| Statistics | 0 | 3 | 0 | 0.0% | 91.1% |
| Traumatology | 8 | 4 | 3 | 53.3% | 74.5% |
| Urology | 2 | 4 | 0 | 33.3% | 78.2% |
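
The per-subject accuracy values match correct answers divided by all questions tagged with that subject, unanswered ones included (for example, Cardiology: 9 of 21 is 42.9%). A minimal sketch of that calculation follows, using three rows from the breakdown. The function names are illustrative and the gap-to-average helper is an added convenience, not something the benchmark reports.

```python
def subject_accuracy(correct: int, incorrect: int, unanswered: int) -> float:
    """Accuracy in percent, counting unanswered questions in the denominator."""
    total = correct + incorrect + unanswered
    if total == 0:
        return 0.0
    return round(100.0 * correct / total, 1)


def gap_to_average(model_acc: float, average_acc: float) -> float:
    """Difference in percentage points between this model and the average."""
    return round(model_acc - average_acc, 1)


# (correct, incorrect, unanswered, average accuracy) copied from the breakdown.
rows = {
    "Allergology": (1, 0, 2, 90.5),
    "Cardiology": (9, 12, 0, 79.7),
    "Gynecology and Obstetrics": (5, 2, 7, 81.2),
}

for subject, (c, i, u, avg) in rows.items():
    acc = subject_accuracy(c, i, u)
    print(f"{subject}: {acc}% ({gap_to_average(acc, avg):+.1f} pts vs. average)")
```

Counting unanswered questions in the denominator is what separates, say, Gynecology and Obstetrics (5 correct, 2 incorrect, 7 unanswered, 35.7%) from a much higher figure computed over attempted questions only.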

Question Type Breakdown

| Question type | Correct | Incorrect | Unanswered | Accuracy | Average |
|---|---|---|---|---|---|
| Anatomy | 4 | 2 | 0 | 66.7% | 79.8% |
| Biostatistics | 1 | 3 | 1 | 20.0% | 90.7% |
| Diagnosis | 28 | 35 | 10 | 38.4% | 79.2% |
| Epidemiology | 4 | 6 | 2 | 33.3% | 81.2% |
| Ethics | 1 | 0 | 0 | 100.0% | 94.5% |
| Interpretation | 16 | 15 | 6 | 43.2% | 69.6% |
| Pathophysiology | 16 | 12 | 5 | 48.5% | 85.4% |
| Pharmacology | 6 | 12 | 7 | 24.0% | 84.0% |
| Prevention | 6 | 4 | 2 | 50.0% | 89.8% |
| Prognosis | 2 | 3 | 2 | 28.6% | 83.9% |
| Risk | 3 | 9 | 1 | 23.1% | 83.6% |
| Tests | 7 | 10 | 4 | 33.3% | 73.9% |
| Treatment | 24 | 30 | 17 | 33.8% | 81.3% |
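
| # | Answer | Correct | Status |
|---|---|---|---|
| 1 | A | B | |
| 2 | A | D | |
| 3 | B | B | |
| 4 | B | C | |
| 5 | C | C | |
| 6 | D | B | |
| 7 | D | D | |
| 8 | C | C | |
| 9 | B | A | |
| 10 | D | D | |
| 11 | D | D | |
| 12 | B | A | |
| 13 | D | C | |
| 14 | B | A | |
| 15 | D | B | |
| 16 | | A | |
| 17 | C | C | |
| 18 | C | A | |
| 19 | B | B | |
| 20 | C | C | |
| 21 | D | D | |
| 22 | | B | |
| 23 | A | A | |
| 24 | D | A | |
| 25 | A | C | |
| 26 | B | B | |
| 27 | A | C | |
| 28 | A | A | |
| 29 | B | B | |
| 30 | D | C | |
| 31 | A | D | |
| 32 | A | A | |
| 33 | | C | |
| 34 | D | B | |
| 35 | D | D | |
| 36 | | D | |
| 37 | A | A | |
| 38 | A | A | |
| 39 | C | C | |
| 40 | D | B | |
| 41 | D | C | |
| 42 | | D | |
| 43 | | A | |
| 44 | B | D | |
| 45 | A | D | |
| 46 | C | B | |
| 47 | C | C | |
| 48 | C | C | |
| 49 | D | B | |
| 50 | | C | |
| 51 | C | A | |
| 52 | A | D | |
| 53 | A | C | |
| 54 | C | B | |
| 55 | C | C | |
| 56 | D | D | |
| 57 | A | A | |
| 58 | B | A | |
| 59 | C | A | |
| 60 | A | A | |
| 61 | A | A | |
| 62 | C | D | |
| 63 | D | D | |
| 64 | | A | Annulled |
| 65 | D | D | |
| 66 | | C | |
| 67 | | B | |
| 68 | | B | Annulled |
| 69 | A | A | |
| 70 | B | B | |
| 71 | | B | |
| 72 | | D | |
| 73 | A | B | |
| 74 | C | C | |
| 75 | A | B | |
| 76 | | A | |
| 77 | A | D | |
| 78 | A | C | |
| 79 | | B | |
| 80 | A | A | |
| 81 | B | C | |
| 82 | | C | |
| 83 | B | B | |
| 84 | C | C | |
| 85 | A | A | |
| 86 | A | A | |
| 87 | B | B | |
| 88 | C | D | |
| 89 | C | B | |
| 90 | A | A | |
| 91 | A | D | |
| 92 | A | A | |
| 93 | A | C | |
| 94 | | B | |
| 95 | | D | |
| 96 | | B | |
| 97 | A | B | |
| 98 | B | B | |
| 99 | A | A | |
| 100 | D | B | |
| 101 | B | A | |
| 102 | D | D | |
| 103 | | B | |
| 104 | C | D | |
| 105 | B | B | |
| 106 | B | C | |
| 107 | C | C | |
| 108 | A | B | |
| 109 | A | D | |
| 110 | A | D | |
| 111 | A | B | |
| 112 | C | C | |
| 113 | | B | Annulled |
| 114 | B | D | |
| 115 | A | D | |
| 116 | A | A | |
| 117 | D | D | |
| 118 | D | D | |
| 119 | D | A | |
| 120 | C | C | |
| 121 | A | A | |
| 122 | A | B | |
| 123 | | D | |
| 124 | A | D | |
| 125 | B | B | |
| 126 | D | D | |
| 127 | C | A | |
| 128 | | B | |
| 129 | B | D | |
| 130 | D | C | |
| 131 | | C | |
| 132 | C | D | |
| 133 | C | A | |
| 134 | B | C | |
| 135 | A | A | |
| 136 | A | D | |
| 137 | A | A | |
| 138 | C | C | |
| 139 | A | A | |
| 140 | B | C | |
| 141 | B | B | |
| 142 | | C | |
| 143 | C | A | |
| 144 | A | D | |
| 145 | | C | |
| 146 | A | C | |
| 147 | C | C | |
| 148 | B | A | |
| 149 | | C | |
| 150 | C | D | |
| 151 | A | A | |
| 152 | | A | |
| 153 | A | C | |
| 154 | B | B | |
| 155 | B | D | |
| 156 | C | C | |
| 157 | D | C | |
| 158 | A | D | |
| 159 | D | D | |
| 160 | | B | |
| 161 | B | B | |
| 162 | A | B | |
| 163 | B | B | |
| 164 | | B | |
| 165 | C | A | |
| 166 | D | C | |
| 167 | | A | |
| 168 | B | B | |
| 169 | B | C | |
| 170 | C | A | |
| 171 | A | D | |
| 172 | B | B | |
| 173 | B | A | |
| 174 | | B | |
| 175 | A | A | |
| 176 | C | C | |
| 177 | | C | |
| 178 | D | B | |
| 179 | C | C | |
| 180 | | B | Annulled |
| 181 | C | B | |
| 182 | D | D | |
| 183 | A | C | |
| 184 | | A | |
| 185 | C | C | |
| 186 | A | D | |
| 187 | | A | |
| 188 | B | C | |
| 189 | B | D | |
| 190 | A | D | |
| 191 | B | B | |
| 192 | | B | |
| 193 | A | C | |
| 194 | A | C | |
| 195 | | C | |
| 196 | B | B | |
| 197 | A | A | |
| 198 | | B | |
| 199 | C | D | |
| 200 | A | A | |
| 201 | A | B | |
| 202 | B | D | |
| 203 | C | B | |
| 204 | C | D | |
| 205 | A | D | |
| 206 | | B | Annulled |
| 207 | A | A | |
| 208 | A | A | |
| 209 | A | B | |
| 210 | A | D | |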
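
Each row of the answer table can be classified as correct, incorrect, unanswered or annulled. The sketch below does this for four rows taken from the table (questions 1, 3, 16 and 64). It assumes that a row listing only one letter records the answer key with no model answer, which is an inference from the totals rather than something the page states, and `classify` is a hypothetical helper name.

```python
from collections import Counter
from typing import Optional


def classify(answer: Optional[str], correct: Optional[str], status: str = "") -> str:
    """Label one row of the answer table."""
    if status == "Annulled":
        # Annulled questions are set aside whatever the letters say.
        return "annulled"
    if answer is None:
        # Assumption: a missing model answer means the question was skipped.
        return "unanswered"
    return "correct" if answer == correct else "incorrect"


# (question number, model answer, answer key, status) for four sample rows.
sample_rows = [
    (1, "A", "B", ""),
    (3, "B", "B", ""),
    (16, None, "A", ""),
    (64, None, "A", "Annulled"),
]

tally = Counter(classify(answer, key, status)
                for _, answer, key, status in sample_rows)
print(dict(tally))
```

Checking the status before comparing letters means an annulled question is never counted as a hit or a miss, whichever column its single letter belongs to.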