MedicalBenchmark
Mistral: Pixtral 12B provider

Pixtral 12B

246

#246 of 291 modelsMIR 2024

Net score

98.66 pts

Accuracy

61.0%

Correct / Incorrect

122 / 70

Total Cost

$0.02

Overall Performance

(vs. average)
Accuracy

61.0%

avg: 80.5%

Net score

98.66 pts

avg: 150.85 pts

Correct

122

avg: 161

Incorrect

70

avg: 30

Total Cost

$0.02

avg: $3.32

Average response time

4.1s

avg: 16.4s

Output Tokens

86K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

95.3%

avg: 95.4%

Subject Breakdown

Allergology
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
90.5%
Anesthesiology and Resuscitation
Correct
3
Incorrect
0
Unanswered
1
Accuracy
75.0%
Average
87.1%
Cardiology
Correct
11
Incorrect
10
Unanswered
0
Accuracy
52.4%
Average
79.7%
Dermatology
Correct
11
Incorrect
3
Unanswered
0
Accuracy
78.6%
Average
80.2%
Endocrinology and Nutrition
Correct
13
Incorrect
6
Unanswered
0
Accuracy
68.4%
Average
84.2%
ENT
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
74.4%
Epidemiology
Correct
7
Incorrect
1
Unanswered
0
Accuracy
87.5%
Average
89.3%
Gastroenterology
Correct
10
Incorrect
11
Unanswered
1
Accuracy
45.5%
Average
70.5%
Genetics
Correct
5
Incorrect
1
Unanswered
1
Accuracy
71.4%
Average
86.5%
Geriatrics
Correct
7
Incorrect
3
Unanswered
0
Accuracy
70.0%
Average
86.9%
Gynecology and Obstetrics
Correct
9
Incorrect
4
Unanswered
1
Accuracy
64.3%
Average
81.2%
Health Planning and Management
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
73.2%
Hematology
Correct
9
Incorrect
3
Unanswered
1
Accuracy
69.2%
Average
81.5%
Immunology
Correct
5
Incorrect
3
Unanswered
0
Accuracy
62.5%
Average
89.1%
Infectious Diseases
Correct
17
Incorrect
4
Unanswered
2
Accuracy
73.9%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
14
Incorrect
6
Unanswered
1
Accuracy
66.7%
Average
80.2%
Nephrology
Correct
4
Incorrect
9
Unanswered
0
Accuracy
30.8%
Average
80.8%
Neurology
Correct
13
Incorrect
7
Unanswered
2
Accuracy
59.1%
Average
83.7%
Ophthalmology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
80.0%
Palliative Care
Correct
2
Incorrect
2
Unanswered
0
Accuracy
50.0%
Average
88.2%
Pediatrics
Correct
10
Incorrect
6
Unanswered
1
Accuracy
58.8%
Average
82.0%
Pharmacology
Correct
18
Incorrect
3
Unanswered
2
Accuracy
78.3%
Average
85.4%
Psychiatry
Correct
6
Incorrect
2
Unanswered
2
Accuracy
60.0%
Average
89.5%
Pulmonology
Correct
10
Incorrect
7
Unanswered
2
Accuracy
52.6%
Average
80.6%
Radiology-Emergency
Correct
6
Incorrect
8
Unanswered
0
Accuracy
42.9%
Average
64.9%
Rheumatology
Correct
8
Incorrect
6
Unanswered
0
Accuracy
57.1%
Average
81.4%
Statistics
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.1%
Traumatology
Correct
8
Incorrect
7
Unanswered
0
Accuracy
53.3%
Average
74.5%
Urology
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
79.8%
Biostatistics
Correct
5
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
90.7%
Diagnosis
Correct
41
Incorrect
29
Unanswered
3
Accuracy
56.2%
Average
79.2%
Epidemiology
Correct
7
Incorrect
4
Unanswered
1
Accuracy
58.3%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
18
Incorrect
19
Unanswered
0
Accuracy
48.6%
Average
69.6%
Pathophysiology
Correct
20
Incorrect
10
Unanswered
3
Accuracy
60.6%
Average
85.4%
Pharmacology
Correct
19
Incorrect
4
Unanswered
2
Accuracy
76.0%
Average
84.0%
Prevention
Correct
11
Incorrect
0
Unanswered
1
Accuracy
91.7%
Average
89.8%
Prognosis
Correct
5
Incorrect
2
Unanswered
0
Accuracy
71.4%
Average
83.9%
Risk
Correct
9
Incorrect
3
Unanswered
1
Accuracy
69.2%
Average
83.6%
Tests
Correct
9
Incorrect
12
Unanswered
0
Accuracy
42.9%
Average
73.9%
Treatment
Correct
47
Incorrect
23
Unanswered
1
Accuracy
66.2%
Average
81.3%
#AnswerCorrectStatus
1AB
2CD
3DB
4CC
5CC
6BB
7CD
8CC
9BA
10DD
11DD
12BA
13DC
14DA
15CB
16CA
17CC
18AA
19BB
20CC
21CD
22DB
23AA
24BA
25AC
26B
27CC
28DA
29AB
30DC
31AD
32BA
33CC
34BB
35DD
36BD
37AA
38DA
39CC
40BB
41CC
42AD
43AA
44DD
45DD
46BB
47CC
48CC
49B
50CC
51DA
52DD
53C
54BB
55CC
56DD
57AA
58BA
59CA
60DA
61AA
62BD
63DD
64AAnnulled
65DD
66CC
67CB
68BAnnulled
69AA
70BB
71BB
72DD
73CB
74CC
75B
76DA
77AD
78CC
79DB
80AA
81CC
82DC
83BB
84CC
85AA
86AA
87BB
88CD
89B
90AA
91DD
92AA
93CC
94BB
95DD
96BB
97BB
98DB
99AA
100DB
101AA
102DD
103BB
104CD
105BB
106DC
107CC
108BB
109AD
110AD
111BB
112CC
113BAnnulled
114BD
115DD
116AA
117DD
118BD
119CA
120CC
121AA
122DB
123DD
124DD
125CB
126DD
127AA
128DB
129DD
130AC
131AC
132DD
133CA
134CC
135AA
136DD
137AA
138CC
139AA
140CC
141DB
142DC
143AA
144AD
145DC
146BC
147BC
148AA
149DC
150DD
151A
152A
153CC
154BB
155DD
156CC
157CC
158DD
159DD
160BB
161BB
162DB
163BB
164BB
165BA
166DC
167DA
168BB
169CC
170AA
171DD
172BB
173CA
174BB
175AA
176CC
177CC
178BB
179CC
180BAnnulled
181CB
182DD
183AC
184AA
185DC
186DD
187CA
188CC
189BD
190DD
191BB
192BB
193AC
194DC
195C
196BB
197CA
198BB
199DD
200AA
201BB
202DD
203CB
204DD
205D
206BAnnulled
207DA
208DA
209CB
210AD