MedicalBenchmark
Mancer: Weaver (alpha) provider

Weaver (alpha)

286

#286 of 291 modelsMIR 2024

Net score

23.66 pts

Accuracy

28.5%

Correct / Incorrect

57 / 100

Total Cost

$0.21

Overall Performance

(vs. average)
Accuracy

28.5%

avg: 80.5%

Net score

23.66 pts

avg: 150.85 pts

Correct

57

avg: 161

Incorrect

100

avg: 30

Total Cost

$0.21

avg: $3.32

Average response time

12.5s

avg: 16.4s

Output Tokens

111K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

79.0%

avg: 95.4%

Subject Breakdown

Allergology
Correct
1
Incorrect
1
Unanswered
1
Accuracy
33.3%
Average
90.5%
Anesthesiology and Resuscitation
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
87.1%
Cardiology
Correct
4
Incorrect
14
Unanswered
3
Accuracy
19.0%
Average
79.7%
Dermatology
Correct
4
Incorrect
7
Unanswered
3
Accuracy
28.6%
Average
80.2%
Endocrinology and Nutrition
Correct
8
Incorrect
8
Unanswered
3
Accuracy
42.1%
Average
84.2%
ENT
Correct
3
Incorrect
2
Unanswered
2
Accuracy
42.9%
Average
74.4%
Epidemiology
Correct
1
Incorrect
4
Unanswered
3
Accuracy
12.5%
Average
89.3%
Gastroenterology
Correct
5
Incorrect
12
Unanswered
5
Accuracy
22.7%
Average
70.5%
Genetics
Correct
0
Incorrect
3
Unanswered
4
Accuracy
0.0%
Average
86.5%
Geriatrics
Correct
6
Incorrect
2
Unanswered
2
Accuracy
60.0%
Average
86.9%
Gynecology and Obstetrics
Correct
6
Incorrect
7
Unanswered
1
Accuracy
42.9%
Average
81.2%
Health Planning and Management
Correct
0
Incorrect
1
Unanswered
1
Accuracy
0.0%
Average
73.2%
Hematology
Correct
3
Incorrect
8
Unanswered
2
Accuracy
23.1%
Average
81.5%
Immunology
Correct
5
Incorrect
2
Unanswered
1
Accuracy
62.5%
Average
89.1%
Infectious Diseases
Correct
5
Incorrect
10
Unanswered
8
Accuracy
21.7%
Average
81.8%
Legal Medicine and Bioethics
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
91.7%
Medical Oncology
Correct
9
Incorrect
9
Unanswered
3
Accuracy
42.9%
Average
80.2%
Nephrology
Correct
4
Incorrect
8
Unanswered
1
Accuracy
30.8%
Average
80.8%
Neurology
Correct
6
Incorrect
10
Unanswered
6
Accuracy
27.3%
Average
83.7%
Ophthalmology
Correct
1
Incorrect
2
Unanswered
2
Accuracy
20.0%
Average
80.0%
Palliative Care
Correct
3
Incorrect
0
Unanswered
1
Accuracy
75.0%
Average
88.2%
Pediatrics
Correct
2
Incorrect
10
Unanswered
5
Accuracy
11.8%
Average
82.0%
Pharmacology
Correct
8
Incorrect
8
Unanswered
7
Accuracy
34.8%
Average
85.4%
Psychiatry
Correct
3
Incorrect
3
Unanswered
4
Accuracy
30.0%
Average
89.5%
Pulmonology
Correct
6
Incorrect
10
Unanswered
3
Accuracy
31.6%
Average
80.6%
Radiology-Emergency
Correct
4
Incorrect
6
Unanswered
4
Accuracy
28.6%
Average
64.9%
Rheumatology
Correct
3
Incorrect
9
Unanswered
2
Accuracy
21.4%
Average
81.4%
Statistics
Correct
0
Incorrect
2
Unanswered
1
Accuracy
0.0%
Average
91.1%
Traumatology
Correct
3
Incorrect
10
Unanswered
2
Accuracy
20.0%
Average
74.5%
Urology
Correct
1
Incorrect
2
Unanswered
3
Accuracy
16.7%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
79.8%
Biostatistics
Correct
0
Incorrect
3
Unanswered
2
Accuracy
0.0%
Average
90.7%
Diagnosis
Correct
16
Incorrect
37
Unanswered
20
Accuracy
21.9%
Average
79.2%
Epidemiology
Correct
2
Incorrect
6
Unanswered
4
Accuracy
16.7%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
12
Incorrect
18
Unanswered
7
Accuracy
32.4%
Average
69.6%
Pathophysiology
Correct
12
Incorrect
16
Unanswered
5
Accuracy
36.4%
Average
85.4%
Pharmacology
Correct
7
Incorrect
9
Unanswered
9
Accuracy
28.0%
Average
84.0%
Prevention
Correct
4
Incorrect
6
Unanswered
2
Accuracy
33.3%
Average
89.8%
Prognosis
Correct
3
Incorrect
2
Unanswered
2
Accuracy
42.9%
Average
83.9%
Risk
Correct
5
Incorrect
6
Unanswered
2
Accuracy
38.5%
Average
83.6%
Tests
Correct
6
Incorrect
12
Unanswered
3
Accuracy
28.6%
Average
73.9%
Treatment
Correct
23
Incorrect
33
Unanswered
15
Accuracy
32.4%
Average
81.3%
#AnswerCorrectStatus
1B
2CD
3CB
4AC
5CC
6B
7D
8CC
9A
10D
11D
12DA
13AC
14DA
15DB
16BA
17CC
18A
19BB
20CC
21AD
22DB
23AA
24DA
25AC
26AB
27CC
28BA
29AB
30BC
31CD
32DA
33C
34CB
35DD
36BD
37DA
38AA
39CC
40BB
41DC
42AD
43A
44AD
45D
46AB
47C
48C
49AB
50C
51DA
52CD
53BC
54CB
55CC
56AD
57BA
58A
59CA
60CA
61AA
62DD
63D
64BAnnulled
65DD
66DC
67BB
68AAnnulled
69CA
70BB
71B
72CD
73AB
74AC
75AB
76DA
77D
78BC
79CB
80DA
81AC
82AC
83BB
84AC
85A
86AA
87BB
88D
89B
90AA
91AD
92AA
93C
94DB
95AD
96BB
97AB
98DB
99A
100AB
101AA
102AD
103DB
104CD
105BB
106C
107C
108BB
109DD
110AD
111AB
112BC
113DAnnulled
114BD
115DD
116AA
117D
118AD
119CA
120AC
121AA
122CB
123BD
124D
125BB
126DD
127DA
128DB
129D
130CC
131BC
132D
133AA
134C
135BA
136AD
137A
138AC
139AA
140DC
141BB
142AC
143BA
144DD
145AC
146AC
147C
148A
149AC
150DD
151A
152DA
153AC
154AB
155CD
156CC
157DC
158DD
159DD
160BB
161BB
162BB
163DB
164DB
165AA
166AC
167AA
168BB
169C
170CA
171D
172BB
173A
174AB
175AA
176DC
177C
178B
179AC
180Annulled
181AB
182DD
183C
184AA
185CC
186CD
187CA
188C
189BD
190DD
191AB
192DB
193C
194AC
195C
196BB
197CA
198B
199CD
200BA
201BB
202DD
203BB
204CD
205D
206AAnnulled
207BA
208AA
209DB
210BD