MedicalBenchmark
Nous: Hermes 3 70B Instruct provider

Hermes 3 70B Instruct

265

#265 of 291 modelsMIR 2024

Net score

59.33 pts

Accuracy

39.0%

Correct / Incorrect

78 / 56

Total Cost

$0.53

Overall Performance

(vs. average)
Accuracy

39.0%

avg: 80.5%

Net score

59.33 pts

avg: 150.85 pts

Correct

78

avg: 161

Incorrect

56

avg: 30

Total Cost

$0.53

avg: $3.32

Average response time

177.4s

avg: 16.4s

Output Tokens

1.7M

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

64.8%

avg: 95.4%

Subject Breakdown

Allergology
Correct
0
Incorrect
2
Unanswered
1
Accuracy
0.0%
Average
90.5%
Anesthesiology and Resuscitation
Correct
1
Incorrect
2
Unanswered
1
Accuracy
25.0%
Average
87.1%
Cardiology
Correct
6
Incorrect
8
Unanswered
7
Accuracy
28.6%
Average
79.7%
Dermatology
Correct
3
Incorrect
2
Unanswered
9
Accuracy
21.4%
Average
80.2%
Endocrinology and Nutrition
Correct
10
Incorrect
5
Unanswered
4
Accuracy
52.6%
Average
84.2%
ENT
Correct
0
Incorrect
3
Unanswered
4
Accuracy
0.0%
Average
74.4%
Epidemiology
Correct
2
Incorrect
1
Unanswered
5
Accuracy
25.0%
Average
89.3%
Gastroenterology
Correct
8
Incorrect
8
Unanswered
6
Accuracy
36.4%
Average
70.5%
Genetics
Correct
7
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
86.5%
Geriatrics
Correct
4
Incorrect
2
Unanswered
4
Accuracy
40.0%
Average
86.9%
Gynecology and Obstetrics
Correct
8
Incorrect
3
Unanswered
3
Accuracy
57.1%
Average
81.2%
Health Planning and Management
Correct
0
Incorrect
0
Unanswered
2
Accuracy
0.0%
Average
73.2%
Hematology
Correct
5
Incorrect
3
Unanswered
5
Accuracy
38.5%
Average
81.5%
Immunology
Correct
3
Incorrect
4
Unanswered
1
Accuracy
37.5%
Average
89.1%
Infectious Diseases
Correct
10
Incorrect
7
Unanswered
6
Accuracy
43.5%
Average
81.8%
Legal Medicine and Bioethics
Correct
0
Incorrect
0
Unanswered
2
Accuracy
0.0%
Average
91.7%
Medical Oncology
Correct
9
Incorrect
5
Unanswered
7
Accuracy
42.9%
Average
80.2%
Nephrology
Correct
5
Incorrect
4
Unanswered
4
Accuracy
38.5%
Average
80.8%
Neurology
Correct
9
Incorrect
5
Unanswered
8
Accuracy
40.9%
Average
83.7%
Ophthalmology
Correct
1
Incorrect
1
Unanswered
3
Accuracy
20.0%
Average
80.0%
Palliative Care
Correct
2
Incorrect
0
Unanswered
2
Accuracy
50.0%
Average
88.2%
Pediatrics
Correct
8
Incorrect
5
Unanswered
4
Accuracy
47.1%
Average
82.0%
Pharmacology
Correct
11
Incorrect
7
Unanswered
5
Accuracy
47.8%
Average
85.4%
Psychiatry
Correct
7
Incorrect
1
Unanswered
2
Accuracy
70.0%
Average
89.5%
Pulmonology
Correct
5
Incorrect
4
Unanswered
10
Accuracy
26.3%
Average
80.6%
Radiology-Emergency
Correct
5
Incorrect
4
Unanswered
5
Accuracy
35.7%
Average
64.9%
Rheumatology
Correct
7
Incorrect
3
Unanswered
4
Accuracy
50.0%
Average
81.4%
Statistics
Correct
1
Incorrect
0
Unanswered
2
Accuracy
33.3%
Average
91.1%
Traumatology
Correct
6
Incorrect
5
Unanswered
4
Accuracy
40.0%
Average
74.5%
Urology
Correct
2
Incorrect
3
Unanswered
1
Accuracy
33.3%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
3
Incorrect
1
Unanswered
2
Accuracy
50.0%
Average
79.8%
Biostatistics
Correct
1
Incorrect
1
Unanswered
3
Accuracy
20.0%
Average
90.7%
Diagnosis
Correct
33
Incorrect
16
Unanswered
24
Accuracy
45.2%
Average
79.2%
Epidemiology
Correct
3
Incorrect
2
Unanswered
7
Accuracy
25.0%
Average
81.2%
Ethics
Correct
0
Incorrect
0
Unanswered
1
Accuracy
0.0%
Average
94.5%
Interpretation
Correct
13
Incorrect
12
Unanswered
12
Accuracy
35.1%
Average
69.6%
Pathophysiology
Correct
20
Incorrect
7
Unanswered
6
Accuracy
60.6%
Average
85.4%
Pharmacology
Correct
10
Incorrect
8
Unanswered
7
Accuracy
40.0%
Average
84.0%
Prevention
Correct
4
Incorrect
3
Unanswered
5
Accuracy
33.3%
Average
89.8%
Prognosis
Correct
4
Incorrect
0
Unanswered
3
Accuracy
57.1%
Average
83.9%
Risk
Correct
5
Incorrect
3
Unanswered
5
Accuracy
38.5%
Average
83.6%
Tests
Correct
8
Incorrect
6
Unanswered
7
Accuracy
38.1%
Average
73.9%
Treatment
Correct
20
Incorrect
27
Unanswered
24
Accuracy
28.2%
Average
81.3%
#AnswerCorrectStatus
1BB
2DD
3B
4C
5DC
6DB
7D
8C
9AA
10D
11AD
12A
13DC
14BA
15B
16AA
17CC
18A
19BB
20DC
21DD
22BB
23A
24CA
25C
26BB
27AC
28A
29BB
30C
31DD
32BA
33CC
34BB
35DD
36DD
37AA
38BA
39C
40B
41DC
42CD
43A
44D
45D
46BB
47AC
48AC
49B
50CC
51A
52DD
53CC
54AB
55AC
56D
57AA
58A
59A
60AA
61CA
62D
63D
64AAnnulled
65AD
66CC
67CB
68AAnnulled
69AA
70BB
71B
72D
73AB
74CC
75BB
76AA
77DD
78CC
79B
80AA
81C
82C
83BB
84CC
85AA
86A
87BB
88DD
89BB
90A
91AD
92A
93AC
94BB
95DD
96B
97AB
98DB
99A
100B
101DA
102D
103AB
104D
105CB
106CC
107CC
108B
109DD
110AD
111B
112C
113BAnnulled
114BD
115AD
116AA
117DD
118D
119CA
120C
121AA
122AB
123D
124AD
125B
126D
127A
128DB
129DD
130AC
131AC
132DD
133BA
134C
135AA
136BD
137DA
138CC
139A
140AC
141BB
142C
143DA
144BD
145C
146C
147AC
148CA
149C
150DD
151AA
152AA
153AC
154BB
155DD
156DC
157C
158DD
159D
160AB
161BB
162AB
163BB
164DB
165A
166CC
167AA
168BB
169CC
170CA
171D
172BB
173DA
174BB
175AA
176C
177AC
178BB
179CC
180AAnnulled
181B
182D
183CC
184AA
185C
186D
187AA
188CC
189CD
190AD
191B
192B
193CC
194DC
195CC
196BB
197AA
198CB
199D
200A
201BB
202DD
203B
204DD
205D
206CAnnulled
207AA
208A
209DB
210DD