MedicalBenchmark
NousResearch: Hermes 2 Pro - Llama-3 8B provider

Hermes 2 Pro - Llama-3 8B

281

#281 of 291 modelsMIR 2024

Net score

37.00 pts

Accuracy

36.0%

Correct / Incorrect

72 / 105

Total Cost

$0.03

Overall Performance

(vs. average)
Accuracy

36.0%

avg: 80.5%

Net score

37.00 pts

avg: 150.85 pts

Correct

72

avg: 161

Incorrect

105

avg: 30

Total Cost

$0.03

avg: $3.32

Average response time

4.0s

avg: 16.4s

Output Tokens

86K

avg: 427K

Reasoning Tokens

0

avg: 310K

Average confidence

84.1%

avg: 95.4%

Subject Breakdown

Allergology
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
90.5%
Anesthesiology and Resuscitation
Correct
2
Incorrect
1
Unanswered
1
Accuracy
50.0%
Average
87.1%
Cardiology
Correct
7
Incorrect
12
Unanswered
2
Accuracy
33.3%
Average
79.7%
Dermatology
Correct
4
Incorrect
8
Unanswered
2
Accuracy
28.6%
Average
80.2%
Endocrinology and Nutrition
Correct
9
Incorrect
9
Unanswered
1
Accuracy
47.4%
Average
84.2%
ENT
Correct
3
Incorrect
4
Unanswered
0
Accuracy
42.9%
Average
74.4%
Epidemiology
Correct
3
Incorrect
3
Unanswered
2
Accuracy
37.5%
Average
89.3%
Gastroenterology
Correct
5
Incorrect
14
Unanswered
3
Accuracy
22.7%
Average
70.5%
Genetics
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
86.5%
Geriatrics
Correct
4
Incorrect
6
Unanswered
0
Accuracy
40.0%
Average
86.9%
Gynecology and Obstetrics
Correct
5
Incorrect
5
Unanswered
4
Accuracy
35.7%
Average
81.2%
Health Planning and Management
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
73.2%
Hematology
Correct
3
Incorrect
10
Unanswered
0
Accuracy
23.1%
Average
81.5%
Immunology
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
89.1%
Infectious Diseases
Correct
10
Incorrect
12
Unanswered
1
Accuracy
43.5%
Average
81.8%
Legal Medicine and Bioethics
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
91.7%
Medical Oncology
Correct
7
Incorrect
13
Unanswered
1
Accuracy
33.3%
Average
80.2%
Nephrology
Correct
2
Incorrect
8
Unanswered
3
Accuracy
15.4%
Average
80.8%
Neurology
Correct
11
Incorrect
8
Unanswered
3
Accuracy
50.0%
Average
83.7%
Ophthalmology
Correct
1
Incorrect
4
Unanswered
0
Accuracy
20.0%
Average
80.0%
Palliative Care
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
88.2%
Pediatrics
Correct
6
Incorrect
8
Unanswered
3
Accuracy
35.3%
Average
82.0%
Pharmacology
Correct
7
Incorrect
13
Unanswered
3
Accuracy
30.4%
Average
85.4%
Psychiatry
Correct
6
Incorrect
3
Unanswered
1
Accuracy
60.0%
Average
89.5%
Pulmonology
Correct
12
Incorrect
7
Unanswered
0
Accuracy
63.2%
Average
80.6%
Radiology-Emergency
Correct
5
Incorrect
7
Unanswered
2
Accuracy
35.7%
Average
64.9%
Rheumatology
Correct
5
Incorrect
7
Unanswered
2
Accuracy
35.7%
Average
81.4%
Statistics
Correct
1
Incorrect
1
Unanswered
1
Accuracy
33.3%
Average
91.1%
Traumatology
Correct
2
Incorrect
12
Unanswered
1
Accuracy
13.3%
Average
74.5%
Urology
Correct
0
Incorrect
5
Unanswered
1
Accuracy
0.0%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
1
Incorrect
5
Unanswered
0
Accuracy
16.7%
Average
79.8%
Biostatistics
Correct
3
Incorrect
1
Unanswered
1
Accuracy
60.0%
Average
90.7%
Diagnosis
Correct
25
Incorrect
39
Unanswered
9
Accuracy
34.2%
Average
79.2%
Epidemiology
Correct
3
Incorrect
7
Unanswered
2
Accuracy
25.0%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
11
Incorrect
20
Unanswered
6
Accuracy
29.7%
Average
69.6%
Pathophysiology
Correct
13
Incorrect
16
Unanswered
4
Accuracy
39.4%
Average
85.4%
Pharmacology
Correct
9
Incorrect
15
Unanswered
1
Accuracy
36.0%
Average
84.0%
Prevention
Correct
7
Incorrect
4
Unanswered
1
Accuracy
58.3%
Average
89.8%
Prognosis
Correct
3
Incorrect
3
Unanswered
1
Accuracy
42.9%
Average
83.9%
Risk
Correct
5
Incorrect
7
Unanswered
1
Accuracy
38.5%
Average
83.6%
Tests
Correct
6
Incorrect
12
Unanswered
3
Accuracy
28.6%
Average
73.9%
Treatment
Correct
29
Incorrect
36
Unanswered
6
Accuracy
40.8%
Average
81.3%
#AnswerCorrectStatus
1BB
2CD
3DB
4AC
5CC
6BB
7DD
8CC
9A
10AD
11DD
12BA
13C
14BA
15DB
16CA
17C
18A
19B
20CC
21CD
22CB
23BA
24DA
25DC
26CB
27AC
28A
29AB
30DC
31BD
32CA
33BC
34CB
35DD
36BD
37AA
38AA
39CC
40B
41DC
42BD
43A
44BD
45D
46BB
47CC
48CC
49DB
50BC
51DA
52CD
53C
54DB
55CC
56BD
57DA
58DA
59CA
60DA
61CA
62DD
63CD
64AAnnulled
65DD
66AC
67DB
68CAnnulled
69AA
70BB
71B
72BD
73DB
74CC
75BB
76DA
77DD
78CC
79B
80AA
81BC
82CC
83BB
84DC
85AA
86CA
87CB
88AD
89BB
90AA
91DD
92AA
93BC
94BB
95BD
96BB
97DB
98CB
99CA
100CB
101CA
102DD
103B
104CD
105DB
106C
107CC
108BB
109BD
110D
111AB
112BC
113Annulled
114DD
115D
116DA
117DD
118DD
119DA
120CC
121AA
122DB
123DD
124DD
125CB
126DD
127AA
128DB
129DD
130DC
131CC
132CD
133A
134BC
135BA
136DD
137DA
138CC
139DA
140CC
141DB
142CC
143DA
144D
145CC
146BC
147DC
148DA
149AC
150D
151AA
152A
153AC
154BB
155BD
156AC
157DC
158DD
159DD
160DB
161BB
162BB
163DB
164DB
165CA
166DC
167AA
168CB
169BC
170BA
171DD
172BB
173DA
174CB
175AA
176DC
177BC
178DB
179C
180DAnnulled
181BB
182DD
183AC
184AA
185CC
186BD
187A
188AC
189AD
190AD
191AB
192BB
193BC
194AC
195DC
196DB
197DA
198BB
199D
200BA
201BB
202DD
203BB
204DD
205AD
206DAnnulled
207AA
208CA
209B
210AD