MedicalBenchmark
NousResearch: Hermes 2 Pro - Llama-3 8B provider

Hermes 2 Pro - Llama-3 8B

298

#298 of 319 modelsMIR 2025

Net score

44.33 pts

Accuracy

39.0%

Correct / Incorrect

78 / 101

Total Cost

$0.03

Overall Performance

(vs. average)
Accuracy

39.0%

avg: 77.9%

Net score

44.33 pts

avg: 143.96 pts

Correct

78

avg: 156

Incorrect

101

avg: 35

Total Cost

$0.03

avg: $3.36

Average response time

4.1s

avg: 19.0s

Output Tokens

88K

avg: 430K

Reasoning Tokens

0

avg: 306K

Average confidence

86.5%

avg: 95.2%

Subject Breakdown

Allergology
Correct
2
Incorrect
1
Unanswered
1
Accuracy
50.0%
Average
87.9%
Anesthesiology and Resuscitation
Correct
1
Incorrect
4
Unanswered
1
Accuracy
16.7%
Average
82.3%
Cardiology
Correct
8
Incorrect
12
Unanswered
2
Accuracy
36.4%
Average
78.6%
Dermatology
Correct
4
Incorrect
7
Unanswered
1
Accuracy
33.3%
Average
69.4%
Endocrinology and Nutrition
Correct
8
Incorrect
7
Unanswered
1
Accuracy
50.0%
Average
83.5%
ENT
Correct
2
Incorrect
5
Unanswered
1
Accuracy
25.0%
Average
74.8%
Epidemiology
Correct
0
Incorrect
6
Unanswered
1
Accuracy
0.0%
Average
69.1%
Gastroenterology
Correct
11
Incorrect
9
Unanswered
1
Accuracy
52.4%
Average
74.1%
Genetics
Correct
2
Incorrect
3
Unanswered
1
Accuracy
33.3%
Average
69.5%
Geriatrics
Correct
4
Incorrect
6
Unanswered
1
Accuracy
36.4%
Average
77.5%
Gynecology and Obstetrics
Correct
7
Incorrect
10
Unanswered
2
Accuracy
36.8%
Average
86.7%
Health Planning and Management
Correct
0
Incorrect
2
Unanswered
0
Accuracy
0.0%
Average
82.6%
Hematology
Correct
4
Incorrect
6
Unanswered
1
Accuracy
36.4%
Average
82.7%
Immunology
Correct
5
Incorrect
3
Unanswered
1
Accuracy
55.6%
Average
83.3%
Infectious Diseases
Correct
12
Incorrect
13
Unanswered
2
Accuracy
44.4%
Average
74.9%
Legal Medicine and Bioethics
Correct
1
Incorrect
4
Unanswered
0
Accuracy
20.0%
Average
68.4%
Medical Oncology
Correct
14
Incorrect
8
Unanswered
3
Accuracy
56.0%
Average
87.2%
Nephrology
Correct
8
Incorrect
4
Unanswered
2
Accuracy
57.1%
Average
84.8%
Neurology
Correct
6
Incorrect
11
Unanswered
3
Accuracy
30.0%
Average
77.3%
Ophthalmology
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
74.2%
Palliative Care
Correct
0
Incorrect
3
Unanswered
1
Accuracy
0.0%
Average
78.6%
Pediatrics
Correct
10
Incorrect
15
Unanswered
1
Accuracy
38.5%
Average
71.9%
Pharmacology
Correct
7
Incorrect
10
Unanswered
0
Accuracy
41.2%
Average
74.1%
Psychiatry
Correct
1
Incorrect
5
Unanswered
2
Accuracy
12.5%
Average
83.0%
Pulmonology
Correct
7
Incorrect
3
Unanswered
4
Accuracy
50.0%
Average
80.4%
Radiology-Emergency
Correct
5
Incorrect
7
Unanswered
2
Accuracy
35.7%
Average
69.4%
Rheumatology
Correct
5
Incorrect
7
Unanswered
3
Accuracy
33.3%
Average
76.6%
Statistics
Correct
0
Incorrect
3
Unanswered
0
Accuracy
0.0%
Average
76.6%
Traumatology
Correct
6
Incorrect
9
Unanswered
3
Accuracy
33.3%
Average
79.3%
Urology
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
80.7%

Question Type Breakdown

Anatomy
Correct
0
Incorrect
7
Unanswered
0
Accuracy
0.0%
Average
78.6%
Biostatistics
Correct
0
Incorrect
4
Unanswered
0
Accuracy
0.0%
Average
79.8%
Diagnosis
Correct
36
Incorrect
40
Unanswered
12
Accuracy
40.9%
Average
79.9%
Epidemiology
Correct
0
Incorrect
4
Unanswered
1
Accuracy
0.0%
Average
76.7%
Ethics
Correct
0
Incorrect
3
Unanswered
0
Accuracy
0.0%
Average
74.1%
Interpretation
Correct
12
Incorrect
25
Unanswered
5
Accuracy
28.6%
Average
70.7%
Legal
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
64.6%
Pathophysiology
Correct
11
Incorrect
12
Unanswered
4
Accuracy
40.7%
Average
76.1%
Pharmacology
Correct
5
Incorrect
8
Unanswered
0
Accuracy
38.5%
Average
83.3%
Prevention
Correct
6
Incorrect
6
Unanswered
0
Accuracy
50.0%
Average
75.6%
Prognosis
Correct
2
Incorrect
4
Unanswered
1
Accuracy
28.6%
Average
80.8%
Risk
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
85.2%
Tests
Correct
11
Incorrect
11
Unanswered
5
Accuracy
40.7%
Average
77.9%
Treatment
Correct
33
Incorrect
41
Unanswered
7
Accuracy
40.7%
Average
77.3%
#AnswerCorrectStatus
1B
2DA
3C
4AB
5BA
6DC
7CC
8AA
9CA
10AD
11CD
12D
13BB
14DD
15DAnnulled
16CB
17AB
18DA
19CC
20DA
21DB
22AD
23C
24DD
25CC
26BAnnulled
27BC
28DAnnulled
29AD
30DB
31DD
32DA
33DD
34CD
35BB
36BD
37CC
38BC
39DD
40DA
41AD
42BC
43AB
44DD
45CD
46CA
47DA
48DA
49DD
50DB
51DC
52BB
53DD
54DB
55AA
56DAnnulled
57DC
58CB
59DD
60BA
61A
62DD
63BB
64DD
65AA
66AA
67AB
68B
69AB
70DA
71DD
72CA
73D
74DC
75AA
76CB
77B
78BB
79C
80BC
81CC
82CD
83AB
84DD
85DC
86AC
87CA
88AD
89CB
90DA
91DB
92C
93B
94BC
95AA
96CC
97D
98DC
99DA
100CC
101DB
102CD
103DA
104DC
105CA
106AC
107BB
108DD
109BB
110CC
111AA
112AC
113B
114D
115DD
116BC
117AA
118DD
119AC
120BB
121CD
122CC
123CC
124CC
125DD
126D
127BB
128DD
129BA
130DD
131D
132AA
133BB
134CC
135BB
136CC
137AA
138DD
139DD
140CB
141AA
142BA
143BB
144BB
145BD
146CC
147CB
148DA
149AA
150DD
151CA
152BA
153AB
154BB
155AB
156CC
157AA
158AC
159AC
160AA
161A
162BAnnulled
163DD
164BC
165DA
166B
167CC
168DD
169DB
170CB
171CC
172DA
173DA
174BB
175AB
176CC
177BC
178DA
179CD
180AA
181DB
182BC
183DB
184BB
185B
186DAnnulled
187AC
188CD
189BD
190A
191CB
192CA
193CC
194AA
195DA
196CA
197BB
198BC
199BD
200CC
201DB
202BA
203DD
204DC
205B
206CD
207A
208BC
209CC
210B