MedicalBenchmark
NousResearch: Hermes 2 Pro - Llama-3 8B provider

Hermes 2 Pro - Llama-3 8B

270

#270 of 290 modelsMIR 2025

Net score

42.66 pts

Accuracy

38.5%

Correct / Incorrect

77 / 103

Total Cost

$0.03

Overall Performance

(vs. average)
Accuracy

38.5%

avg: 75.9%

Net score

42.66 pts

avg: 138.99 pts

Correct

77

avg: 152

Incorrect

103

avg: 38

Total Cost

$0.03

avg: $3.59

Average response time

4.1s

avg: 18.1s

Output Tokens

88K

avg: 443K

Reasoning Tokens

0

avg: 320K

Average confidence

86.5%

avg: 94.7%

Subject Breakdown

Allergology
Correct
2
Incorrect
1
Unanswered
1
Accuracy
50.0%
Average
86.9%
Anesthesiology and Resuscitation
Correct
1
Incorrect
4
Unanswered
1
Accuracy
16.7%
Average
81.3%
Cardiology
Correct
8
Incorrect
12
Unanswered
2
Accuracy
36.4%
Average
77.4%
Dermatology
Correct
4
Incorrect
8
Unanswered
1
Accuracy
30.8%
Average
62.8%
Endocrinology and Nutrition
Correct
8
Incorrect
7
Unanswered
1
Accuracy
50.0%
Average
82.5%
ENT
Correct
2
Incorrect
5
Unanswered
1
Accuracy
25.0%
Average
73.8%
Epidemiology
Correct
0
Incorrect
6
Unanswered
1
Accuracy
0.0%
Average
67.1%
Gastroenterology
Correct
11
Incorrect
9
Unanswered
1
Accuracy
52.4%
Average
72.9%
Genetics
Correct
2
Incorrect
3
Unanswered
1
Accuracy
33.3%
Average
68.2%
Geriatrics
Correct
3
Incorrect
7
Unanswered
1
Accuracy
27.3%
Average
71.2%
Gynecology and Obstetrics
Correct
7
Incorrect
10
Unanswered
2
Accuracy
36.8%
Average
85.9%
Health Planning and Management
Correct
0
Incorrect
2
Unanswered
0
Accuracy
0.0%
Average
81.6%
Hematology
Correct
4
Incorrect
6
Unanswered
1
Accuracy
36.4%
Average
81.8%
Immunology
Correct
5
Incorrect
3
Unanswered
1
Accuracy
55.6%
Average
82.5%
Infectious Diseases
Correct
12
Incorrect
14
Unanswered
2
Accuracy
42.9%
Average
71.1%
Legal Medicine and Bioethics
Correct
1
Incorrect
4
Unanswered
0
Accuracy
20.0%
Average
67.2%
Medical Oncology
Correct
14
Incorrect
8
Unanswered
3
Accuracy
56.0%
Average
86.3%
Nephrology
Correct
8
Incorrect
5
Unanswered
2
Accuracy
53.3%
Average
78.2%
Neurology
Correct
6
Incorrect
11
Unanswered
3
Accuracy
30.0%
Average
76.2%
Ophthalmology
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
72.6%
Palliative Care
Correct
0
Incorrect
3
Unanswered
1
Accuracy
0.0%
Average
77.2%
Pediatrics
Correct
10
Incorrect
14
Unanswered
1
Accuracy
40.0%
Average
72.7%
Pharmacology
Correct
7
Incorrect
10
Unanswered
0
Accuracy
41.2%
Average
73.1%
Psychiatry
Correct
1
Incorrect
5
Unanswered
2
Accuracy
12.5%
Average
82.0%
Pulmonology
Correct
7
Incorrect
4
Unanswered
3
Accuracy
50.0%
Average
73.0%
Radiology-Emergency
Correct
5
Incorrect
7
Unanswered
2
Accuracy
35.7%
Average
67.9%
Rheumatology
Correct
5
Incorrect
7
Unanswered
2
Accuracy
35.7%
Average
74.6%
Statistics
Correct
0
Incorrect
3
Unanswered
0
Accuracy
0.0%
Average
74.9%
Traumatology
Correct
6
Incorrect
9
Unanswered
3
Accuracy
33.3%
Average
78.2%
Urology
Correct
6
Incorrect
1
Unanswered
0
Accuracy
85.7%
Average
79.5%

Question Type Breakdown

Anatomy
Correct
0
Incorrect
7
Unanswered
0
Accuracy
0.0%
Average
77.1%
Biostatistics
Correct
0
Incorrect
4
Unanswered
0
Accuracy
0.0%
Average
78.4%
Diagnosis
Correct
36
Incorrect
41
Unanswered
12
Accuracy
40.4%
Average
77.9%
Epidemiology
Correct
0
Incorrect
4
Unanswered
1
Accuracy
0.0%
Average
75.0%
Ethics
Correct
0
Incorrect
3
Unanswered
0
Accuracy
0.0%
Average
72.0%
Interpretation
Correct
12
Incorrect
25
Unanswered
5
Accuracy
28.6%
Average
69.3%
Legal
Correct
1
Incorrect
3
Unanswered
0
Accuracy
25.0%
Average
63.6%
Pathophysiology
Correct
10
Incorrect
13
Unanswered
4
Accuracy
37.0%
Average
72.6%
Pharmacology
Correct
5
Incorrect
8
Unanswered
0
Accuracy
38.5%
Average
82.4%
Prevention
Correct
6
Incorrect
6
Unanswered
0
Accuracy
50.0%
Average
74.5%
Prognosis
Correct
2
Incorrect
4
Unanswered
0
Accuracy
33.3%
Average
77.8%
Risk
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
84.3%
Tests
Correct
11
Incorrect
11
Unanswered
4
Accuracy
42.3%
Average
76.3%
Treatment
Correct
33
Incorrect
42
Unanswered
7
Accuracy
40.2%
Average
75.2%
#AnswerCorrectStatus
1B
2DA
3C
4AB
5BA
6DC
7CC
8AA
9CA
10AD
11CD
12D
13BB
14DD
15D
16CB
17AB
18DA
19CC
20DA
21DB
22AD
23C
24DD
25CC
26BAnnulled
27BC
28DAnnulled
29AD
30DB
31DD
32DA
33DD
34CD
35BB
36BD
37CC
38BC
39DD
40DA
41AD
42BC
43AB
44DD
45CD
46CA
47DA
48DA
49DD
50DB
51DC
52BB
53DD
54DB
55AA
56DAnnulled
57DC
58CB
59DD
60BA
61A
62DD
63BB
64DD
65AA
66AA
67AB
68B
69AB
70DA
71DD
72CA
73D
74DC
75AA
76CB
77B
78BB
79C
80BC
81CC
82CD
83AB
84DD
85DC
86AC
87CA
88AD
89CB
90DA
91DB
92C
93B
94BC
95AA
96CC
97D
98DC
99DA
100CC
101DB
102CD
103DA
104DC
105CA
106AC
107BB
108DD
109BB
110CC
111AA
112AC
113B
114D
115DD
116BC
117AA
118DD
119AC
120BB
121CD
122CC
123CC
124CC
125DD
126D
127BB
128DD
129BA
130DD
131D
132AA
133BB
134CC
135BB
136CC
137AA
138DD
139DD
140CB
141AA
142BA
143BB
144BB
145BD
146CC
147CB
148DA
149AA
150DA
151CA
152BA
153AB
154BB
155AB
156CC
157AA
158AC
159AC
160AA
161A
162B
163DD
164BC
165DA
166B
167CC
168DD
169DB
170CB
171CC
172DA
173DA
174BB
175AB
176CC
177BC
178DA
179CD
180AA
181DB
182BC
183DB
184BB
185B
186DAnnulled
187AC
188CD
189BD
190A
191CB
192CA
193CC
194AA
195DA
196CA
197BB
198BC
199BD
200CC
201DB
202BA
203DD
204DC
205B
206CD
207A
208BC
209CC
210B