MedicalBenchmark
Goliath 120B provider

Goliath 120B

260

#260 of 290 modelsMIR 2026

Net score

76.33 pts

Accuracy

50.5%

Correct / Incorrect

101 / 74

Total Cost

$1.20

Overall Performance

(vs. average)
Accuracy

50.5%

avg: 81.6%

Net score

76.33 pts

avg: 154.00 pts

Correct

101

avg: 163

Incorrect

74

avg: 28

Total Cost

$1.20

avg: $3.33

Average response time

23.1s

avg: 16.2s

Output Tokens

100K

avg: 430K

Reasoning Tokens

0

avg: 310K

Average confidence

83.8%

avg: 95.1%

Subject Breakdown

Allergology
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
96.9%
Anesthesiology and Resuscitation
Correct
2
Incorrect
4
Unanswered
1
Accuracy
28.6%
Average
69.6%
Cardiology
Correct
7
Incorrect
14
Unanswered
4
Accuracy
28.0%
Average
77.3%
Dermatology
Correct
4
Incorrect
4
Unanswered
3
Accuracy
36.4%
Average
72.3%
Endocrinology and Nutrition
Correct
8
Incorrect
4
Unanswered
3
Accuracy
53.3%
Average
84.0%
ENT
Correct
6
Incorrect
1
Unanswered
1
Accuracy
75.0%
Average
84.7%
Epidemiology
Correct
4
Incorrect
2
Unanswered
1
Accuracy
57.1%
Average
80.2%
Gastroenterology
Correct
12
Incorrect
14
Unanswered
4
Accuracy
40.0%
Average
79.3%
Genetics
Correct
5
Incorrect
4
Unanswered
2
Accuracy
45.5%
Average
78.7%
Geriatrics
Correct
8
Incorrect
5
Unanswered
0
Accuracy
61.5%
Average
83.0%
Gynecology and Obstetrics
Correct
5
Incorrect
3
Unanswered
4
Accuracy
41.7%
Average
84.3%
Health Planning and Management
Correct
5
Incorrect
4
Unanswered
1
Accuracy
50.0%
Average
78.4%
Hematology
Correct
5
Incorrect
3
Unanswered
1
Accuracy
55.6%
Average
76.6%
Immunology
Correct
3
Incorrect
2
Unanswered
1
Accuracy
50.0%
Average
91.4%
Infectious Diseases
Correct
5
Incorrect
7
Unanswered
2
Accuracy
35.7%
Average
77.9%
Legal Medicine and Bioethics
Correct
7
Incorrect
2
Unanswered
2
Accuracy
63.6%
Average
82.9%
Medical Oncology
Correct
11
Incorrect
12
Unanswered
0
Accuracy
47.8%
Average
83.0%
Nephrology
Correct
5
Incorrect
4
Unanswered
1
Accuracy
50.0%
Average
85.1%
Neurology
Correct
8
Incorrect
4
Unanswered
1
Accuracy
61.5%
Average
88.6%
Ophthalmology
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
83.7%
Palliative Care
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
80.2%
Pediatrics
Correct
17
Incorrect
1
Unanswered
4
Accuracy
77.3%
Average
87.6%
Pharmacology
Correct
3
Incorrect
7
Unanswered
1
Accuracy
27.3%
Average
78.6%
Psychiatry
Correct
5
Incorrect
2
Unanswered
1
Accuracy
62.5%
Average
87.9%
Pulmonology
Correct
10
Incorrect
5
Unanswered
1
Accuracy
62.5%
Average
82.8%
Radiology-Emergency
Correct
5
Incorrect
6
Unanswered
2
Accuracy
38.5%
Average
67.7%
Rheumatology
Correct
8
Incorrect
3
Unanswered
0
Accuracy
72.7%
Average
88.4%
Statistics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
83.8%
Traumatology
Correct
5
Incorrect
5
Unanswered
1
Accuracy
45.5%
Average
65.2%
Urology
Correct
4
Incorrect
4
Unanswered
0
Accuracy
50.0%
Average
82.5%

Question Type Breakdown

Anatomy
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
82.6%
Biostatistics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
83.8%
Diagnosis
Correct
44
Incorrect
31
Unanswered
6
Accuracy
54.3%
Average
82.2%
Epidemiology
Correct
6
Incorrect
3
Unanswered
0
Accuracy
66.7%
Average
88.7%
Ethics
Correct
5
Incorrect
0
Unanswered
1
Accuracy
83.3%
Average
92.0%
Interpretation
Correct
12
Incorrect
16
Unanswered
9
Accuracy
32.4%
Average
72.0%
Legal
Correct
7
Incorrect
1
Unanswered
1
Accuracy
77.8%
Average
82.4%
Pathophysiology
Correct
15
Incorrect
6
Unanswered
5
Accuracy
57.7%
Average
84.3%
Pharmacology
Correct
7
Incorrect
7
Unanswered
1
Accuracy
46.7%
Average
82.3%
Prevention
Correct
6
Incorrect
9
Unanswered
1
Accuracy
37.5%
Average
80.6%
Prognosis
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
93.2%
Risk
Correct
6
Incorrect
8
Unanswered
1
Accuracy
40.0%
Average
84.3%
Tests
Correct
14
Incorrect
14
Unanswered
5
Accuracy
42.4%
Average
80.3%
Treatment
Correct
31
Incorrect
30
Unanswered
11
Accuracy
43.1%
Average
80.1%
#AnswerCorrectStatus
1CA
2BB
3CD
4BD
5BB
6AC
7BB
8D
9D
10CA
11DB
12CA
13BAnnulled
14C
15A
16CC
17CC
18D
19AB
20CB
21B
22C
23AB
24AA
25C
26BA
27CB
28DD
29AA
30DD
31C
32CC
33BB
34CB
35C
36BD
37CD
38BA
39CC
40DA
41DD
42AA
43BB
44BB
45BB
46BB
47DD
48CC
49CC
50AAnnulled
51DA
52CC
53CC
54C
55CA
56DA
57A
58DA
59BB
60BB
61C
62BB
63AB
64CAnnulled
65CC
66DC
67AC
68AA
69B
70BB
71AB
72BB
73BB
74A
75CC
76CC
77DA
78AD
79CA
80DD
81CC
82DD
83B
84AA
85DD
86BA
87BB
88AB
89BB
90CC
91A
92BC
93DD
94BC
95DC
96AB
97BB
98AA
99BB
100CC
101CC
102CA
103CC
104CC
105CC
106CB
107DC
108BA
109CC
110AC
111BB
112AC
113CC
114DD
115AB
116CD
117CC
118CB
119CA
120DA
121BC
122BB
123DB
124B
125CC
126DB
127BC
128DB
129BC
130CC
131BB
132DB
133CC
134BB
135CC
136BB
137DD
138BB
139Annulled
140CD
141AA
142BAnnulled
143BA
144DD
145C
146CC
147BB
148CC
149BB
150DB
151DB
152C
153BC
154DD
155AB
156BA
157DD
158BB
159CC
160AA
161CAnnulled
162AA
163DC
164AC
165BA
166AA
167DD
168CC
169BB
170BB
171AA
172B
173DD
174C
175BB
176AA
177D
178BD
179AB
180BC
181BB
182DC
183DD
184DD
185B
186DD
187DD
188C
189CC
190DB
191DD
192DD
193CC
194AC
195AD
196CA
197DC
198BA
199BB
200DD
201CC
202AA
203DA
204CC
205BB
206DC
207BA
208CAnnulled
209BB
210CC