MedicalBenchmark
Arcee AI: Trinity Mini provider

Trinity Mini

227

#227 of 291 modelsMIR 2024

Net score

130.33 pts

Accuracy

72.5%

Correct / Incorrect

145 / 44

Total Cost

$0.04

Overall Performance

(vs. average)
Accuracy

72.5%

avg: 80.5%

Net score

130.33 pts

avg: 150.85 pts

Correct

145

avg: 161

Incorrect

44

avg: 30

Total Cost

$0.04

avg: $3.32

Average response time

5.7s

avg: 16.4s

Output Tokens

241K

avg: 427K

Reasoning Tokens

198K

avg: 310K

Average confidence

94.6%

avg: 95.4%

Subject Breakdown

Allergology
Correct
3
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
90.5%
Anesthesiology and Resuscitation
Correct
3
Incorrect
1
Unanswered
0
Accuracy
75.0%
Average
87.1%
Cardiology
Correct
14
Incorrect
5
Unanswered
2
Accuracy
66.7%
Average
79.7%
Dermatology
Correct
9
Incorrect
5
Unanswered
0
Accuracy
64.3%
Average
80.2%
Endocrinology and Nutrition
Correct
13
Incorrect
4
Unanswered
2
Accuracy
68.4%
Average
84.2%
ENT
Correct
4
Incorrect
3
Unanswered
0
Accuracy
57.1%
Average
74.4%
Epidemiology
Correct
6
Incorrect
2
Unanswered
0
Accuracy
75.0%
Average
89.3%
Gastroenterology
Correct
15
Incorrect
7
Unanswered
0
Accuracy
68.2%
Average
70.5%
Genetics
Correct
7
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
86.5%
Geriatrics
Correct
8
Incorrect
0
Unanswered
2
Accuracy
80.0%
Average
86.9%
Gynecology and Obstetrics
Correct
9
Incorrect
2
Unanswered
3
Accuracy
64.3%
Average
81.2%
Health Planning and Management
Correct
1
Incorrect
1
Unanswered
0
Accuracy
50.0%
Average
73.2%
Hematology
Correct
11
Incorrect
1
Unanswered
1
Accuracy
84.6%
Average
81.5%
Immunology
Correct
5
Incorrect
0
Unanswered
3
Accuracy
62.5%
Average
89.1%
Infectious Diseases
Correct
17
Incorrect
5
Unanswered
1
Accuracy
73.9%
Average
81.8%
Legal Medicine and Bioethics
Correct
2
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
91.7%
Medical Oncology
Correct
15
Incorrect
5
Unanswered
1
Accuracy
71.4%
Average
80.2%
Nephrology
Correct
10
Incorrect
3
Unanswered
0
Accuracy
76.9%
Average
80.8%
Neurology
Correct
19
Incorrect
2
Unanswered
1
Accuracy
86.4%
Average
83.7%
Ophthalmology
Correct
2
Incorrect
3
Unanswered
0
Accuracy
40.0%
Average
80.0%
Palliative Care
Correct
4
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
88.2%
Pediatrics
Correct
10
Incorrect
7
Unanswered
0
Accuracy
58.8%
Average
82.0%
Pharmacology
Correct
18
Incorrect
4
Unanswered
1
Accuracy
78.3%
Average
85.4%
Psychiatry
Correct
9
Incorrect
0
Unanswered
1
Accuracy
90.0%
Average
89.5%
Pulmonology
Correct
14
Incorrect
5
Unanswered
0
Accuracy
73.7%
Average
80.6%
Radiology-Emergency
Correct
8
Incorrect
3
Unanswered
3
Accuracy
57.1%
Average
64.9%
Rheumatology
Correct
12
Incorrect
1
Unanswered
1
Accuracy
85.7%
Average
81.4%
Statistics
Correct
2
Incorrect
1
Unanswered
0
Accuracy
66.7%
Average
91.1%
Traumatology
Correct
10
Incorrect
3
Unanswered
2
Accuracy
66.7%
Average
74.5%
Urology
Correct
4
Incorrect
2
Unanswered
0
Accuracy
66.7%
Average
78.2%

Question Type Breakdown

Anatomy
Correct
3
Incorrect
3
Unanswered
0
Accuracy
50.0%
Average
79.8%
Biostatistics
Correct
3
Incorrect
2
Unanswered
0
Accuracy
60.0%
Average
90.7%
Diagnosis
Correct
49
Incorrect
20
Unanswered
4
Accuracy
67.1%
Average
79.2%
Epidemiology
Correct
9
Incorrect
3
Unanswered
0
Accuracy
75.0%
Average
81.2%
Ethics
Correct
1
Incorrect
0
Unanswered
0
Accuracy
100.0%
Average
94.5%
Interpretation
Correct
24
Incorrect
10
Unanswered
3
Accuracy
64.9%
Average
69.6%
Pathophysiology
Correct
25
Incorrect
5
Unanswered
3
Accuracy
75.8%
Average
85.4%
Pharmacology
Correct
20
Incorrect
3
Unanswered
2
Accuracy
80.0%
Average
84.0%
Prevention
Correct
9
Incorrect
2
Unanswered
1
Accuracy
75.0%
Average
89.8%
Prognosis
Correct
5
Incorrect
1
Unanswered
1
Accuracy
71.4%
Average
83.9%
Risk
Correct
10
Incorrect
3
Unanswered
0
Accuracy
76.9%
Average
83.6%
Tests
Correct
13
Incorrect
5
Unanswered
3
Accuracy
61.9%
Average
73.9%
Treatment
Correct
57
Incorrect
12
Unanswered
2
Accuracy
80.3%
Average
81.3%
#AnswerCorrectStatus
1BB
2D
3BB
4BC
5C
6BB
7DD
8CC
9BA
10DD
11DD
12AA
13C
14DA
15CB
16AA
17CC
18AA
19BB
20CC
21DD
22AB
23AA
24BA
25AC
26BB
27AC
28CA
29BB
30DC
31AD
32AA
33CC
34BB
35DD
36DD
37A
38A
39CC
40BB
41BC
42DD
43AA
44DD
45AD
46BB
47CC
48CC
49BB
50BC
51CA
52DD
53CC
54DB
55CC
56DD
57DA
58DA
59CA
60DA
61AA
62CD
63BD
64AAnnulled
65DD
66CC
67CB
68CAnnulled
69AA
70BB
71B
72CD
73BB
74CC
75BB
76AA
77AD
78DC
79BB
80AA
81DC
82CC
83BB
84CC
85AA
86AA
87BB
88DD
89BB
90AA
91DD
92AA
93AC
94BB
95DD
96BB
97DB
98BB
99AA
100BB
101AA
102DD
103BB
104DD
105DB
106BC
107CC
108BB
109DD
110DD
111BB
112CC
113BAnnulled
114DD
115DD
116DA
117DD
118DD
119CA
120CC
121AA
122BB
123DD
124DD
125CB
126DD
127AA
128BB
129DD
130BC
131CC
132CD
133AA
134CC
135DA
136DD
137AA
138CC
139AA
140CC
141BB
142CC
143AA
144DD
145CC
146BC
147CC
148AA
149C
150DD
151AA
152A
153CC
154BB
155DD
156CC
157CC
158DD
159DD
160B
161BB
162B
163BB
164BB
165CA
166CC
167AA
168BB
169CC
170CA
171DD
172BB
173AA
174BB
175AA
176CC
177CC
178B
179CC
180DAnnulled
181BB
182DD
183CC
184AA
185CC
186DD
187AA
188CC
189CD
190DD
191BB
192DB
193CC
194DC
195CC
196BB
197AA
198BB
199DD
200BA
201BB
202DD
203BB
204DD
205BD
206CAnnulled
207DA
208AA
209BB
210DD