NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNGHQAJ988ER6K92G144Z6J
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Language Writing
Scenario:
Character dialogue (Hindi) in a story
Duration:
5.6s
Cost:
$0.000191
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
50.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
**Voice Sheet Dialogue:** **Selvi:** (Looking down at her camera, voice trembling slightly) "Haan... haan... main... main yahin... yahin khade hoon... par... par main... main nahi... nahi ja sakti." (She gestures vaguely towards the town square, her eyes wide with fear) "Unki... unki baat... unki baat... main nahi samajh paati." (She shakes her head, clutching her camera strap tightly) **Hunter:** (Leaning in, brow furrowed, trying to follow her gestures) "Selvi? What are you talking about? Slow down. What happened? You look like you've seen a ghost." (He frowns, clearly lost) "What's wrong? Speak English. I don't understand Hindi." (He pulls out his phone, checking the screen urgently) "Sorry, gotta take this. Something came up. We'll talk later, okay?" (He turns and walks away quickly, leaving Selvi staring after him, still clutching her camera, looking utterly bewildered and more shaken than before)
created_at
"2026-02-17T09:57:55.158283+00:00"
2/17/2026, 9:57:55 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
74
moderation_latency
(null)
generation_time
5458
tokens_prompt
315
tokens_completion
1400
native_tokens_prompt
274
native_tokens_completion
1192
native_tokens_completion_images
(null)
native_tokens_reasoning
1165
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00019113
router
(null)
provider_responses
0
id
"018ef4636f794b72ad2b527a57702b66"
status
200
is_byok
false
latency
74
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771322269-iCC9fQbtPqp5z04bZD9p"
upstream_id
"018ef4636f794b72ad2b527a57702b66"
total_cost
0.00019113
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
50.00%
Parse dialogue
n/a
dialogue
0
name
"Selvi"
dialogue
"Unki... unki baat... unki baat... main nahi samajh paati."
detectedLang
"et"
heavyLang
"tl"
scores
et
0.511002444987775
eu
0.4520547945205479
fi
0.34210526315789475
is
0.33774834437086093
no
0.3333333333333333
en
0.3311036789297659
lv
0.3265993265993266
tl
0.3265993265993266
fr
0.3055555555555556
da
0.3006993006993007
sv
0.2857142857142857
de
0.2805755395683453
ms
0.26199261992619927
reliable
false
passes
false
1
name
"Hunter"
dialogue
"Sorry, gotta take this. Something came up. We'll talk later, okay?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7505868544600939
tl
0.6358183376178235
it
0.4121715076071923
nl
0.40308988764044945
pt
0.3694362017804154
sv
0.2772108843537415
de
0.2685025817555938
fi
0.2085661080074488
hu
0.2026266416510319
da
0.18111753371868977
no
0.17635658914728683
es
0.1550695825049702
ms
0.1550695825049702
sl
0.1550695825049702
hi
0.15169660678642716
pl
0.15169660678642716
fr
0.1431451612903226
reliable
true
passes
true