NC Bench
Run: 2026-02-25-01KJ95QKJ8EXJJ311YA49MGXK0
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Language Writing
Scenario:
Character dialogue (Hindi) in a story
Duration:
16.2s
Cost:
$0.001599
Provider:
Google AI Studio
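The duration and cost above can be related to the token counts reported in this run's response metadata. The following is a minimal sketch of that arithmetic, not part of the benchmark itself. It assumes that `generation_time` is in milliseconds (16108 is consistent with the 16.2s duration) and that `native_tokens_reasoning` is counted inside `native_tokens_completion`; neither assumption is confirmed by the page.

```python
# Values copied from this run's response metadata.
meta = {
    "generation_time": 16108,          # assumed to be milliseconds
    "native_tokens_prompt": 289,
    "native_tokens_completion": 3925,
    "native_tokens_reasoning": 3448,   # assumed to be a subset of completion tokens
    "total_cost": 0.0015989,           # USD
}

seconds = meta["generation_time"] / 1000
tokens_per_second = meta["native_tokens_completion"] / seconds

# Share of the completion spent on reasoning rather than visible text.
reasoning_share = meta["native_tokens_reasoning"] / meta["native_tokens_completion"]
visible_tokens = meta["native_tokens_completion"] - meta["native_tokens_reasoning"]

# Blended cost per 1,000 native tokens (prompt and completion together).
total_tokens = meta["native_tokens_prompt"] + meta["native_tokens_completion"]
cost_per_1k = meta["total_cost"] / total_tokens * 1000

print(f"throughput: {tokens_per_second:.1f} tokens/s")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"visible completion tokens: {visible_tokens}")
print(f"cost per 1k tokens: ${cost_per_1k:.6f}")
```

Under these assumptions, roughly 88% of the completion tokens went to reasoning, which leaves 477 tokens for the visible response.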
Evaluation summary
Score:
50.00%
Evaluator:
Parse dialogue
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
The aroma of freshly ground coffee usually offered a comforting blanket in Alborn’s modest cafe, but for Selvi, today it did little to soothe the tremor running through her hands. She was tracing the edge of a photograph on her camera’s screen, a shot taken hastily in the near-darkness of the previous night. Blackwood Forest loomed in the background, and a dark, nondescript van was captured with unnerving clarity, parked just off the barely-there track, its engine ominously silent. She replayed the low, guttural sounds she’d heard, the frantic, muffled cries that had sent her heart leaping into her throat. Her fingers tightened on the cool metal of her camera.

A shadow fell over her small table, startling her. Her head snapped up, camera instinctively clutched closer. A man stood there, imposing and solid, his gaze sharp and direct. He was Detective Hunter, she guessed from the understated authority he radiated.

“Excuse me,” Hunter began, his voice a low, no-nonsense rumble. “I’m Detective Hunter. I’m investigating a few… unsettling incidents around town. Did you happen to see or hear anything unusual last night?”

Selvi’s eyes widened, her breath catching. His words were a blur, but ‘detective’ and ‘unusual’ registered. What registered more strongly was the panic that surged from the memory of what she had witnessed. She pointed a trembling finger at the image on her camera's screen.

<callback>
Selvi: "Oh, bhagwan. Main… main dekha. Kal raat. Wahan… woh gaadi. Kuchh awazein thi. Daravni." (Oh, God. I... I saw. Last night. There... that vehicle. There were some sounds. Scary.)
Hunter: "See? Anything? Last night? This Van? People? Gone?"
Selvi: "Nahi! Nahi! Woh log… woh andar the! Aur woh awaaz! Meri jaan nikalti thi!" (No! No! Those people... they were inside! And that sound! I was terrified!)
Hunter: "Look, I have to go. If you can find *anyone* who speaks English... or if you can write down what you saw... please. This is my card. Find help."
</callback>
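The voice sheet in the response follows the `Name: "..."` pattern requested in the prompt. How the "Parse dialogue" evaluator actually extracts these lines is not shown on this page; the following is only a sketch of one plausible approach. The function name `parse_voice_sheet` and the regular expression are hypothetical, and stripping `*` emphasis markers is an assumption based on the parsed dialogue in the evaluation details, where `*anyone*` appears as `anyone`.

```python
import re

# Hypothetical pattern: a speaker name, a colon, then a straight-quoted line.
# Narrative text in the response uses curly quotes, so it is not matched.
LINE_RE = re.compile(r'(\w+):\s*"([^"]*)"')

def parse_voice_sheet(text):
    """Return a list of {"name", "dialogue"} entries found in the text."""
    entries = []
    for match in LINE_RE.finditer(text):
        # Assumption: emphasis asterisks are dropped before language detection.
        dialogue = match.group(2).replace("*", "")
        entries.append({"name": match.group(1), "dialogue": dialogue})
    return entries

sample = (
    'Selvi: "Nahi! Nahi! Woh log… woh andar the!" (No! No! Those people...) '
    'Hunter: "Look, I have to go. If you can find *anyone* who speaks English..."'
)
entries = parse_voice_sheet(sample)
for entry in entries:
    print(entry["name"], "->", entry["dialogue"])
```

The parenthetical English translations are skipped because they are not preceded by a speaker name and an opening quote.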
created_at
"2026-02-25T01:13:36.081Z"
2/25/2026, 1:13:36 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1078
moderation_latency
(null)
generation_time
16108
tokens_prompt
315
tokens_completion
2746
native_tokens_prompt
289
native_tokens_completion
3925
native_tokens_completion_images
0
native_tokens_reasoning
3448
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0015989
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1078
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771982016-vCwJS7o18b3XjQ2OZJWk"
upstream_id
(null)
total_cost
0.0015989
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result:
50.00%
Evaluator:
Parse dialogue
Details:
n/a
Meta Data:
dialogue
0
name
"Selvi"
dialogue
"Oh, bhagwan. Main… main dekha. Kal raat. Wahan… woh gaadi. Kuchh awazein thi. Daravni."
detectedLang
"ms"
heavyLang
"vi"
scores
ms
0.44812362030905084
tl
0.302649930264993
eu
0.29676511954992973
de
0.2625368731563422
vi
0.2366412213740458
sl
0.21011058451816747
hr
0.20634920634920634
nl
0.2
et
0.18962722852512154
en
0.1652754590984975
fi
0.14965986394557826
fr
0.14965986394557826
cs
0.1334488734835355
pl
0.1334488734835355
sk
0.1334488734835355
yo
0.13194444444444445
reliable
false
passes
false
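The line above is Hindi written in Latin script, and no `hi` candidate appears among the listed scores. One possible explanation, which this page does not confirm, is that the detector expects Hindi in Devanagari script. The sketch below only measures how much of a string falls in the Unicode Devanagari block (U+0900 to U+097F); the helper name `devanagari_share` is hypothetical and the evaluator's real logic may differ.

```python
def devanagari_share(text):
    """Fraction of alphabetic characters that lie in the Devanagari block."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    in_block = sum(1 for ch in letters if "\u0900" <= ch <= "\u097F")
    return in_block / len(letters)

# Dialogue exactly as parsed by the evaluator (romanized Hindi).
romanized = (
    "Oh, bhagwan. Main… main dekha. Kal raat. Wahan… woh gaadi. "
    "Kuchh awazein thi. Daravni."
)
# A Devanagari example added for contrast; it is not from the model response.
devanagari = "नहीं"

print(devanagari_share(romanized))
print(devanagari_share(devanagari))
```

The romanized line contains no Devanagari letters at all, so a script-based signal would have nothing to work with here.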
1
name
"Hunter"
dialogue
"See? Anything? Last night? This Van? People? Gone?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8236331569664903
tl
0.6598639455782312
nl
0.4594594594594595
et
0.45155393053016457
ca
0.3071593533487298
hu
0.29245283018867924
sq
0.25
es
0.2248062015503876
ms
0.20634920634920634
da
0.19786096256684493
de
0.19786096256684493
fr
0.19786096256684493
no
0.19786096256684493
yo
0.19786096256684493
reliable
true
passes
true
2
name
"Selvi"
dialogue
"Nahi! Nahi! Woh log… woh andar the! Aur woh awaaz! Meri jaan nikalti thi!"
detectedLang
"et"
heavyLang
"de"
scores
et
0.4696969696969697
ms
0.44620253164556967
eu
0.4195688225538972
ro
0.36936936936936937
yo
0.34579439252336447
en
0.3445692883895131
sl
0.33586337760910817
no
0.3230174081237911
pt
0.31237721021611
vi
0.3069306930693069
sv
0.3
es
0.2985971943887776
is
0.2290748898678414
tl
0.2290748898678414
it
0.22566371681415928
hr
0.21171171171171171
nl
0.1935483870967742
da
0.1822429906542056
fr
0.1822429906542056
sq
0.1822429906542056
ca
0.18032786885245902
reliable
false
passes
false
3
name
"Hunter"
dialogue
"Look, I have to go. If you can find anyone who speaks English... or if you can write down what you saw... please. This is my card. Find help."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8340581368734264
tl
0.5801968731905037
da
0.4366744366744367
pl
0.38870151770657674
yo
0.36736474694589877
cs
0.35555555555555557
sk
0.33910665451230626
ro
0.3255813953488372
no
0.3166823751178134
hr
0.3108365019011407
sv
0.2995169082125604
et
0.28851815505397443
lt
0.2479253112033195
hu
0.23361522198731502
ca
0.22872340425531915
sl
0.210239651416122
it
0.2050438596491228
lv
0.2032967032967033
ms
0.19175027870680048
nl
0.17986425339366516
is
0.17613636363636365
fr
0.17142857142857143
tr
0.16857798165137614
az
0.16184971098265896
sq
0.1559953434225844
de
0.0903387703889586
be
0.0880503144654088
uk
0.0799492385786802
vi
0.0799492385786802
reliable
true
passes
true
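The 50.00% result is consistent with two of the four parsed lines passing. The following sketch reproduces that figure under the assumption that the score is simply the fraction of entries whose `passes` flag is true; the page does not state the evaluator's actual formula.

```python
# Per-line outcomes copied from the evaluation details above.
results = [
    {"name": "Selvi", "detectedLang": "ms", "reliable": False, "passes": False},
    {"name": "Hunter", "detectedLang": "en", "reliable": True, "passes": True},
    {"name": "Selvi", "detectedLang": "et", "reliable": False, "passes": False},
    {"name": "Hunter", "detectedLang": "en", "reliable": True, "passes": True},
]

# Assumed scoring rule: share of lines that pass.
passed = sum(1 for r in results if r["passes"])
score = passed / len(results)

print(f"{score:.2%}")  # prints 50.00%

# Group failures by speaker to see where the score was lost.
failed_speakers = sorted({r["name"] for r in results if not r["passes"]})
print(failed_speakers)
```

Both failures belong to Selvi, whose lines were detected as Malay (`ms`) and Estonian (`et`) and marked unreliable.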