NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ957G3K9E8KQT6HQEPFPZDF
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
35.1s
Cost:
$0.002252
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
99.83%
Dialogue to Total Word Ratio
(details)
49.9315%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unfolded the crudely drawn map, a child’s rendering of a castle adorned with sparkly stickers that had stubbornly adhered to her fingers. They huddled behind a particularly thorny rose bush, the aroma of cut grass and impending parental judgment filling the evening air. Bill nervously adjusted the ill-fitting party hat perched precariously on his head, its brim threatening to obscure his vision. The distant sound of giggling children was like the ticking of a very, very small clock. "Target acquired: 'Fortress of Festivity'," Molly whispered, nodding towards the brightly decorated house. "Primary objective: the legendary 'Present Pile of Plenty'. Intel suggests it's unguarded during the cake-cutting ritual." "A children's party, Molly? It feels... ethically dubious. What if little Timmy sees us? His war cry is terrifying." "Timmy will be wrestling with a slice of overly frosted cake. We're merely redistributing excess toys, Bill. Think of it as early Christmas for us. Now, the exit strategy is crucial: through the gnome-guarded shrubbery." "Gnomes?" Bill whimpered. "Garden gnomes. They’re surprisingly judgmental."
created_at
"2026-02-25T01:04:48.251Z"
2/25/2026, 1:04:48 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1058
moderation_latency
(null)
generation_time
35023
tokens_prompt
77
tokens_completion
7571
native_tokens_prompt
67
native_tokens_completion
5613
native_tokens_completion_images
0
native_tokens_reasoning
5387
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0022519
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1058
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981488-dbeC8dDmUGQwz41u4SgH"
upstream_id
(null)
total_cost
0.0022519
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
170
99.83%
Dialogue to Total Word Ratio
Ratio: 47.98%, Deviation: 2.02%
neededClean
false
wordsTotal
173
wordsDialogue
83
49.9315%