NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95CQKE8C7B0TV37ZB2WYSK
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
42.8s
Cost:
$0.003896
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped a sharpened pencil against the greasy placemat, her eyes narrowed in concentration. Bill slurped his banana milkshake, a sticky residue tracing a path down his chin. The placemat, a child's crayon rendering of ‘Susie’s Super Fun Twelfth’, was now adorned with Molly’s strategic red scribbles: ‘PRIMARY OBJECTIVE: THE BACK ROOM (GIFTS)’, ‘SECONDARY: THE FRONT PORCH STEPS (CARD BOX)’, ‘ESCAPE ROUTE: THE FORSYTHIA BUSH’. The ambient hum of ‘The Greasy Spoon’ diner, a chipped Formica tabletop, and the distant clatter of plates served as their planning sanctuary. Molly folded the placemat into quarters, then eighths, a nervous habit. The afternoon sun, struggling through the grimy window, caught a stray piece of glitter still clinging to Bill’s enormous thumb. This operation, ‘Operation: Present Panic’, was audacious – a raid on a children’s birthday party, targeting the symbols of childhood joy. Molly found the sheer absurdity both appalling and exhilarating. Bill, meanwhile, seemed mostly focused on the milkshake. The plan was direct: hit the house during peak chaos. Molly, the phantom, would slip in the back, bypass napping adults, and collect goods from the ‘gift zone’. Bill, the brute force distraction, would orchestrate a spectacle outside, drawing the younger, more volatile element away. A yappy terrier barked distantly, a factor for their getaway. The dense foliage of the garden offered an escape. Molly imagined the scene: stolen wrapping paper, confused parents, Bill juggling invisible oranges. It would be glorious. “Alright, Bill,” Molly finally murmured, smoothing the placemat. “Here’s the core intel. The window is critical: precisely between the puppet show and the actual cake presentation. Your role necessitates… significant theatre.” Bill nodded, wiping his mouth. “You mean I gotta be loud?” “Exactly. You’re ‘Captain Chaos’. Your mission is to create a diversion so overwhelming, so utterly bizarre, that every child, and frankly, every adult with a pulse, is drawn *away* from the house. Think spontaneous interpretive dance about sock puppets, or a sudden, aggressive balloon animal demonstration. Keep ‘em pinned to the park across the street. My part is the silent infiltration. The back room first, then the porch. Quick, clean sweep. Load into the disguised duffels. We rendezvous at the lilac bush by the fence. Watch for the dog. And absolutely no singing. Your singing would be terrifying.” Molly stood, adjusting her jacket. “And Bill? This entire haul is spoken for by the buyer. No ‘borrowing’ for yourself. Understood?” Bill grinned, a flicker of mischief in his eyes. “Presents for the people, boss. Got it.”
created_at
"2026-02-25T01:07:39.77Z"
2/25/2026, 1:07:39 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1183
moderation_latency
(null)
generation_time
42746
tokens_prompt
77
tokens_completion
6821
native_tokens_prompt
67
native_tokens_completion
9724
native_tokens_completion_images
0
native_tokens_reasoning
9171
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0038963
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1183
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981659-1wgAXbraoDsMIQXkZ3ZO"
upstream_id
(null)
total_cost
0.0038963
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
414
0.00%
Dialogue to Total Word Ratio
Ratio: 53.38%, Deviation: 23.38%
neededClean
false
wordsTotal
414
wordsDialogue
221
0.0000%