NC Bench
Overview
About
Tests
Models
Model
Gemini 2.5 Pro
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2025-11-18-01KACP8TJY7PM7B88J5B20YA17
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The air in the van smelled of stale crisps and desperation. Molly jabbed a finger at the map spread across a greasy pizza box on her lap. It was less a blueprint and more a child’s drawing, rendered in thick crayon with a flagrant disregard for scale. A massive yellow sun with a smiley face occupied one corner. “The primary entry point is here,” she declared, her finger tapping a wobbly rectangle labelled ‘Bouncy Castel’. Bill slumped further into the passenger seat, the springs groaning in protest. He squinted at the drawing, a masterpiece of suburban chaos. “You want us to go in through the Bouncy Castle?” “It's the perfect Trojan Horse, Bill. A symbol of joy and distracted parents. No one suspects the Bouncy Castle.” Molly’s eyes glinted with the kind of fervour usually reserved for cracking a bank vault, not infiltrating Brayden’s Sixth Birthday Bash. She slid a blurry photograph across the box. It showed a small man with a pained expression twisting a pink balloon into something vaguely resembling a dachshund. “My cover is 'Billy the Balloon Bender'? You're joking.” “Deadly serious,” Molly said, ignoring his despair. “Your job is to herd the children towards the back garden with the promise of a legendary 'Mega-Snake'.” “I don't know how to make a Mega-Snake! I can barely manage a worm.” “Just tie a knot in a long one.” She dismissed his concerns with a wave. “While you're wrangling the sugar-crazed mob, I'll slip in. The target is a centralized pile they're calling 'Gift Mountain'.” Her finger moved to a crudely drawn pyramid in the area marked ‘Lounge’. Bill ran a hand over his tired face. “Gift Mountain? Molly, what are we even looking for? A new-release tricycle? A limited-edition talking pony?” “We're looking for volume, Bill. Volume and wrapping paper that suggests a high price point. Think electronics, not educational wooden blocks.” She was all business, tracing the exfiltration route with a red crayon. Out the patio doors, behind the shed filled with pool noodles, and through a gap in the fence. The entire operation was timed to coincide with the magician’s finale, a twenty-minute window of peak distraction. The plan was, on paper, almost plausible. If you ignored the fact that the entire security detail consisted of parents filming on their phones and a golden retriever that was afraid of its own shadow. “Fine,” Bill sighed, the fight draining out of him. He picked a piece of pepperoni off the map. “But you know there's going to be cake, right? What if we miss the cake?” Molly looked at him, her expression a flat mask of professional focus. “We are not here for the cake, Bill.”
created_at
"2025-11-18T23:55:10.293962+00:00"
Tuesday, November 18th, 2025 at 11:55:10 PM GMT+00:00
model
"google/gemini-2.5-pro"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2020
moderation_latency
(null)
generation_time
23736
tokens_prompt
68
tokens_completion
1287
native_tokens_prompt
67
native_tokens_completion
2659
native_tokens_completion_images
0
native_tokens_reasoning
2060
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.02667375
api_type
"completions"
id
"gen-1763510086-ngXqQQqTLvaWUsJONnnS"
upstream_id
(null)
total_cost
0.02667375
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
455
3.3496%
Dialogue to Total Word Ratio
Ratio: 38.24%, Deviation: 8.24%
neededClean
false
wordsTotal
455
wordsDialogue
174
1.6748%