NC Bench
Overview
About
Tests
Models
Model
Gemini 2.5 Flash
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2025-11-18-01KACP5MK1DVE4WHQ46DMYTNSG
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Are you serious, Bill?” “Serious as a heart attack, Molly. And considerably less messy.” Molly pinched the bridge of her nose, a habit she’d picked up after three years of working with Bill. “A children’s birthday party? The one with the bouncy castle and the guy dressed as a slightly-too-realistic gorilla?” “The very same! Think of the haul, Molly. Imagine the unblemished glory of a hundred perfectly wrapped presents, just waiting to… well, waiting to be liberated.” Bill’s eyes twinkled with the kind of misguided enthusiasm usually reserved for cult leaders or particularly ambitious squirrels. “Liberated, he says. From six-year-olds.” Molly gestured dramatically around the dive bar booth they were currently occupying, attracting the attention of exactly no one. “Billy, we steal from banks, from museums, occasionally from that guy who sells knock-off watches in the alley. We do not steal from infants.” “Technically, they’re six. And look, it’s not *from* them. It’s *for* us. A redistribution of joy, if you will. And toys. Mostly toys.” Bill leaned forward conspiratorially. “Consider this: low security, high emotional trauma for the victims, but ultimately, they’ll get over it. Probably. And isn’t that the sweet spot of all our operations?” “The sweet spot of emotional trauma? Billy, what is wrong with you?” “Nothing! I’ve been thinking about this, Molly. It’s brilliant. The parents will be distracted by the sugary chaos, the kids will be hopped up on birthday cake and juice boxes. It’s a perfect storm of opportunity!” Molly sighed. “And how exactly do you propose we, two internationally renowned purveyors of larceny, infiltrate a kindergarten class’s birthday bash without looking like the creepiest people on the planet?” “Disguises, of course! I was thinking… clowns.” Molly stared at him, her jaw slack. “Clowns. You want us to dress as clowns. The very same clowns that are responsible for 90% of childhood nightmares?” “Precisely! It’s unexpected. Who would suspect two clowns of being master thieves? Everyone expects them to just… make balloon animals and terrorize small children with overly aggressive honking.” “Bill, I refuse to wear a wig that smells like other people’s dreams. And I certainly refuse to paint my face to look like some sort of psychotic harlequin.” “But Molly, think of the tactical advantages! Balloons for distraction, oversized shoes for… well, for very little, but they look funny. And a flower that squirts water! Imagine spraying the bouncy castle gorilla in the face while I make off with the loot.” “You’re going to be spraying a guy in a gorilla suit while I’m supposed to be… what, precisely? Hoofing it out of there with a sack full of Barbies and Hot Wheels?” “Exactly! It’s a classic misdirection. Besides, it’s a challenge. We’ve done the secure vault, the laser grid, the armed guards. This is new territory. A moral grey area, perhaps, but a highly profitable one.” Molly ran a hand through her hair. “And what happens when little Timmy’s parents find out their child’s brand-new Lego Death Star is gone? They’re going to call the cops, Bill. And we’ll be arrested for grand theft children’s toys.” “They’ll never suspect us! Clowns, Molly. Clowns are above suspicion. They’re like ninjas, but with more primary colors and existential dread.” “I’m not wearing red suspenders, Bill.” “Fine, blue! But the oversized shoes are non-negotiable.”
created_at
"2025-11-18T23:53:07.893609+00:00"
Tuesday, November 18th, 2025 at 11:53:07 PM GMT+00:00
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
347
moderation_latency
(null)
generation_time
5357
tokens_prompt
68
tokens_completion
776
native_tokens_prompt
67
native_tokens_completion
813
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0020526
api_type
"completions"
id
"gen-1763509981-IAqo1T4HFJAznt9RGHdc"
upstream_id
(null)
total_cost
0.0020526
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
557
0.0002%
Dialogue to Total Word Ratio
Ratio: 86.36%, Deviation: 16.36%
neededClean
false
wordsTotal
557
wordsDialogue
481
0.0001%