NC Bench
Model: Gemini 2.5 Flash
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Run ID: 2025-11-18-01KACP6KK3XP502WM2FYM9P81T
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Right, so we’re agreed,” Molly said, chewing the end of a pen, “the target is the bouncy castle.”
“The bouncy castle,” Bill echoed, eyes wide with a mix of dread and amusement. “Molly, are we sure about this? It’s a *children’s* party.”
“Exactly! Undetected. Unsuspecting. They’ll never see us coming.”
“Because we’ll be disguised as… what, exactly? Very large toddlers?” Bill rubbed his temples.
“No, no, amateur hour. We go in as the entertainment. You, Bill, are ‘Boomer Bill, the Balloon Bouncer.’ Your job is to create a diversion. Max chaos.”
“Balloon Bouncer? That’s my professional alias?”
“Has a certain… ring to it. Anyway, while Boomer Bill is charming the pants off the little monsters, I, ‘Magical Molly, the Marvelous Magician,’ will be circling the perimeter. Scooping up the goods.”
“The presents. You mean the ‘goods’ are actually presents for six-year-olds?”
“Irrelevant! They’re high-value, low-security targets. Think of the mark-up on a slightly-used drone.”
“A drone. You think there’ll be a drone?”
“There’s always a drone, Bill. It's the law of modern gift-giving. Now, your balloon dog repertoire, is it still up to snuff?”
“I can make a passable poodle, a somewhat questionable sausage dog, and a very aggressive badger.”
“Perfect! Aggressive badger will be key for the distraction. Now, which one of us is going to pretend to be allergic to peanuts so we can get close to the cake table?”
“I am absolutely not pretending to be allergic to peanuts, Molly. I almost died last time.”
“Details, details. This is for the greater good!”
created_at: "2025-11-18T23:53:37.032253+00:00" (Tuesday, November 18th, 2025 at 11:53:37 PM GMT+00:00)
model: "google/gemini-2.5-flash"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 510
moderation_latency: (null)
generation_time: 3028
tokens_prompt: 68
tokens_completion: 382
native_tokens_prompt: 67
native_tokens_completion: 408
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0010401
api_type: "completions"
id: "gen-1763510013-swNkxnn7HtCFZkdOgI8f"
upstream_id: (null)
total_cost: 0.0010401
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 263
99.9752% | Dialogue to Total Word Ratio | Ratio: 91.25%, Deviation: 1.25% | neededClean: false; wordsTotal: 263; wordsDialogue: 240
49.9876%
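The ratio and deviation reported for the dialogue evaluator can be reproduced from the two word counts (wordsDialogue 240, wordsTotal 263). The sketch below is a minimal illustration, not the benchmark's actual evaluator: the 90% target is taken from the scenario, while the function names, the curly-quote matching, whitespace word splitting, and the use of an absolute deviation are all assumptions.

```python
import re

TARGET_RATIO = 0.90  # from the scenario: "90% dialogue"


def count_dialogue_words(text):
    """Hypothetical counter: (words inside curly double quotes, all words).

    Words are split on whitespace; the real evaluator may tokenize differently.
    """
    total = len(text.split())
    quoted_spans = re.findall(r"“([^”]*)”", text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    return dialogue, total


def ratio_and_deviation(words_dialogue, words_total, target=TARGET_RATIO):
    """Return (ratio, absolute deviation from target), both as fractions."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts as reported in the evaluation meta data.
ratio, deviation = ratio_and_deviation(240, 263)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 91.25%, Deviation: 1.25%"
```

How the deviation maps to the 99.9752% result is not shown on the page, so the sketch stops at the ratio. The final 49.9876% equals the mean of the two evaluator results (0.0000% and 99.9752%), which suggests an averaged overall score, though the page does not label it.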