NC Bench
Model: Gemini 2.5 Flash Lite
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Run ID: 2025-11-18-01KACP86V9FB9NYVAGKD8WY02E
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Are you sure about this, Molly?” Bill’s voice was a reedy whisper, his eyes wide as he peered through the gap in the garden fence. “It’s… a lot of tiny humans.”
Molly adjusted the brim of her fedora, a theatrical flourish. “Bill, my friend, think of it as a deep-sea dive. We’re after a rare, iridescent prize, and these… ankle-biters… are merely the phosphorescent jellyfish we must navigate.”
Bill gulped. “They’re not jellyfish, Molly. They’re small, screaming harbingers of chaos. And that one, the one with the sticky face, is staring right at me.”
“A distraction, Bill. Classic misdirection. Focus. We’ve rehearsed this. You take point on the sugar-fueled squadron. Maintain eye contact. Assert dominance through interpretive dance, if you must.”
“Interpretive dance? Molly, I can barely navigate a crowded supermarket without tripping. And what if they throw something? Imagine a rogue party popper to the eye.”
“Precisely why you’re going in first. You’re the… decoy. The… human shield, if you will. I’ll be positioned by the gift table, a silent shadow, a master of acquisition. Think of the loot, Bill. The Lego sets. The remote-control cars. The… glitter glue. So much glitter glue.”
“Glitter glue will be in my hair for weeks, Molly. My parole officer will know.”
“Details, Bill, mere trifling details. You need to overcome your… apprehension. This isn’t some high-stakes art gallery heist. This is a glorified birthday buffet. We just need to be stealthy. And maybe wear an oversized inflatable dinosaur costume.”
Bill’s jaw sagged. “A dinosaur costume? You want me to wear a dinosaur costume?”
“It’s perfect! Blends in. Creates a diversion. Imagine little Timmy seeing a T-Rex at his own party. His little mind, blown! He’ll forget all about the presents.”
“Or he’ll have a terrifying existential crisis and start crying, alerting the horde. Then we’re dealing with a full-blown stampede of mini-Nietzscheans.”
“You’re overthinking it, Bill. Think of the rewards. We could retire to a small island. Somewhere with no children. Preferably with an all-you-can-eat buffet of… non-sparkly food.”
“An island sounds nice. But the dinosaur costume…”
“It’s non-negotiable, Bill. It’s our Trojan Horse. Or rather, our Trojan Dino. Now, remember the signal: two sharp whistles, followed by a dramatic cough. That means the coast is clear for me to deploy the burlap sack.”
“And what if I get overwhelmed and start singing ‘Wheels on the Bus’ off-key?”
“Then you, my dear Bill, have officially failed the mission. But I have faith. Mostly. Now, deep breaths. Imagine you are one of them. A tiny, sugar-crazed maniac. Embrace the chaos.”
Bill squeezed his eyes shut. “I’m a tiny, sugar-crazed maniac. I want cake. And presents. And maybe to push someone down.”
Molly clapped him on the shoulder. “Excellent! You’re almost ready. Now go forth, my reptilian friend, and procure our treasures.”
created_at: "2025-11-18T23:54:29.888823+00:00" (Tuesday, November 18th, 2025 at 11:54:29 PM GMT+00:00)
model: "google/gemini-2.5-flash-lite"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 328
moderation_latency: (null)
generation_time: 3488
tokens_prompt: 68
tokens_completion: 705
native_tokens_prompt: 67
native_tokens_completion: 736
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0003011
api_type: "completions"
id: "gen-1763510066-ynk6XlnP9jMkiI8UWUsg"
upstream_id: (null)
total_cost: 0.0003011
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
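The Meta fields above can be cross-checked against each other. The sketch below is illustrative only. It assumes that generation_time is expressed in milliseconds and that the run was billed at 0.10 USD per million prompt tokens and 0.40 USD per million completion tokens; neither the unit nor the rates are stated in this record. Under those assumed rates, the native token counts reproduce the recorded total_cost.

```python
# Values copied from the Meta record of this run.
NATIVE_TOKENS_PROMPT = 67
NATIVE_TOKENS_COMPLETION = 736
GENERATION_TIME = 3488        # ASSUMPTION: milliseconds (unit not stated in the record)
TOTAL_COST = 0.0003011

# ASSUMPTION: hypothetical per-token rates in USD, not taken from the record.
PROMPT_RATE = 0.10 / 1_000_000
COMPLETION_RATE = 0.40 / 1_000_000


def estimated_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Cost implied by the assumed per-token rates."""
    return prompt_tokens * PROMPT_RATE + completion_tokens * COMPLETION_RATE


def tokens_per_second(completion_tokens: int, generation_ms: float) -> float:
    """Completion throughput, assuming the duration is given in milliseconds."""
    return completion_tokens / (generation_ms / 1000.0)


if __name__ == "__main__":
    cost = estimated_cost(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION)
    print(f"estimated cost: {cost:.7f} (recorded: {TOTAL_COST})")
    rate = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME)
    print(f"throughput: {rate:.1f} native completion tokens per second")
```

The native token counts are used rather than tokens_prompt and tokens_completion because they are the ones that match the recorded cost under the assumed rates.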
Evaluation details
Result | Evaluator | Details | Meta Data
43.3783% | Matches word count | n/a | neededClean: false, words: 483
0.0000% | Dialogue to Total Word Ratio | Ratio: 90.68%, Deviation: 20.68% | neededClean: false, wordsTotal: 483, wordsDialogue: 438
21.6892%
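The reported ratio and deviation follow directly from the word counts: 438 dialogue words out of 483 total is 90.68%, which is 20.68 percentage points above the 70% target in the scenario. The final figure of 21.6892% is consistent with the unweighted mean of the two evaluator results, although the page does not say how it is computed. The sketch below shows one way such a measurement could be made. It assumes dialogue is whatever sits between curly double quotes and that words are whitespace-separated; the benchmark's actual tokenization and scoring are not documented here, and the function names are hypothetical.

```python
import re

TARGET_RATIO = 0.70  # from the scenario: 70% dialogue


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (an assumed definition of 'word')."""
    return len(text.split())


def dialogue_words(text: str) -> int:
    """Count words inside curly double quotes, the style used in the response."""
    spans = re.findall(r"“([^”]*)”", text)
    return sum(count_words(span) for span in spans)


def dialogue_stats(text: str, target: float = TARGET_RATIO):
    """Return (total words, dialogue words, ratio, absolute deviation from target)."""
    total = count_words(text)
    spoken = dialogue_words(text)
    ratio = spoken / total if total else 0.0
    return total, spoken, ratio, abs(ratio - target)


# Arithmetic check against the figures reported in the evaluation details.
reported_ratio = 438 / 483
reported_deviation = reported_ratio - TARGET_RATIO
mean_of_results = (43.3783 + 0.0) / 2

if __name__ == "__main__":
    sample = "“Are you sure?” Bill whispered. “It is a lot of tiny humans.”"
    print(dialogue_stats(sample))
    print(f"ratio {reported_ratio:.2%}, deviation {reported_deviation:.2%}")
    print(f"mean of evaluator results: {mean_of_results:.5f}%")
```

Running the counter on the full response may not give exactly 438 and 483, since the evaluator's handling of punctuation, ellipses, and nested quotes is unknown.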