NC Bench
Model
Gemini 2.5 Flash Lite
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2025-11-18-01KACPD76853Q7QBR0HCA30ER4
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
“Are you sure about this, Molly?” Bill whispered, his voice a nervous squeak. “Breaking into a *children’s party*? That feels… low, even for us.”

Molly adjusted her fake beehive wig, a shimmering testament to her commitment to disguise. “Bill, think of the payload. Imagine. A veritable cornucopia of consumerist glee. We’re talking Gameboys, those terrifying singing dolls, enough Lego to build a small country. It’s a goldmine, Bill. A childhood treasure trove.”

“But the kids, Molly! They’ll be there. Tiny, sticky agents of chaos. One of them’s bound to see us in our ill-fitting clown costumes.”

“Which is why,” Molly produced two oversized, rainbow-striped jumpsuits from her duffel bag, “we blend in. Who’s going to suspect the extra clowns? We’re practically a surprise guest. A *very* well-intentioned, gift-acquiring surprise guest.”

Bill stared at the jumpsuit. “Don’t we usually go for more… subtle approaches? Trench coats? Silken gloves?”

“Subtlety is for bank tellers and people who haven’t spent the last week dodging security drones at the city’s artisanal cheese festival. This is a party, Bill. Noise, distraction, sugar highs. Perfect cover.”

“Sugar highs,” Bill repeated, a tremor in his voice. “You think they’ve got cake?”

“Probably. And ice cream. And potentially a bouncy castle. But we’re not here for dessert, Bill. We’re here for the spoils. Now, the intel suggests the presents are all gathered in the den. A veritable Fort Knox of plastic and wrapping paper.”

“And how do we get past the parents?”

“Ah, the parents. The final boss battle. I’ve identified the prime distraction: Mrs. Henderson. Apparently, she’s prone to epic, theatrical pronouncements about party etiquette. We create a minor disturbance near the buffet – a strategically ‘dropped’ tray of miniature quiches, perhaps – and while she’s… *addressing* the situation, you and I slip into the den.”

“Minor disturbance? Molly, you once set off a small firework in a pet shop to cover your escape from a particularly stubborn hamster.”

“An unfortunate misunderstanding involving a rogue laser pointer. This will be much more controlled. Think of it as… performance art. A culinary catastrophe.”

Bill sighed. “So, quiches. And then what? We just grab everything?”

“Not *everything*, Bill. We’re professionals. We go for the high-value items. The electronic toys, the designer dolls, anything with a brand name that screams conspicuous consumption. We’ll need a few strategically placed pillowcases. And, if we’re lucky, we’ll be out before anyone notices the absence of Mrs. Higgins’ prize-winning… poodle that she insists on dressing up as a fairy.”

“You’ve thought of everything, haven’t you?”

“Naturally. This is going to be our masterpiece, Bill. The ‘Little Tykes’ Larceny.’ The ‘Barbie Bonanza.’ The… the ‘Kiddie Cache Caper.’”

“It’s a mouthful, Molly.”

“Precisely! Memorable! Now, help me with this zipper. It’s surprisingly tight for a clown costume.”
created_at
"2025-11-18T23:57:14.610058+00:00"
Tuesday, November 18th, 2025 at 11:57:14 PM GMT+00:00
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
325
moderation_latency
(null)
generation_time
3831
tokens_prompt
68
tokens_completion
690
native_tokens_prompt
67
native_tokens_completion
722
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0002955
api_type
"completions"
id
"gen-1763510230-BtjTNBT3AnVctCYZwCj1"
upstream_id
(null)
total_cost
0.0002955
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 465
0.0000% | Dialogue to Total Word Ratio | Ratio: 90.11%, Deviation: 20.11% | neededClean: false; wordsTotal: 465; wordsDialogue: 419
0.0000%
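The ratio and deviation reported for the "Dialogue to Total Word Ratio" evaluator follow from the word counts in its meta data (419 dialogue words out of 465, against the 70% target in the scenario). The sketch below reproduces that arithmetic. The helper names `dialogue_ratio` and `deviation_from_target` are hypothetical, and the counting rule (whitespace-separated words inside curly double quotes) is an assumption, not the benchmark's documented method, so it may not reproduce the page's exact counts on the full response.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (total_words, dialogue_words, ratio) for a prose scene.

    Assumption: a "word" is a whitespace-separated token, and dialogue is
    any text enclosed in curly double quotes. The real evaluator may differ.
    """
    total = len(text.split())
    quoted_spans = re.findall(r"“([^”]*)”", text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue / total if total else 0.0
    return total, dialogue, ratio


def deviation_from_target(ratio: float, target: float = 0.70) -> float:
    """Absolute distance from the target ratio (assumed to be how deviation is defined)."""
    return abs(ratio - target)


# Reproduce the reported figures from the counts listed in the meta data.
reported_ratio = 419 / 465
print(
    f"Ratio: {reported_ratio:.2%}, "
    f"Deviation: {deviation_from_target(reported_ratio):.2%}"
)
```

With the reported counts this prints a ratio of 90.11% and a deviation of 20.11%, matching the Details column. The page does not state what tolerance each evaluator applies before scoring 0.0000%.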