NC Bench
Overview
About
Tests
Models
Model
Gemini 2.5 Flash
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-11-18-01KACPBPHQD3Z66Q8PBSDM59FF
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Right, so the bouncy castle is our biggest hurdle.” Molly chewed on her lip, scanning the blueprints of a suburban backyard. “Bouncy castle? Molly, we’re stealing from a five-year-old’s birthday, not Fort Knox,” Bill grumbled, polishing a pair of binoculars that looked suspiciously like they’d been lifted from a nature documentary host. “Precisely! The sheer unpredictability of inflatable physics, Bill. Think of the trampoline effect. We need a distraction. A big one.” “A clown. Get a clown.” “Too obvious. And frankly, traumatizing. My last encounter with a clown involved a unicycle and a very expensive vase.” “Fair point. What about the gift table? Is it visible from the patio?” “Partially. But the real stash, the good stuff, will be in the kitchen. That’s where the parents will have their own little ‘secret santa’ pile.” “Ah, the adult presents. Nifty. So the kid presents are just a bonus?” “Hardly. It’s a matter of principle. Every toy, every wrapped box. We leave no present behind, Bill. We are Santa’s more… efficient helpers.” “Right. Santa’s morally ambiguous helpers. What about the cake? Can we leverage the cake?” “The cake is key. We need it to collapse in a spectacular, sugary explosion. Right when the presents are being opened.” “A synchronized sugar-bomb, then. I like it. So, a distraction, a kitchen sweep, and a bouncy castle bypass. And what about the dog? Blue Heeler, apparently.” “The dog, Bill, is where the bacon comes in.”
created_at
"2025-11-18T23:56:23.742181+00:00"
Tuesday, November 18th, 2025 at 11:56:23 PM GMT+00:00
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
509
moderation_latency
(null)
generation_time
2787
tokens_prompt
68
tokens_completion
341
native_tokens_prompt
67
native_tokens_completion
359
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0009176
api_type
"completions"
id
"gen-1763510180-GfXkatQPMzObqdxzURFU"
upstream_id
(null)
total_cost
0.0009176
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
243
99.4237%
Dialogue to Total Word Ratio
Ratio: 87.24%, Deviation: 2.76%
neededClean
false
wordsTotal
243
wordsDialogue
212
49.7119%