NC Bench
Run: 2026-05-20-01KS26146KYN0AJP8J19H7GJXQ
Run Details
Model: Gemini 3.5 Flash (Reasoning, Minimal)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 5.5s
Cost: $0.006554
Provider: Google AI Studio
Evaluation summary
Matches word count: 100.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 49.9995%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
What do you mean, you didn't bring the blueprints? Molly hissed, her back pressed against the vinyl siding of the suburban split-level. I brought the schematics, Bill whispered back, triumphantly brandishing a crumpled sheet of paper. Molly snatched it, her nocturnal goggles clinking against her nose. Bill, this is a coloring page. And you didn't even stay inside the lines of the bouncy castle. The bouncy castle is the insertion point, Molly. It's the soft underbelly of the entire perimeter. It's a commercial-grade inflatable shaped like a dinosaur, Bill. If we pop it, thirty screaming six-year-olds will swarm us like piranhas. Did you pack the distraction? Of course I packed the distraction, Bill said, patting his tactical vest. A high-frequency whistle, guaranteed to disorient them. That’s for dogs. Children aren’t dogs. They're basically the same, Molly. They lick things, they don't listen, and they chase shiny objects. Just tell me we're on schedule. The cake cutting is at two-fifteen. That’s our window. We're golden. The cake is funfetti. It takes at least ten minutes to slice because of the structural integrity of the sprinkles. Once the sugar crash hits, we sweep the gift table. Did you locate the primary target? Molly asked, peering through the hedge. The big box with the shiny red bow. Jimmy’s parents bought him the Megalodon Power-Wheels. It's got working headlights and a cup holder. We can flip it on the dark web for three hundred caps, easy. What about the smaller boxes? Molly’s eyes gleamed in the shadow of her ski mask. I saw a Lego Death Star go in there. That's pure liquid gold. Excellent. Now, what's our exit strategy if the clown spots us? I’ve got juice boxes, Bill whispered, opening his jacket to reveal rows of apple juice pouches. If things get hairy, I puncture these and create a sticky-floor hazard. Standard riot control. You’re a genius, Bill. Sickening, but a genius. What’s your status on the getaway vehicle? 
The tricycle is stationed by the sandbox. I disabled the training wheels for maximum speed. Can it hold the weight of the Megalodon? It’s rated for eighty pounds. We might have to sacrifice the Lego set if we want to make the curb. Never, Molly snapped. We leave no plastic brick behind. Remember, we are professional thieves. We took down the retirement home raffle, we can handle a birthday party. Right. Respect the craft. On my mark, we breach the sandbox. Ready? Wait, Molly. I have to ask. What? Is it wrong to steal from a kid named Jimmy? It’s his special day. Bill, he’s turning seven. He doesn’t have bills. He doesn’t know the crushing weight of inflation. We are doing him a favor by teaching him about the harsh realities of the free market. You always know just what to say to make me feel better about grand larceny. That's why I'm the mastermind. Now, put your party hat on, pull down your visor, and let’s go get that Lego.
created_at: "2026-05-20T07:53:59.767Z" (5/20/2026, 7:53:59 AM)
model: "google/gemini-3.5-flash-20260519"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 700
moderation_latency: (null)
generation_time: 5354
tokens_prompt: 77
tokens_completion: 787
native_tokens_prompt: 67
native_tokens_completion: 717
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.0065535
router: (null)
provider_responses:
  0:
    endpoint_id: "bd50061c-e1d8-469e-96b1-d322e77378f8"
    id: "l2gNaqKSOZfF-sAPnq7JgQs"
    is_byok: false
    latency: 693
    model_permaslug: "google/gemini-3.5-flash-20260519"
    provider_name: "Google AI Studio"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779263639-Ly17509IOZKPNOcerj35"
session_id: (null)
api_type: "completions"
id: "gen-1779263639-a0uFLwAuBdW0A90ziMbW"
upstream_id: "l2gNaqKSOZfF-sAPnq7JgQs"
total_cost: 0.0065535
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
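The timing and token fields above can be combined into a rough throughput figure. The sketch below assumes that generation_time is reported in milliseconds (which is consistent with the 5.5s Duration shown in the run details) and that native_tokens_completion is the provider-side completion token count. The function name is hypothetical and is not part of NC Bench or of any provider API.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Approximate decode throughput from a token count and a duration in ms."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000)


# Values copied from the metadata above; the unit (ms) is an assumption.
native_tokens_completion = 717
generation_time = 5354

rate = tokens_per_second(native_tokens_completion, generation_time)
print(f"{rate:.1f} tokens/s")
```

Under those assumptions the run works out to roughly 134 completion tokens per second.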
Evaluation details
Matches word count: 100.00%
  Details: n/a
  neededClean: false
  words: 499
Dialogue to Total Word Ratio: 0.00%
  Details: Ratio: 0.00%, Deviation: 70.00%
  neededClean: false
  wordsTotal: 507
  wordsDialogue: 0
Overall: 49.9995%
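A wordsDialogue value of 0 is what a quote-based counter would report for a response with no quotation marks around the spoken lines, as in the response shown above. The following is a minimal sketch of such a counter, not the evaluator's actual implementation: it assumes dialogue is delimited by double quotes (straight or curly) and that words are whitespace-separated tokens. The function names and the 0.70 target default are illustrative.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a passage.

    Assumption: dialogue is whatever sits between double quotes.
    """
    # Treat curly double quotes the same as straight ones.
    normalized = text.replace("\u201c", '"').replace("\u201d", '"')
    total_words = len(normalized.split())
    quoted_spans = re.findall(r'"([^"]*)"', normalized)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation(ratio: float, target: float = 0.70) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


unquoted = "What do you mean? Molly hissed."
quoted = '"What do you mean?" Molly hissed.'

print(dialogue_ratio(unquoted))  # no quoted spans are found
print(dialogue_ratio(quoted))    # four of six words fall inside quotes
print(deviation(dialogue_ratio(unquoted)[2]))
```

With no quotation marks in the text, the ratio is 0 and the deviation from a 70% target is the full 70 points, which matches the Ratio and Deviation figures reported above.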