NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-20-01KS25R939921SXMHB5YQYPX3M
Run Details
Model:
Gemini 3.5 Flash (Reasoning, Minimal)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
5.4s
Cost:
$0.006319
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
35.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
17.5011%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars, her brow furrowing as she stared across the street. The target was a pastel-pink bouncy castle shaped like a defensive fortress. It was reinforced by fifteen sugar-rushed eight-year-olds and a very stressed mother wearing a birthday cone hat. "We need to talk about the bouncy castle," Molly whispered, her voice tight with professional anxiety. "It’s a tactical nightmare." Bill didn’t look up from his clipboard. He was busy cross-referencing a map of the backyard with a crayon drawing of a unicorn. "The castle is a distraction. The real challenge is the pinata. If that thing drops before we make our move, we lose the crowd completely." "I don't care about the pinata, Bill. I care about the gift table," Molly said, grabbing his arm. "Look at it. It’s loaded. There’s a motorized toy car in there that could fund our rent for three months." Bill sighed, tapping his pencil against his nose. "We can't just walk in and grab the loot. We need a cover." "I have a cover," Molly said, gesturing to her oversized, polka-dot overalls. "I’m Barnaby the Magic Clown." "You look like a sleep-deprived mime who escaped from a low-budget circus," Bill noted dryly. "And what about me? Why am I wearing this giant, suffocating plush dinosaur suit?" "Because you’re the distraction, Rex. When you start doing the Macarena, all eyes will be on you. While they gasping at your extinct moves, I slip behind the bouncy castle and bag the presents." Bill adjusted the heavy green tail of his costume, looking doubtful. "The Macarena? Molly, that dance is thirty years old. Kids today want TikTok trends. If I don't hit the Griddy, they'll see right through me." "Can you do the Griddy in a tail?" "I can try," Bill mumbled, taking a deep breath of the hot, polyester-scented air inside the mask. "But if my zipper gets stuck, you are buying the getaway ice cream." "Deal. Now, check your watch. Cake cutting is scheduled for exactly two-forty. That is our window." "Why the cake?" Bill asked. "Sugar crash, Bill. It's basic biology," Molly explained, checking her oversized yellow pockets. "Once the frosting hits their bloodstreams, we have exactly five minutes of chaotic paralysis. The adults will be trying to clean up, and the kids will be staring into the middle distance. We strike then." "What about the mother?" Bill pointed a green plush claw toward the patio. "She looks like she’s on her third pot of coffee. She’s twitchy." Molly smiled, pulling a shiny silver juice pouch from her pocket. "I’ve got apple juice. If she gets close, I offer her a drink and a fake balloon animal. She’ll be so grateful she’ll let me take the presents." Bill hoisted his heavy dinosaur tail over his arm. "Alright, Barnaby. Let’s go steal some childhood joy." "For the rent, Rex," Molly corrected. "For the rent."
created_at
"2026-05-20T07:49:09.994Z"
5/20/2026, 7:49:09 AM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
923
moderation_latency
(null)
generation_time
5074
tokens_prompt
77
tokens_completion
746
native_tokens_prompt
67
native_tokens_completion
691
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0063195
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"dmcNasqwEtSw1MkPxJLk8Aw"
is_byok
false
latency
923
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263350-Eq2UOgRbWGQ96MU4SVl4"
session_id
(null)
api_type
"completions"
id
"gen-1779263350-eoqqI9LEtyuZcUaVGQPr"
upstream_id
"dmcNasqwEtSw1MkPxJLk8Aw"
total_cost
0.0063195
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
35.00%
Matches word count
n/a
neededClean
false
words
482
0.00%
Dialogue to Total Word Ratio
Ratio: 65.65%, Deviation: 35.65%
neededClean
false
wordsTotal
492
wordsDialogue
323
17.5011%