NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95T0HFBAFRF6E5H5ZVA8FC
Run Details
Model:
Gemini 3 Flash (Preview, Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
51.9s
Cost:
$0.026652
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly lay flat against the freshly manicured lawn, her nose buried in a patch of invasive clover. The target was fifty feet away, a white wooden gazebo currently overflowing with brightly wrapped boxes that promised a bounty of high-end toy electronics and limited-edition action figures. She adjusted her tactical goggles, which were actually modified swimming eyewear coated in matte black spray paint. They provided nearly zero visibility in the bright afternoon sun, but she felt they heightened her other senses through sheer sensory deprivation. They were currently positioned behind a decorative stone frog that Bill had nicknamed Gerald. Is the bouncy castle at full pressure? Molly asked. Bill, crouched beside her in a matching turtleneck that was far too tight for a humid July afternoon, checked his plastic stopwatch with a grim expression. He wiped a bead of sweat from his forehead, trying not to smear the charred cork camouflage he had applied to his cheeks before they exited the van. It is a structural hazard, Bill replied. The air pressure is high. The children are ready for cake. They are like small, sugar-fueled sticks of dynamite. Molly looked through her binoculars, which were two toilet paper rolls taped together with electrical tape. The suburban landscape was a minefield of colorful plastic hazards. A stray tricycle sat overturned like a dead beetle in their path, and an automatic sprinkler system threatened to ruin their knitted disguises at any moment. She could feel the heat radiating off the brick patio, shimmering like a desert mirage. Molly checked her utility belt, which was actually a carpenter apron she had dyed dark navy. It held several essential items: a juice-box straw for silent breathing, sticky-tack for silencing squeaky hinges, and a map of the backyard drawn in scented markers. The target is the gazebo, Molly said. Aim for the LEGOs. No sweaters. We are thieves, Bill, not a thrift store donation bin. Bill shifted his weight, his knees cracking loudly like dry twigs in the quiet afternoon air. He clutched a mesh bag filled with oversized marbles, his primary weapon for the retreat. He looked toward the patio where a man in a polka-dot suit was struggling to inflate a long, skinny balloon for a line of impatient toddlers. The clown makes me nervous, Bill whispered. He has been watching this hydrangea bush for five minutes with deep professional suspicion. He is not watching us, Molly insisted. He is a distraction specialist. He is waiting for the balloon animal quota to be met. It is psychological warfare. A golden retriever wandered past their hiding spot, sniffing at the mulch with casual indifference. Molly held her breath, her fingers gripping the dirt. The animal was a professional, much like themselves, and recognized the code of the street. The dog paused, looked directly at the two adults in wool masks laying in the garden, and then decided a discarded cocktail sausage near the grill was much more enticing. I still think the balaclavas are a mistake for a garden party, Bill said, scratching his nose. We stick out. Everyone else here is wearing linen. Professionals do not show their faces, Molly said. If the toddler sees us, he will testify. Children have terrifyingly good memories. Do you have the tool? I have the safety scissors, Bill said. Good, Molly said. When the singing starts, we move.
created_at
"2026-02-25T01:14:54.901Z"
2/25/2026, 1:14:54 AM
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1767
moderation_latency
(null)
generation_time
51870
tokens_prompt
77
tokens_completion
4064
native_tokens_prompt
67
native_tokens_completion
8873
native_tokens_completion_images
0
native_tokens_reasoning
8188
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0266525
router
(null)
provider_responses
0
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok
false
latency
1767
model_permaslug
"google/gemini-3-flash-preview-20251217"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771982094-e0aYy8fgkuMAodM9Ryup"
upstream_id
(null)
total_cost
0.0266525
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
561
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
567
wordsDialogue
0
0.0000%