NC Bench
Run: 2026-02-25-01KJ95KAT68V98W5SE0FK0HCV5
Run Details
Model: Gemini 3 Flash (Preview, Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 10.8s
Cost: $0.004293
Provider: Google AI Studio
Evaluation summary
Matches word count: 98.71%
Dialogue to Total Word Ratio: 0.00%
Overall: 49.3562%
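The Overall figure appears consistent with an unweighted mean of the two evaluator scores. That is an inference from the numbers shown, not something the page states. A minimal Python check, using the rounded 98.71% displayed above (so the result differs slightly from the displayed 49.3562%, which presumably uses the unrounded score):

```python
from statistics import mean

# Scores as displayed in the evaluation summary (rounded to two decimals).
scores = {
    "Matches word count": 98.71,
    "Dialogue to Total Word Ratio": 0.00,
}

# ASSUMPTION: Overall is the plain average of the evaluator scores.
overall = mean(list(scores.values()))
print(f"Overall (from rounded inputs): {overall:.3f}%")
```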
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slammed a grainy, over-saturated photograph onto the folding table. Look at it, Bill. That is the target. Bill squinted at the image, leaning so close his nose nearly touched the paper. It looks like a bouncy castle shaped like a dejected dragon. It is a fortress of inflatable polyethylene, Molly corrected, tapping a pencil against the table. And inside that fortress, in the northwest corner of the sunken living room, lies the Mother Lode. You mean the pile of boxes wrapped in dinosaur paper? I mean the haul of the century. Six Lego Star Wars sets, three remote-controlled quadcopters, and at least four envelopes containing high-denominational crisp twenties from various grandmothers. Bill whistled through his teeth. That is a lot of plastic. But what about the perimeter? The perimeter is guarded by a tier-one security force, Molly said. Specifically, twelve sugar-addled seven-year-olds and a golden retriever with a history of nipping. We enter through the kitchen window at 14:00 hours, draped in the official uniform of the catering company. Do I have to wear the hairnet again? Bill asked. It makes my forehead break out. The hairnet is non-negotiable, Bill. It provides camouflage and plausible deniability. If someone sees us, we are just the people delivering the organic fruit cups. Once the cake ceremony begins, the parents will be distracted by the singing. That is our window. And the getaway? The bouncy castle, Molly said, a predatory glint in her eye. I’ve rigged the air intake. When I pull this cord, the dragon collapses. In the ensuing chaos of screaming children and sagging vinyl, we grab the gift bags and exit through the hedge side of the yard. What if the clown shows up early? Bill asked, his voice trembling slightly. You know how I feel about the stilts. They make him too tall for a normal human. The clown is a freelancer named Barnaby, Molly dismissed with a wave of her hand. I’ve already intercepted his GPS. 
He’s going to be redirected to a retirement home three towns over. He will be making balloon poodles for Octogenarians while we are counting our bounty. Bill nodded slowly, tracing the map of the living room with a meaty finger. What about the gift receipts? Do we take those? Always take the receipts, Bill. We are professionals, not savages. If we can’t flip the toys on the street, we return them to the big-box stores for store credit. It’s cleaner that way. I just feel bad about the cupcakes, Bill muttered. I saw the mother posting on social media. They’re gluten-free. That’s why we’re doing them a favor, Molly snapped. Nobody wants a gluten-free cupcake. We’re liberating those children from a lifetime of dietary disappointment. Bill stood up straighter, adjusting his imaginary tie. You’re right. We’re like Robin Hood, but we keep the stuff. Exactly, Molly said, handing him a pair of tactical party hats. Now, go sharpen your cake server. We move at noon.
created_at: "2026-02-25T01:11:16.047Z" (2/25/2026, 1:11:16 AM)
model: "google/gemini-3-flash-preview-20251217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1595
moderation_latency: (null)
generation_time: 10617
tokens_prompt: 77
tokens_completion: 974
native_tokens_prompt: 67
native_tokens_completion: 1420
native_tokens_completion_images: 0
native_tokens_reasoning: 745
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0042935
router: (null)
provider_responses: 0
endpoint_id: "6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok: false
latency: 1595
model_permaslug: "google/gemini-3-flash-preview-20251217"
provider_name: "Google AI Studio"
status: 200
api_type: "completions"
id: "gen-1771981876-HyFEdYsPFoyPkEKZKe8R"
upstream_id: (null)
total_cost: 0.0042935
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
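The token counts and timings in the metadata can be cross-checked against the reported cost. The sketch below is a rough check, not the billing logic of any provider. The per-million-token rates are assumptions that do not appear on this page, and it further assumes that latency and generation_time are in milliseconds and that reasoning tokens are counted inside native_tokens_completion and billed as output.

```python
# Values copied from the metadata fields of this run.
native_tokens_prompt = 67
native_tokens_completion = 1420
native_tokens_reasoning = 745
generation_time_ms = 10617  # ASSUMPTION: unit is milliseconds

# ASSUMPTION: USD per million tokens; these rates are not stated on the page.
INPUT_USD_PER_MTOK = 0.50
OUTPUT_USD_PER_MTOK = 3.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    return (
        prompt_tokens * INPUT_USD_PER_MTOK
        + completion_tokens * OUTPUT_USD_PER_MTOK
    ) / 1_000_000


cost = estimate_cost(native_tokens_prompt, native_tokens_completion)

# Output tokens generated per second of generation time.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

# Tokens left for the visible answer if reasoning is part of the completion count.
visible_tokens = native_tokens_completion - native_tokens_reasoning

print(f"estimated cost: ${cost:.7f}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
print(f"visible completion tokens: {visible_tokens}")
```

Under these assumed rates the estimate lines up with the total_cost value of 0.0042935 reported above.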
Evaluation details
Matches word count: 98.71%
Details: n/a
neededClean: false
words: 494

Dialogue to Total Word Ratio: 0.00%
Details: Ratio: 0.00%, Deviation: 70.00%
neededClean: false
wordsTotal: 505
wordsDialogue: 0

Overall: 49.3562%
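The 0.00% dialogue ratio follows from wordsDialogue being 0 out of 505 words: as shown on this page, the response puts no quotation marks around its spoken lines. How the evaluator detects dialogue is not stated here. The sketch below assumes dialogue means text inside straight or curly double quotation marks and that words are split on whitespace; the function name and the sample strings are illustrative only.

```python
import re

TARGET_RATIO = 0.70  # from the scenario: 70% of words should be dialogue


def dialogue_stats(text: str, target: float = TARGET_RATIO) -> dict:
    """Count words inside double quotation marks versus all words."""
    # ASSUMPTION: dialogue is any span enclosed in "..." or in curly quotes.
    spans = re.findall(r'"([^"]*)"|“([^”]*)”', text)
    words_dialogue = sum(len((a or b).split()) for a, b in spans)
    words_total = len(text.split())
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": abs(target - ratio),
    }


# Opening lines of the response, without and with quotation marks.
unquoted = "Look at it, Bill. That is the target. Bill squinted at the image."
quoted = '"Look at it, Bill. That is the target." Bill squinted at the image.'

print(dialogue_stats(unquoted))
print(dialogue_stats(quoted))
```

With this counting rule, unquoted speech contributes nothing to wordsDialogue, so the deviation equals the full 70% target, which matches the figures reported above.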