NC Bench
Run: 2026-02-25-01KJ94PQSBRNERRNDQRVX8XBDX
Run Details
Model: Gemini 3 Flash (Preview, Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 19.7s
Cost: $0.010315
Provider: Google AI Studio
Evaluation summary
| Score | Evaluator |
| --- | --- |
| 0.01% | Matches word count |
| 0.00% | Dialogue to Total Word Ratio |
| 0.0049% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled of stale coffee and desperation. Molly tapped a finger against the dashboard, her eyes fixed on the bounce house currently vibrating with the force of twelve hyperactive first-graders. To any other observer, it was a Saturday afternoon in the suburbs. To Molly, it was a high-security vault currently being guarded by a distracted clown and a golden retriever that had fallen asleep in its own water bowl. Look at them, Bill. No discipline. No spatial awareness. They are basically tiny, sticky landmines. Bill sighed, digging through a grease-stained bag for a jelly-filled donut. It is a birthday party, Molly. Not a cartel meeting. We are here for the LEGO Star Destroyer and the remote-control monster truck. Let us keep our eyes on the prize. That truck has independent suspension, Bill. Do you have any idea what that fetches on the underground toddler market? It is two months of rent if we flip it to the right nanny. Molly adjusted the focus on her binoculars, narrowing her gaze on the sunroom. The cake is being served in ten minutes. That is our window. Sugar crashes are predictable. At T-minus fifteen, the energy levels will spike, creating a chaotic auditory screen. At T-minus thirty, the weeping begins. That is when we move in. And the clown? Bill asked, wiping powdered sugar off his chin. You did not mention the guy in the oversized shoes. He looks shifty. I think he is packing. He is packing a balloon animal pump, you idiot. But he is a variable. He has distracted the parents, which is good, but he keeps the perimeter loose. Bill shifted in his polyester suit, the fabric groaning under the strain. I still do not like the plan. Stealing from a kid named Timmy feels like a new low, even for us. Remember when we used to hit art galleries? Galleries have laser grids and silent alarms. This house has a screen door with a broken latch and a trampoline. It is a lateral move, Bill. Focus. Molly pulled out a blueprint drawn in crayon. 
She had intercepted it from the kitchen table during her flower delivery reconnaissance the day before. The gift table is positioned in the northwest corner of the sunroom. It is a logistical bottleneck. We enter through the laundry room, neutralize the juice box station to prevent any sticky footprint trails, and extract the haul into the hamper. What if they have the party favors already handed out? Do not worry about the favors. Those are just plastic whistles and bubble wands. We want the heavy cardboard. We want the stuff wrapped in glitter paper. High-density loot only. Bill pulled a small glass vial from his pocket. I have the distraction ready. High-potency bubble solution. One blow and the backyard becomes a shimmering field of tactical confusion. They will be too busy popping spheres to notice us. Good. Remember, if a toddler makes eye contact, do not engage. They smell fear and fruit snacks. Got it. No eye contact, no mercy, and absolutely no raisins. Molly checked her watch, her face hardening into a mask of professional heist-readiness. It is showtime. Put on the hairnet.
created_at
"2026-02-25T00:55:39.056Z"
2/25/2026, 12:55:39 AM
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1795
moderation_latency
(null)
generation_time
19688
tokens_prompt
77
tokens_completion
2017
native_tokens_prompt
67
native_tokens_completion
3427
native_tokens_completion_images
0
native_tokens_reasoning
2754
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0103145
router
(null)
provider_responses
0
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok
false
latency
1795
model_permaslug
"google/gemini-3-flash-preview-20251217"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771980939-UgW8QWSCOZtoMPTo2vmH"
upstream_id
(null)
total_cost
0.0103145
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
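The reported total_cost can be roughly reproduced from the native token counts above. The sketch below is only an illustration: the per-million-token prices are assumptions that do not appear in this run record, and the function name `estimate_cost_usd` is hypothetical. With these assumed prices, the arithmetic happens to land on the reported 0.0103145.

```python
# Assumed prices in USD per one million tokens (NOT taken from this run record).
ASSUMED_INPUT_USD_PER_MILLION = 0.50
ASSUMED_OUTPUT_USD_PER_MILLION = 3.00


def estimate_cost_usd(
    prompt_tokens: int,
    completion_tokens: int,
    input_price: float = ASSUMED_INPUT_USD_PER_MILLION,
    output_price: float = ASSUMED_OUTPUT_USD_PER_MILLION,
) -> float:
    """Estimate request cost from token counts and per-million-token prices."""
    return (prompt_tokens * input_price + completion_tokens * output_price) / 1_000_000


# Values copied from the meta block: native_tokens_prompt and native_tokens_completion.
native_tokens_prompt = 67
native_tokens_completion = 3427

estimated = estimate_cost_usd(native_tokens_prompt, native_tokens_completion)
print(f"estimated cost: ${estimated:.7f}")
```

Under these assumptions nearly all of the cost comes from completion tokens, and native_tokens_reasoning (2754) is large relative to native_tokens_completion (3427).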
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 0.01% | Matches word count | n/a | neededClean: false, words: 531 |
| 0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 30.00% | neededClean: false, wordsTotal: 539, wordsDialogue: 0 |
| 0.0049% | Overall | | |
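The dialogue evaluator reports wordsDialogue of 0 out of wordsTotal 539, and the model response contains no quotation marks at all. A minimal sketch of how such a ratio might be computed follows, assuming dialogue is identified as text enclosed in double quotes. The evaluator's real method and tokenization are not shown in this record, so the function names and the whitespace-based word split are hypothetical.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio), treating quoted spans as dialogue."""
    # Normalize curly double quotes so both quote styles are handled alike.
    normalized = text.replace("\u201c", '"').replace("\u201d", '"')
    total_words = len(normalized.split())
    # Each match is the text between one pair of double quotes.
    quoted_spans = re.findall(r'"([^"]*)"', normalized)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio: float, target: float = 0.30) -> float:
    """Absolute distance between the measured ratio and the requested share."""
    return abs(ratio - target)


# Unquoted speech, as in the model response, yields no dialogue words.
unquoted = "Look at them, Bill. No discipline."
# The same kind of line with quotation marks is counted as dialogue.
quoted = 'Molly sighed. "Put on the hairnet," she said.'

for sample in (unquoted, quoted):
    words_dialogue, words_total, ratio = dialogue_ratio(sample)
    print(words_dialogue, words_total, f"{ratio:.2%}", f"{deviation_from_target(ratio):.2%}")
```

Under this assumption, a scene whose speech is never wrapped in quotation marks scores a 0.00% ratio and a 30.00% deviation from the requested 30%, which is consistent with the figures in the table above.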