NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94VQBV8SGM634V22YXB94P
Run Details
Model:
Gemini 3 Flash (Preview, Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
46.8s
Cost:
$0.025809
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
32.77%
Dialogue to Total Word Ratio
(details)
16.3848%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her night-vision goggles, even though the midday sun blazed through the van’s windshield. Bill sat beside her, panting as he struggled to squeeze his burly frame into a neon-yellow clown suit. The interior of the stolen delivery truck smelled of stale popcorn and professional desperation. Molly checked the monitor displaying the backyard feed. The suburban street was quiet, save for the faint sound of a distant lawnmower and a barking neighborhood dog. "Remind me why I am the clown and you are the inspector?" Bill asked, struggling to snap an oversized red nose into place. He looked in the rearview mirror and grimaced at his own reflection. "Because I have the clipboard, Bill. A clipboard equals absolute authority. Nobody questions a woman in a high-visibility vest clutching a laminated safety checklist." Molly tapped her pen against the heavy aluminum board. She peeked through the tinted windows at the perfectly manicured lawn of the suburban house. "I feel like a neon target. These shoes are too small. My toes are currently performing a tragic play about claustrophobia." "Focus on the loot. The target is the mahogany gift table in the sunroom. Intelligence suggests premium LEGO sets and dozens of gift cards." Bill tugged at his spandex leggings, which were dangerously thin. He let out a long, theatrical sigh as he checked his equipment. "What about the small, sticky kids?" "The toddlers are neutralized by the arrival of pepperoni pizza. We have a seven-minute window before the singalong begins. That is our window." Bill checked his tactical wristwatch, which was partially obscured by neon-green ruffled sleeves. He peered nervously at the house's front door. "Balloons are a tactical problem, Molly. They are static-charged. If my sequins brush against them, the pop will alert the CrossFit mother." "That is why you have the distraction kit," Molly said, pointing toward a burlap sack overflowing with high-grade glitter tucked under the passenger seat. She grinned with predatory delight. "Glitter? That is essentially biological warfare. We agreed on smoke bombs." "Glitter is a tactical concealment cloud. If things go south, you deploy the sparkles and disappear into the chaos. Do you have the bubble gun?" Bill checked the safety on his plastic weapon and then tucked it into a hidden holster sewn into his oversized polka-dot waistband. "Loaded. But I still think the bouncy castle remains the primary threat. Those children are like piranhas in party hats." "Just keep honking that handheld horn. It creates a Pavlovian response of deep confusion in the adolescent mind." Bill checked his face paint one last time. He adjusted the tiny hat perched precariously on his bald, sweaty head. "They ask too many piercing questions about my makeup and my life choices." "This is the big one, Bill. Think of the resale value on those limited-edition cards. Focus on the objective and the prize money." "Fine. I mean it, Molly. But if I get hit by a stray juice box, I am calling for immediate extraction." "Check your rainbow wig. We move in sixty seconds." "The cake is chocolate. I am not leaving without a slice." "Stay professional, Bill. We are here for the toys, not the ganache." Molly grabbed her clipboard and slid the door open with a quiet metallic click, leading them out into the dangerous suburban heat.
created_at
"2026-02-25T00:58:22.465Z"
2/25/2026, 12:58:22 AM
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2236
moderation_latency
(null)
generation_time
46698
tokens_prompt
77
tokens_completion
3946
native_tokens_prompt
67
native_tokens_completion
8592
native_tokens_completion_images
0
native_tokens_reasoning
7868
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0258095
router
(null)
provider_responses
0
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok
false
latency
2236
model_permaslug
"google/gemini-3-flash-preview-20251217"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981102-FRYA0ZtqM2DSwqleMsHO"
upstream_id
(null)
total_cost
0.0258095
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
552
32.77%
Dialogue to Total Word Ratio
Ratio: 54.72%, Deviation: 4.72%
neededClean
false
wordsTotal
561
wordsDialogue
307
16.3848%