NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94K25B50QS5N179BVBJVRN
Run Details
Model:
Gemini 3 Flash (Preview, Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
48.2s
Cost:
$0.025996
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
98.28%
Dialogue to Total Word Ratio
(details)
70.8316%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The interior of the 2004 Ford Windstar smelled of stale corn chips and the bitter scent of Bill’s cheap cologne. Molly peered through a pair of high-powered binoculars, her eyes locked on a suburban backyard that looked like a primary-colored war zone. A massive inflatable castle wheezed in the corner, its turrets swaying rhythmically with the frantic jumping of ten over-caffeinated first-graders. The humidity of the July afternoon was pressing against the windshield, blurring the edges of the pink and blue bunting. Across the street, the target sat unguarded: a sagging folding table burdened with a mountain of brightly wrapped boxes. "Target acquired," Molly said. "The high-value assets are clustered near the sugar-free punch." Bill shifted in the driver’s seat, his large frame making the van’s suspension groan. He was clutching a juice box with a sense of profound existential dread. "Are we really doing this? Stealing from a six-year-old feels like a professional low." "It’s not stealing. It’s a forced redistribution of plastic. We’re doing a public service." Molly didn't blink. She pulled a map from the glove box. It wasn't a standard blueprint; it was a crude sketch drawn in neon highlighter on the back of a pizza menu. She pointed to a jagged line representing the perimeter fence. "The entry point is behind the hydrangeas. The clown is the distraction. That’s our window." "What if the dog sees us? Golden Retrievers are notorious for their snitching." Molly adjusted her earpiece, her expression cold and professional. The neighbor’s lawn sprinklers hissed in the distance, a rhythmic sound that felt like a ticking clock over the muffled sounds of a pop song playing from the patio. "The dog has been neutralized with organic jerky. Focus. I’m talking high-end LEGO sets." Bill sighed, pulling a lime-green ski mask over his head. He adjusted the eye holes, which were slightly misaligned, giving him the appearance of a very confused forest gnome. "If I get tackled by a dad in cargo shorts, I’m blaming you." "Just make sure you grab the box with the silver ribbon. It’s the gaming console." Outside, a chorus of screaming erupted as a piñata was struck, showering the dry lawn with cheap taffy and plastic rings. Molly checked her watch. The chaotic physics of thirty children scrambling for sugar was the perfect smokescreen for their tactical insertion. "Go now. Watch out for the LEGO landmines. They’re deadly. Watch your shins." "I'll be like a shadow. A shadow that really wants a piece of cake." "The cake is for winners, Bill. Just focus on the mission and get out." He stepped out of the van, his boots thudding softly against the hot asphalt. He moved with a surprising lack of grace, his bulky frame silhouetted against the white picket fence. Molly stayed behind the wheel, her fingers tapping a rhythm on the dashboard. She watched as Bill disappeared into the thicket of dusty hydrangeas, his green mask blending poorly with the floral arrangement. The greatest heist of the suburban summer had officially begun, fueled by greed, adrenaline, and a very specific map drawn in crayon.
created_at
"2026-02-25T00:53:38.609Z"
2/25/2026, 12:53:38 AM
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1693
moderation_latency
(null)
generation_time
48145
tokens_prompt
77
tokens_completion
3674
native_tokens_prompt
67
native_tokens_completion
8654
native_tokens_completion_images
0
native_tokens_reasoning
7947
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0259955
router
(null)
provider_responses
0
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok
false
latency
1693
model_permaslug
"google/gemini-3-flash-preview-20251217"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771980818-jopYCMDT3WpPxxH4rDF1"
upstream_id
(null)
total_cost
0.0259955
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
517
98.28%
Dialogue to Total Word Ratio
Ratio: 29.41%, Deviation: 0.59%
neededClean
false
wordsTotal
527
wordsDialogue
155
70.8316%