NC Bench
Run: 2026-02-25-01KJ95VVEPH367J4FVYEFA6DAX
Run Details
Model: Gemini 3 Flash (Preview, Reasoning)
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Duration: 29.0s
Cost: $0.016827
Provider: Google AI Studio
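The duration and cost above can be cross-checked against the token counts recorded in this run's metadata (native_tokens_prompt, native_tokens_completion, native_tokens_reasoning, generation_time, total_cost). The sketch below is only a consistency check, not the billing logic of the provider. The per-million-token rates are assumptions that do not appear on this page, and treating reasoning tokens as part of the completion count is likewise an assumption.

```python
# Values copied from this run's metadata fields.
native_tokens_prompt = 67
native_tokens_completion = 5598
native_tokens_reasoning = 5252
generation_time_ms = 28997
total_cost = 0.0168275

# ASSUMED rates in USD per million tokens (not stated anywhere on this page).
ASSUMED_INPUT_RATE = 0.50
ASSUMED_OUTPUT_RATE = 3.00

# Estimated cost under the assumed rates, billing all completion tokens
# (reasoning included) at the output rate.
estimated_cost = (
    native_tokens_prompt * ASSUMED_INPUT_RATE
    + native_tokens_completion * ASSUMED_OUTPUT_RATE
) / 1_000_000

# Assumption: reasoning tokens are counted inside native_tokens_completion,
# so the remainder is the visible answer.
visible_tokens = native_tokens_completion - native_tokens_reasoning
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Completion throughput over the reported generation time.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"estimated cost:  ${estimated_cost:.7f} (reported ${total_cost:.7f})")
print(f"visible tokens:  {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput:      {tokens_per_second:.0f} tokens/s")
```

If the assumed rates are right, the estimate and the reported total agree, which would suggest that almost all of the cost of this run comes from reasoning tokens rather than from the 200-word scene itself.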
Evaluation summary
Score     Evaluator
99.38%    Matches word count
99.66%    Dialogue to Total Word Ratio
99.5181%  Overall
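The page does not state how the overall score is derived from the two evaluator scores. A minimal sketch, assuming it is an unweighted mean of the per-evaluator scores, reproduces the displayed value to within the rounding of the inputs:

```python
from statistics import mean

# Per-evaluator scores as displayed in the summary (rounded to two decimals).
scores = {
    "Matches word count": 99.38,
    "Dialogue to Total Word Ratio": 99.66,
}

# ASSUMPTION: the overall score is the unweighted mean of the evaluator scores.
overall = mean(scores.values())

# The page reports 99.5181%; a small gap is expected because the inputs
# used here are already rounded.
reported_overall = 99.5181
print(f"mean of displayed scores: {overall:.2f}%")
print(f"difference from reported: {abs(overall - reported_overall):.4f}")
```

The small residual is consistent with the site averaging unrounded scores, but a weighted scheme that happens to give a similar number cannot be ruled out from this single run.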
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Did you pack the smoke bombs?" Bill whispered, adjusting his foam tail. "They're bubble machines, Bill. We aren't raiding a cartel." "Bubbles are tactical. They sting the eyes." "It’s a six-year-old’s birthday. We just need to blend in." "I'm wearing a sequined dinosaur suit. Is this blending?" "You look like a professional mascot. Hand me the blueprints." "It’s a crayon drawing of a bouncy castle, Molly." "Precision matters. The loot is in the Southwest quadrant, behind the organic juice boxes." "What if there’s a clown?" "We neutralize him with the high-velocity Silly String." "I don't like his baggy pants. He could be hiding a piece." "A piece of what? Pound cake? Focus. The haul is massive. There is a vintage-edition Lego set." "And the remote-control helicopter?" "That's mine. You get the neon Play-Doh." "Deal. Do I deploy the glitter grenades on entry?" "Only if the toddlers get unruly." "They’re always unruly," Bill muttered, checking his watch. "They’re caffeinated and emotionally unstable." "That's why we brought the juice bribes. Check your perimeter." "Perimeter is clear of parents. Let’s do this. For the plastic gold." "Masks on, Bill." "This dinosaur snout is incredibly itchy." "Work through the pain," Molly said, opening the gate. "Think of the Legos."
created_at
"2026-02-25T01:15:55.229Z"
2/25/2026, 1:15:55 AM
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1773
moderation_latency
(null)
generation_time
28997
tokens_prompt
77
tokens_completion
1932
native_tokens_prompt
67
native_tokens_completion
5598
native_tokens_completion_images
0
native_tokens_reasoning
5252
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0168275
router
(null)
provider_responses
0
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok
false
latency
1773
model_permaslug
"google/gemini-3-flash-preview-20251217"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771982155-x1JUOLRA5EzZJCsAMrdY"
upstream_id
(null)
total_cost
0.0168275
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result    Evaluator                     Details                          Meta Data
99.38%    Matches word count            n/a                              neededClean: false; words: 205
99.66%    Dialogue to Total Word Ratio  Ratio: 92.42%, Deviation: 2.42%  neededClean: false; wordsTotal: 211; wordsDialogue: 195
99.5181%  Overall
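The reported ratio and deviation follow directly from the wordsDialogue and wordsTotal counts and the 90% target in the scenario. The sketch below recomputes them and also shows one plausible way to count dialogue words. The helper dialogue_ratio is hypothetical: it assumes whitespace tokenization and straight double quotes as dialogue delimiters, which may not match the evaluator's actual rules (note that the two evaluators report different totals, 205 and 211, for the same response). How the deviation is mapped to the 99.66% score is not stated on the page and is not modelled here.

```python
import re

def dialogue_ratio(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words).

    Hypothetical helper: words are whitespace-separated tokens, and dialogue
    is any text between straight double quotes.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    total_words = len(text.split())
    return dialogue_words, total_words

# Recompute the reported figures from the evaluator's own counts.
words_dialogue = 195
words_total = 211
target_pct = 90.0

ratio_pct = 100 * words_dialogue / words_total
deviation_pct = abs(ratio_pct - target_pct)
print(f"Ratio: {ratio_pct:.2f}%, Deviation: {deviation_pct:.2f}%")

# Small illustration of the helper on the last line of the model response.
sample = '"Work through the pain," Molly said, opening the gate. "Think of the Legos."'
sample_dialogue, sample_total = dialogue_ratio(sample)
print(sample_dialogue, sample_total)
```

On the sample line the helper counts 8 dialogue words out of 13, which illustrates why a single dialogue tag with trailing action can pull a short passage well below the 90% target.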