NC Bench
Run: 2026-02-25-01KJ956N9WAF77P3MAM2NDFD60
Run Details
Model: Gemini 3 Flash (Preview, Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 13.0s
Cost: $0.005760
Provider: Google AI Studio
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Look at the blueprints, Bill. I spent three hours scouting the perimeter. Bill squinted at the crumpled paper. That is a Denny’s placemat, Molly. And you’ve drawn the X in maple syrup. It is a topographical map of the Johnson estate. Here, by the sandbox, is the primary insertion point. Is the fence electrified? Bill asked, leaning in. It is cedar, Bill. But it has those pointy decorative bits on top. Very intimidating. Now, the prize is in the sunroom. It is a mountain of wrapped boxes. High-end stuff. We are talking premium Lego sets, tablets, and maybe even one of those miniature electric Porsches. I have always wanted a Porsche, Bill whispered. You are six-foot-four, you would look like a Shriner. Focus. At fourteen-hundred hours, the clown arrives. Should we be worried about the clown? I have heard they have hidden pockets. Are we talking a professional or a local hire? The clown is our window. He is the tactical distraction. While he is making a balloon giraffe that looks like a mutated sausage, we slip through the sliding glass door. What about the heavy security? You mean the twelve toddlers in sticky socks? They are unpredictable, Molly. One of them sees us, lets out a high-pitched squeal, and the jig is up. They are like living sirens. Sentient, jam-covered alarms. We use the tactical bribes, Molly said, patting her tactical vest. I have got six bags of organic, gluten-free fruit snacks. It is better than gold in that neighborhood. Smart. Very professional. And the parents? Chardonnay, Bill. They will be on the patio, discussing private school tuition and juice cleanses. They won't see a thing as long as the caterer keeps the shrimp cocktail coming. Wait, is there shrimp? Because my allergy— You are wearing the hazmat suit, remember? I thought that was for the ball pit. I am not going in the ball pit without protection. Do you know what kind of bacteria thrives in those plastic spheres? It is a petri dish of misery and forgotten dreams. 
The suit is for the theme, Bill. We are the Pollution Patrol. If anyone asks, we are there to inspect the inflatable slide for toxic fumes. Do I get a badge? I made you one out of a cereal box. Just pin it on your chest. Now, once we bag the goods, we head for the hedge. Stay low and do not trip over the vibranium-shield-carrying kindergartner. The what? The kid dressed as Captain America. He has a mean throw. I can handle a kid, Molly. That is what you said about the goat at the petting zoo heist, and you still have the bite marks. That goat was an outlier. Okay, so we bag the loot, hit the hedge, then what? We vanish into the suburbs. We will be halfway to the pawn shop before the birthday boy even blows out his candles. Can we stay for the cake? Bill asked. I heard it is red velvet. No cake, Bill. We are thieves, not guests. But it has the cream cheese frosting, Molly. One slice, she sighed. But if you get crumbs on the evidence, I am leaving you in the ball pit.
created_at: "2026-02-25T01:04:20.8Z" (2/25/2026, 1:04:20 AM)
model: "google/gemini-3-flash-preview-20251217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1421
moderation_latency: (null)
generation_time: 12961
tokens_prompt: 77
tokens_completion: 1240
native_tokens_prompt: 67
native_tokens_completion: 1909
native_tokens_completion_images: 0
native_tokens_reasoning: 1191
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0057605
router: (null)
provider_responses
0
endpoint_id: "6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok: false
latency: 1421
model_permaslug: "google/gemini-3-flash-preview-20251217"
provider_name: "Google AI Studio"
status: 200
api_type: "completions"
id: "gen-1771981460-WN6Mx0yoqiV5xbmk7YYJ"
upstream_id: (null)
total_cost: 0.0057605
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Matches word count
  Result: 0.00%
  Details: n/a
  Meta Data: neededClean: false, words: 535
Dialogue to Total Word Ratio
  Result: 0.00%
  Details: Ratio: 0.00%, Deviation: 70.00%
  Meta Data: neededClean: false, wordsTotal: 544, wordsDialogue: 0
Overall: 0.0000%
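The ratio and deviation reported above can be reproduced with a short sketch. This page does not show the benchmark's actual evaluator, so everything here is an assumption: the function name `dialogue_stats`, the rule that dialogue means words inside straight or curly double quotes, and whitespace-based word splitting (which may not match either reported word count, 535 or 544). Under that assumed rule, a response with no quotation marks, like the one on this page, yields `wordsDialogue` of 0 and a deviation of 70 percentage points from the 70% target.

```python
import re

# Assumption: dialogue is any text inside straight ("...") or curly (“...”)
# double quotes. The real evaluator's rule is not shown on this page.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def dialogue_stats(text: str, target_ratio: float = 0.70) -> dict:
    """Return word counts, dialogue ratio, and deviation from the target."""
    # Assumption: words are whitespace-separated tokens.
    words_total = len(text.split())
    words_dialogue = sum(
        len((m.group(1) or m.group(2) or "").split())
        for m in QUOTED.finditer(text)
    )
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target_ratio),
    }


if __name__ == "__main__":
    unquoted = "Look at the blueprints, Bill. Bill squinted at the crumpled paper."
    quoted = '"Look at the blueprints, Bill," Molly said.'
    print(dialogue_stats(unquoted))
    print(dialogue_stats(quoted))
```

With the quoted sample, five of seven words fall inside quotation marks, while the unquoted sample counts no dialogue at all. How the benchmark maps a deviation to a percentage score is not stated on this page, so the sketch stops at the deviation.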