NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95DSE46VCYF44CSEWZNPK2
Run Details
Model:
Gemini 3 Flash (Preview, Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
23.7s
Cost:
$0.011265
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0152%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the binoculars while crouching behind a decorative hedge. Check the side gate, Bill. It is locked, Molly. I already checked it twice. Check it a third time. Diligence is the mother of not going to jail. I checked it. It is locked. Also, that toddler in the Spider-Man suit is staring directly into my soul. Ignore the child. He has been eating blue frosting for twenty minutes. His brain is a swamp. Focus on the objective. Do you see the stack near the hibiscus? The one with the shiny blue paper? That is the jackpot. That giant box is definitely a motorized dirt bike. A dirt bike for a six-year-old? Rich parents are overcompensating, Bill. It is the backbone of our local economy. I feel weird about this one. They have a bouncy castle. I always wanted to go in one of those when I was a kid. If we finish the job in under four minutes, I will allow you thirty seconds of airtime. Make it a full minute. Forty-five seconds. And you have to take your boots off. I will not have you popping the bounty. Fine. What is the plan for the clown? The clown is a freelancer. He does not care about site security. I will distract him with a balloon animal emergency. What exactly is a balloon animal emergency? I will tell him there is a giraffe with a broken neck in the driveway. While he is grieving, you slide the gift table toward the van. It is a heavy table, Molly. It is solid oak. That is why you spent all that money on the gym membership last year. Put those deltoids to work. I think the dog saw me. Bill ducked his head as a golden retriever barked from the porch. The golden retriever? He is growling. It is a very polite, suburban growl, but it is definitely a threat. Throw him the vegan jerky. The stuff you bought at the health food store? He is a high-end dog, Bill. He probably has a gluten sensitivity and a therapist. He will love it. Are we really stealing from a kid named Tyler? His name is on the oversized banner. Tyler has too much. We are just rebalancing the cosmic scales. By selling his LEGO sets on the internet? It is a form of aggressive wealth redistribution. Now, check your watch. It says twelve-thirty. Cakes are at twelve-forty-five. That is our window. What if there is a real magician? If a real magician shows up, he might see through our disguises. Bill, we are wearing high-visibility vests and carrying clipboards. In this neighborhood, that makes us essentially invisible. Nobody questions a person who looks like they are checking a meter. I still want to jump in the castle. Focus. If we get that dirt bike, we can retire. For how long? Until next weekend. There is a sweet sixteen in the heights. They are rumored to have a fountain made of chocolate. Can we steal a fountain? We can steal anything if the plan is tight enough. That does not make any sense. Just grab the jerky, Bill. It is go time.
created_at
"2026-02-25T01:08:14.41Z"
2/25/2026, 1:08:14 AM
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1524
moderation_latency
(null)
generation_time
23665
tokens_prompt
77
tokens_completion
1935
native_tokens_prompt
67
native_tokens_completion
3744
native_tokens_completion_images
0
native_tokens_reasoning
3047
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0112655
router
(null)
provider_responses
0
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok
false
latency
1524
model_permaslug
"google/gemini-3-flash-preview-20251217"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981694-Z7wvj3e958J05kTOAW90"
upstream_id
(null)
total_cost
0.0112655
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
530
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
539
wordsDialogue
0
0.0152%