NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94TQPK0Z27KE5WGKYG1PS1
Run Details
Model:
Gemini 3 Flash (Preview, Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
24.7s
Cost:
$0.011425
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Check the perimeter, Molly whispered, her eyes glued to the binoculars. I see a bouncy castle, three melting ice-cream cakes, and a golden retriever that looks like it has seen too much, Bill said, reclining in the passenger seat of their dented sedan. Focus, Bill. We are here for the jackpot. The gift table is currently unguarded. It is a five-year-old’s birthday party, Molly. Are we really hitting the low point of our careers? We used to boost luxury watches in Zurich. Now we are staking out a suburban split-level. Watches have serial numbers. You know what does not have a serial number? A pristine, unopened box of Intergalactic Laser-Legos. That is street gold, Bill. It is untraceable and highly liquid. I am just saying, the getaway is going to be embarrassing if we get chased by a man in a polyester Buzz Lightyear suit. My dignity has a floor, and we are currently in the basement. That is why we have the distraction. Did you check the canister? The bubbles? Yes, it is primed. But why bubbles? Toddlers have the attention span of gnats. You release a high-velocity stream of bubbles near the sandbox, and the entire security force—that is the moms, Bill—will be distracted by the sheer photogenic potential of the chaos. They will be too busy posting to their stories to notice the loot moving. And the clown? He has been staring at the van for five minutes. The clown is a variable, but I have got a balloon pump and a bag of scraps. If he tries to intervene, I will challenge him to a giraffe-off. I have been practicing my twists for three weeks. You are terrifying when you are motivated. I am a professional. Now, what is our entry point? The sliding glass door is propped open for the pizza delivery guy. But there is a biological hazard to consider. What? Armed guards? Sticky fingers, Molly. Everywhere. I do not want jam on my tactical turtleneck. It is dry-clean only. Suck it up. We move when the Happy Birthday chorus hits the bridge. That is the peak of the sonic interference. They will not hear us sliding the gifts into the laundry hamper. Wait, look. Is that the aunt? She is guarding the gift table like a hawk. That is Aunt Martha. She has had four mimosas. She is not an obstacle; she is an opportunity. Give her a compliment about her highlights and she will let us walk out with the furniture. I do not like this. The dog is looking at me. It knows our intent. The dog knows nothing. It is a golden retriever. Its brain is just a tennis ball bouncing in an empty room. How do we handle the cake cutting? That is our window of egress. Once the sugar hits their systems, it becomes a mosh pit. We grab the hamper, hit the side gate, and we are back in the van before the first kid throws up. I still think we could have just robbed a toy store. And miss out on the free juice boxes? Think, Bill. Overhead is everything. Fine. But if I get a glitter-bomb to the face, I am quitting the crew. Masks on, Bill. Let us go make some memories.
created_at
"2026-02-25T00:57:50.045Z"
2/25/2026, 12:57:50 AM
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1893
moderation_latency
(null)
generation_time
24597
tokens_prompt
77
tokens_completion
1969
native_tokens_prompt
67
native_tokens_completion
3797
native_tokens_completion_images
0
native_tokens_reasoning
3086
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0114245
router
(null)
provider_responses
0
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok
false
latency
1893
model_permaslug
"google/gemini-3-flash-preview-20251217"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981070-PB6UfJ3GYpIMWz7r8yfV"
upstream_id
(null)
total_cost
0.0114245
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
550
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
559
wordsDialogue
0
0.0000%