NC Bench
Run: 2026-05-20-01KS260EEWAY6SWW9VP3ZVRXA4
Run Details
Model: Gemini 3.5 Flash (Reasoning, Minimal)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 5.1s
Cost: $0.005851
Provider: Google AI Studio
Evaluation summary
Matches word count: 27.17%
Dialogue to Total Word Ratio: 0.00%
Overall: 13.5829%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
We have exactly twelve minutes before the magic show ends, Bill said, adjusting his pastel yellow party hat. That is our window of opportunity. Once the French bulldog does his card trick, forty sugar-crazed eight-year-olds will stampede back to the living room for cake. Molly adjusted her fake mustache, which was slipping into her mouth. Why am I wearing this? I look like a disgruntled Mario brother. You are the hired entertainment, Bill explained, squinting through the bushes at the suburban house. You are Uncle Mustache the balloon sculptor. I am the clown who had a flat tire. It is a highly believable narrative. We do not even have balloons, Bill. I have a bag of lawn clippings and a roll of duct tape. Work with what you have, Molly. Improvisation is the soul of larceny. Now, did you locate the loot? It is in the dining room, laid out on a trestle table. There is a mountain of presents. I saw a box that definitely contains a motorized mini-scooter. Perfect. We bag the loot, load the minivan, and we are gone before the first juice box is pierced. What about the mother? She has been staring at me through the kitchen window for ten minutes. She is suspicious because you are eating her decorative mulch. Stop doing that. It tastes like cedar, Molly muttered, wiping her hands on her trousers. Is the getaway vehicle running? The engine is purring like a kitten with asthma. I parked it behind the bouncy castle to block the exit. You blocked the only exit of a bouncy castle? Bill, that is a safety hazard. It is tactical positioning, Molly. If things go south, we use the children as a soft barrier. They are bouncy. They bounce. You are a monster. Where is the sack? I brought the extra-large laundry bags from home. The ones with the little flowers on them. They blend in. Do flower bags scream 'professional heist' to you? They scream 'laundry day,' which is highly disarming, Bill insisted, tapping his plastic watch. Undercover genius. 
Now, on my mark, we enter through the patio doors. Wait, Molly said, grabbing his arm. What is that noise? It sounds like a violin being tortured by a dolphin. That is the magic bulldog, Bill. The show is over. They are clapping. We are out of time! Go, go, go! Molly kicked open the screen door. Freeze, toddlers! she yelled. Bill gasped, grabbing her coveralls. Why did you say that? We are supposed to be party guests! I panicked! Just grab the scooter! A tiny kid in a dinosaur suit pointed a plastic sword at them. You are not the balloon guy, the toddler declared. You do not have any balloons. He is learning, Molly whispered to Bill. Eliminate him. I am not going to eliminate a dinosaur, Molly! Just take the blue boxes and run!
created_at: "2026-05-20T07:53:37.504Z" (5/20/2026, 7:53:37 AM)
model: "google/gemini-3.5-flash-20260519"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 858
moderation_latency: (null)
generation_time: 4923
tokens_prompt: 77
tokens_completion: 722
native_tokens_prompt: 67
native_tokens_completion: 639
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.0058515
router: (null)
provider_responses
0
endpoint_id: "bd50061c-e1d8-469e-96b1-d322e77378f8"
id: "gWgNaqPyLqfH-sAP35HeuQc"
is_byok: false
latency: 858
model_permaslug: "google/gemini-3.5-flash-20260519"
provider_name: "Google AI Studio"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779263617-LRsf0HLhbW0FeRpX6JMB"
session_id: (null)
api_type: "completions"
id: "gen-1779263617-mQSxOViSiG0XbDN88KCm"
upstream_id: "gWgNaqPyLqfH-sAP35HeuQc"
total_cost: 0.0058515
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
Evaluation details
Evaluator: Matches word count
Result: 27.17%
Details: n/a
Meta Data: neededClean: false, words: 481

Evaluator: Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 3.30%, Deviation: 46.70%
Meta Data: neededClean: false, wordsTotal: 485, wordsDialogue: 16

Overall: 13.5829%
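
The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator can be reproduced from its metadata. The sketch below is illustrative only: the function name and the 50% target parameter are assumptions taken from the scenario description, not the evaluator's actual code, and it does not attempt to reproduce how the 0.00% score is derived from the deviation.

```python
def dialogue_ratio_report(words_dialogue: int, words_total: int,
                          target_ratio: float = 0.5) -> tuple[float, float]:
    """Return (ratio, deviation) as fractions.

    Hypothetical helper: target_ratio=0.5 reflects the "50% dialogue"
    scenario; the real evaluator's implementation is not shown on this page.
    """
    ratio = words_dialogue / words_total
    # Deviation is taken here as the absolute distance from the target share.
    deviation = abs(target_ratio - ratio)
    return ratio, deviation


# Values from the evaluator metadata: wordsDialogue = 16, wordsTotal = 485.
ratio, deviation = dialogue_ratio_report(16, 485)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 3.30%, Deviation: 46.70%"
```

Under these assumptions the output matches the Details field above, which suggests the deviation is measured in percentage points from the 50% target.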