NC Bench
Run: 2026-05-10-01KR92YQWBPQVS8M3FNQS3JNM0
Run Details
Model: Gemini 3.1 Flash Lite (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 1m 2s
Cost: $0.000921
Provider: Google AI Studio
Evaluation summary
Matches word count: 6.09%
Dialogue to Total Word Ratio: 0.00%
Overall: 3.0453%
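The Overall figure is consistent with an unweighted mean of the two evaluator scores: (6.09 + 0.00) / 2 = 3.045, which agrees with the reported 3.0453% once rounding of the displayed 6.09% is taken into account. The sketch below only illustrates that arithmetic. Equal weighting is an assumption (this page does not state how the benchmark aggregates scores), and the function name `overall_score` is hypothetical.

```python
def overall_score(scores):
    """Return the unweighted mean of evaluator scores, in percent.

    Equal weighting is an assumption; the benchmark's real aggregation
    rule is not shown on this page.
    """
    if not scores:
        raise ValueError("at least one evaluator score is required")
    return sum(scores) / len(scores)


# Scores as displayed in the evaluation summary (already rounded).
evaluator_scores = [6.09, 0.00]
print(f"{overall_score(evaluator_scores):.3f}%")
```

The small gap between 3.045 and 3.0453 would be explained if the word-count score is stored with more precision than the two decimals shown.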
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The suburban sanctuary of Oak Creek smelled of lukewarm juice boxes and pulverized disappointment. Molly crouched behind a manicured hedge, her tactical turtleneck itching against her neck. She squinted through the slats of the fence, watching a clown struggle with a balloon animal that looked less like a poodle and more like a dying intestine. Beside her, Bill was busy double-checking his specialized bag of silver-plated bolt cutters and a very suspicious amount of duct tape. It is a fortress of sticky fingers and high-fructose corn syrup, Molly whispered, adjusting her night-vision goggles which were currently just taped-on swimming goggles. Do you think they have hidden laser grids, or is it just the standard territorial toddlers? Bill asked, nibbling on a stolen cracker he had found in his pocket. The invitation said Five’s a Crowd, Bill. There are twenty of them. Each one is a pint-sized sociopath with a sugar-induced stamina bar that never empties. We move when the clown goes for his scheduled breakdown phase. Bill looked at his watch, which was currently set to a time zone that did not exist. I have been practicing my stealth walk, but my knees click like a tap dancer on hardwood floors. Is stealth optional? Stealth is the entire operation, Bill. If we get swarmed, they will paint us with glitter glue and call us Uncle Sparkle until the police arrive. I have seen it happen to better men. The plan is simple, right? We enter the gazebo, bag the mountain of wrapping paper, and retreat before the pinata sacrifice begins? Exactly. The booty is concentrated in the corner by the inflatable castle. It is a soft target. I have already cased the perimeter; the parents are distracted by a debate about artisanal sourdough starters. I feel bad about the birthday boy, Bill muttered, shifting his weight. He seemed like an okay kid, even if he did accidentally punch my shin when I was scouting the drop zone earlier. He is five, Bill. He will be on a tablet in ten minutes. 
He will not even remember the presents by bedtime. Focus on the payday. Those toys are vintage limited editions. What if they have a guard dog? One of those hyper-active labradoodles? Then you feed it the distraction treats I gave you. Do not eat them yourself, no matter how much they smell like bacon. I make no promises. My blood sugar is plummeting. Molly sighed and pulled out her fake mustache. The clown stumbled, dropping his bag of loose marbles, causing a chaotic domino effect that sent a row of toddlers spiraling into a collective tantrum. The air rippled with the high-pitched siren of pre-school misery. The perimeter is breached, Molly hissed, standing up. It is go time. Remember, look professional, look fast, and for the love of everything, do not look them in the eyes.
created_at: "2026-05-10T13:59:15.092Z" (5/10/2026, 1:59:15 PM)
model: "google/gemini-3.1-flash-lite-20260507"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 594
moderation_latency: (null)
generation_time: 62246
tokens_prompt: 77
tokens_completion: 722
native_tokens_prompt: 67
native_tokens_completion: 603
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.00092125
router: (null)
provider_responses:
  0:
    endpoint_id: "237af107-d2e9-46c7-b146-c74d75980dee"
    id: "M48Aaq3nE-LRz7IPmtiduQ8"
    is_byok: false
    latency: 594
    model_permaslug: "google/gemini-3.1-flash-lite-20260507"
    provider_name: "Google AI Studio"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1778421555-isfdFSexo5RHUxtbkb4l"
session_id: (null)
api_type: "completions"
id: "gen-1778421555-gV1amO2ewlRGstQmlQgD"
upstream_id: "M48Aaq3nE-LRz7IPmtiduQ8"
total_cost: 0.00092125
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
Evaluation details
Result: 6.09%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 477

Result: 0.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 0.00%, Deviation: 30.00%
Meta Data: neededClean: false, wordsTotal: 486, wordsDialogue: 0

Overall: 3.0453%
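The dialogue evaluator reports 0 dialogue words out of 486, presumably because the response contains no quotation marks, so the ratio is 0.00% and the deviation from the 30% target is the full 30 percentage points. The sketch below shows one way such a ratio could be computed. It is not the benchmark's implementation: the quote-based detection rule, whitespace word splitting, and the function name `dialogue_ratio` are all assumptions. The two evaluators report different word totals (477 and 486), so their real word counting evidently differs from any single simple rule.

```python
import re

# Assumption: dialogue is any text enclosed in straight or curly double quotes.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def dialogue_ratio(text, target=0.30):
    """Estimate the share of words inside quotation marks.

    Words are split on whitespace (an assumption). Returns counts, the
    ratio, and the absolute deviation from the target ratio.
    """
    total = len(text.split())
    # findall yields (straight, curly) tuples; exactly one side is non-empty.
    dialogue = sum(len((a or b).split()) for a, b in QUOTED.findall(text))
    ratio = dialogue / total if total else 0.0
    return {
        "wordsTotal": total,
        "wordsDialogue": dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target),
    }


# Unquoted speech, as in the response above, counts as zero dialogue words.
print(dialogue_ratio("It is go time, Molly hissed."))
# The same line with quotation marks is detected as dialogue.
print(dialogue_ratio('"It is go time," Molly hissed.'))
```

Under this rule, speech written without quotation marks contributes nothing to the dialogue count, which matches the `wordsDialogue` value of 0 reported here.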