NC Bench
Run: 2026-05-10-01KR8VJBG6HYD2WJJF81EHBECF
Run Details
Model: Gemini 3.1 Flash Lite
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 3.9s
Cost: $0.000938
Provider: Google AI Studio
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly adjusted her neon green tutu, the scratchy tulle rasping against her thighs. She checked the heavy-duty duffel bag hidden under a pile of oversized clown shoes. Bill leaned against the peeling wallpaper of the supply closet, popping a bright blue gumball into his mouth. He looked ridiculous in his polyester magician’s vest, complete with fake rabbits dangling from the pockets. The sound of children screeching in the main hall filtered through the thin wall, a chaotic symphony of demands for cake and chaos. They had thirty minutes until the grand reveal of the gift pile, a mountain of gold wrapped in shiny, primary-colored foil. You think the giant puppet will notice us, Bill? Only if you try to stuff him in the bag, Molly. I just want to be prepared for contingencies. Focus on the cake, not the mascot. Bill, it is a six-foot-tall squirrel. It has eyes that see into your soul. It has a visible zipper, Molly. It is a college kid named Tyler. Tyler might be a trained ninja for all we know. He is an English major who just wants his sixty bucks. Fine, but if the squirrel attacks me, I am using the smoke bombs. Do not use the smoke bombs. It is a room full of toddlers. I will use glitter bombs instead. It will be festive. Just stay close to the punch bowl and wait for my signal. When is the signal? When the birthday boy starts screaming about the flavor of his icing. That sounds like a very reliable indicator of chaos. It is the most reliable event in the industry. Are the zip ties ready behind the gift mountain? Yes, they are tucked under the pile of squishmallows. Good. I really need that limited-edition rocket ship set. You cannot sell a rocket ship to a fence. It is too conspicuous. It is for my nephew, Bill. I am a thief, not a monster. We are literally stealing from a four-year-old’s celebration. It is for a good cause, and that cause is my family’s holiday budget. Stop twitching. You look like you are having a seizure in that wig. It is itchy. 
Why does this wig have so many curls? It makes you look like a deranged toddler. Keep it on. I feel like a very large, very angry balloon animal. Good. That is exactly the camouflage we need for this room. I hear the music stopping. Is that the clown? That is the signal. Remember, grab the big boxes first. Got it. See you on the other side of the icing disaster. Stay low, keep the tutu flat, and do not make eye contact. I hate this job, Bill. I know, Molly. See you in the glitter.
created_at: "2026-05-10T11:50:09.169Z" (5/10/2026, 11:50:09 AM)
model: "google/gemini-3.1-flash-lite-20260507"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 678
moderation_latency: (null)
generation_time: 3789
tokens_prompt: 77
tokens_completion: 683
native_tokens_prompt: 67
native_tokens_completion: 614
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.00093775
router: (null)
provider_responses: 0
endpoint_id: "237af107-d2e9-46c7-b146-c74d75980dee"
id: "8XAAas-7Fsucz7IPmurm-Q8"
is_byok: false
latency: 678
model_permaslug: "google/gemini-3.1-flash-lite-20260507"
provider_name: "Google AI Studio"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1778413809-eCz3I6QMBvVnGCdUcm7P"
session_id: (null)
api_type: "completions"
id: "gen-1778413809-286QiHC1ICc5ze7rszjM"
upstream_id: "8XAAas-7Fsucz7IPmurm-Q8"
total_cost: 0.00093775
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
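As a rough sanity check on the metadata above, completion throughput can be derived from native_tokens_completion and generation_time. The sketch below assumes generation_time is reported in milliseconds, which is consistent with the 3.9s duration in the run details but is not stated on the page. The function name is hypothetical.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: float) -> float:
    """Return completion throughput in tokens per second.

    Assumption: generation_time_ms is in milliseconds. The page does not
    state the unit; this is inferred from the 3.9s run duration.
    """
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the metadata: native_tokens_completion and generation_time.
rate = tokens_per_second(614, 3789)
print(f"{rate:.1f} tokens/s")
```

Under that assumption the run works out to roughly 162 completion tokens per second.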
Evaluation details
Matches word count
  Result: 0.00%
  Details: n/a
  Meta Data: neededClean: false, words: 457
Dialogue to Total Word Ratio
  Result: 0.00%
  Details: Ratio: 0.00%, Deviation: 50.00%
  Meta Data: neededClean: false, wordsTotal: 464, wordsDialogue: 0
Overall: 0.0000%
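The evaluator reports wordsDialogue as 0 even though most of the response consists of spoken lines. The response contains no quotation marks, so one plausible explanation is that dialogue is detected by quote delimiters. That is an assumption, not something the page confirms. The sketch below shows how such a check might work: it counts words inside straight or curly double quotes and measures the deviation from the 50% target. All names here are hypothetical, and NC Bench's actual evaluator may tokenize differently (its two evaluators report 457 and 464 total words for the same text).

```python
import re

# Matches text inside straight ("...") or curly (“...”) double quotes.
# Assumption: dialogue is whatever sits between a pair of double quotes.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_ratio(text: str, target: float = 0.5) -> dict:
    """Return total words, quoted words, their ratio, and deviation from target."""
    total = count_words(text)
    # findall yields (straight, curly) tuples; only one group is non-empty.
    spoken = sum(count_words(a or b) for a, b in QUOTED.findall(text))
    ratio = spoken / total if total else 0.0
    return {
        "wordsTotal": total,
        "wordsDialogue": spoken,
        "ratio": ratio,
        "deviation": abs(ratio - target),
    }


# The same kind of line with and without quotation marks.
unquoted = dialogue_ratio("When is the signal? When the birthday boy starts screaming.")
quoted = dialogue_ratio('"When is the signal?" Molly asked.')
print(unquoted)
print(quoted)
```

With a quote-based detector like this one, unquoted speech contributes nothing to wordsDialogue, which would produce the 0.00% ratio and 50.00% deviation shown in the details.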