NC Bench
Run: 2026-05-10-01KR9151M724JB1AJVX3C9SCT9
Run Details
Model: Gemini 3.1 Flash Lite
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3.9s
Cost: $0.000992
Provider: Google AI Studio
Evaluation summary
Matches word count: 86.38%
Dialogue to Total Word Ratio: 0.00%
Overall: 43.1902%
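The Overall figure is consistent with an unweighted mean of the two evaluator scores. The page does not state how Overall is aggregated, so the averaging rule in this sketch is an assumption, and the names `evaluator_scores` and `overall_score` are hypothetical.

```python
# Evaluator scores as displayed on this page, in percent.
evaluator_scores = {
    "Matches word count": 86.38,
    "Dialogue to Total Word Ratio": 0.00,
}


def overall_score(scores):
    """Unweighted mean of the evaluator scores (assumed aggregation rule)."""
    return sum(scores.values()) / len(scores)


overall = overall_score(evaluator_scores)
print(f"Overall: {overall:.2f}%")
```

The result agrees with the reported 43.1902% to two decimal places. The remaining digits suggest the word count score is rounded for display.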
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon green tutu as she peered over the top of the hedge. Bill grunted as he shoved a bag of glitter into his vest pocket. Do you have the perimeter mapped out, Bill? I have the bouncy castle trajectory timed to the bounce frequency of a six-year-old on a sugar high. Good. Did you remember the non-lethal sedative cupcakes? They are labeled as gluten-free vanilla, which serves as a natural repellent to the parents. Excellent. What about the guard rotation? The clown is currently suffering from a localized existential crisis near the lemonade stand. He is entirely compromised. Is the birthday boy distracted by the magician? The magician is currently pulling a half-baked pigeon out of a hat, so yes, the boy is captivated by the impending avian disaster. We need a clean extraction. How is the load-out cargo bay? My minivan is parked in the alley with the trunk wide open. We have exactly four minutes before the cake cutting ceremony. Four minutes is an eternity for professionals like us, Bill. Is the loot insured? I took out a policy on the gift pile. If we get caught, technically the insurance company pays for our legal fees. That is incredibly stupid. It is visionary, Molly. Now, are we doing the stealth approach or the tactical entry? We are going through the side fence. I have the wire cutters tucked into my glitter belt. Did you bring the decoy presents? Of course I brought the decoy presents. They are filled with mothballs and expired coupons for a local laundromat. The kids will be devastated. They will be resilient. It is a vital character-building lesson in the harsh realities of disappointment. You are a cold woman, Molly. And you are a man sweating inside a polka-dot vest. Let us move. Wait, check the wind direction. We cannot have the scent of the decoy coupons alerting the parents to our presence. It is a breeze coming from the east, you neurotic mess. Nobody smells a discount on tumble-dry cycles from twenty yards away. True. 
Should I deploy the smoke bomb? Are you insane? It is a backyard party in a cul-de-sac. Use the bubble machine. The bubble machine is a masterstroke. It creates a tactical screen of sheer whimsy. Precisely. Ready? Ready. Wait, is that a Golden Retriever guarding the swing set? He is a plush teddy bear with a bark collar I bought at a garage sale. He is harmless. You are a genius, Bill. I am an artist, Molly. Let us go secure those plastic toys. Target acquired. Move on my whistle. Wait, I think I hear the parents singing. If they reach the chorus of the birthday song, we have to abort. Then we sprint. Grab the mountain of wrapping paper and run for the minivan. I despise children, Molly. Focus, Bill. Eyes on the prize, not the toddlers. Understood. Initiating the bubble screen now.
created_at: "2026-05-10T13:27:44.527Z" (5/10/2026, 1:27:44 PM)
model: "google/gemini-3.1-flash-lite-20260507"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 493
moderation_latency: (null)
generation_time: 3800
tokens_prompt: 77
tokens_completion: 763
native_tokens_prompt: 67
native_tokens_completion: 650
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.00099175
router: (null)
provider_responses:
  0:
    endpoint_id: "237af107-d2e9-46c7-b146-c74d75980dee"
    id: "0IcAauGVLpKIz7IPhJPToAc"
    is_byok: false
    latency: 493
    model_permaslug: "google/gemini-3.1-flash-lite-20260507"
    provider_name: "Google AI Studio"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1778419664-RSP1rDzIsc5X3svVz53L"
session_id: (null)
api_type: "completions"
id: "gen-1778419664-UW38H11W91jaOYx8yJvl"
upstream_id: "0IcAauGVLpKIz7IPhJPToAc"
total_cost: 0.00099175
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
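The reported total_cost can be reproduced from the native token counts with simple per-token arithmetic. The rates in this sketch are assumptions chosen because they match the reported figure; the page itself does not list the provider's pricing, and `estimate_cost` is a hypothetical helper.

```python
# Native token counts taken from the run metadata.
NATIVE_TOKENS_PROMPT = 67
NATIVE_TOKENS_COMPLETION = 650

# ASSUMED rates in USD per million tokens; not stated anywhere on this page.
ASSUMED_INPUT_USD_PER_MTOK = 0.25
ASSUMED_OUTPUT_USD_PER_MTOK = 1.50


def estimate_cost(prompt_tokens, completion_tokens, input_rate, output_rate):
    """Return the estimated cost in USD for one request."""
    return (prompt_tokens * input_rate + completion_tokens * output_rate) / 1_000_000


estimated = estimate_cost(
    NATIVE_TOKENS_PROMPT,
    NATIVE_TOKENS_COMPLETION,
    ASSUMED_INPUT_USD_PER_MTOK,
    ASSUMED_OUTPUT_USD_PER_MTOK,
)
print(f"{estimated:.8f}")
```

Under these assumed rates the estimate equals the total_cost of 0.00099175, which the Run Details section shows rounded to $0.000992.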
Evaluation details
Matches word count: 86.38%
  Details: n/a
  Meta data: neededClean: false, words: 489
Dialogue to Total Word Ratio: 0.00%
  Details: Ratio: 0.00%, Deviation: 70.00%
  Meta data: neededClean: false, wordsTotal: 500, wordsDialogue: 0
Overall: 43.1902%
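The model response contains no quotation marks, which may explain why wordsDialogue is 0 even though the scene is almost entirely speech. The evaluator's actual logic is not shown on this page. The sketch below assumes a checker that counts only words inside double quotes and compares the ratio against the 70% target; `dialogue_ratio` and `TARGET_RATIO` are hypothetical names.

```python
import re

TARGET_RATIO = 0.70  # the scenario asks for 70% dialogue

# Matches text inside straight or curly double quotes (assumed dialogue marker).
QUOTED_SPAN = re.compile(r'"([^"]*)"|\u201c([^\u201d]*)\u201d')


def dialogue_ratio(text):
    """Return (ratio, deviation) where ratio = quoted words / total words."""
    total_words = len(text.split())
    dialogue_words = sum(
        len((straight or curly).split())
        for straight, curly in QUOTED_SPAN.findall(text)
    )
    ratio = dialogue_words / total_words if total_words else 0.0
    return ratio, abs(TARGET_RATIO - ratio)


# Unquoted speech, as in the response above: no dialogue words are detected.
unquoted = "Do you have the perimeter mapped out, Bill? Let us move."
# The same kind of line with quotation marks and a dialogue tag.
quoted = '"Do you have the perimeter mapped out, Bill?" Molly asked.'

print(dialogue_ratio(unquoted))
print(dialogue_ratio(quoted))
```

Under this assumption, unquoted speech yields a ratio of 0 and a deviation of 0.70, matching the reported "Ratio: 0.00%, Deviation: 70.00%".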