NC Bench
Run: 2026-05-10-01KR91AZYGFN7M0C9DRA36VBJY
Run Details
Model: Gemini 3.1 Flash Lite (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 4.0s
Cost: $0.000930
Provider: Google AI Studio
Evaluation summary
Score | Evaluator | Details Link
43.38% | Matches word count | (details)
0.00% | Dialogue to Total Word Ratio | (details)
21.6892% | Overall
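The overall figure is consistent with an unweighted mean of the two evaluator scores. The page does not state how the benchmark aggregates them, so the sketch below is an assumption, and `overall_score` is a hypothetical helper name rather than part of NC Bench.

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores, in percent (assumed aggregation)."""
    return sum(scores) / len(scores)


# Scores as displayed in the summary, rounded to two decimals.
displayed = [43.38, 0.00]
print(f"{overall_score(displayed):.2f}%")
```

Under this assumption, the displayed overall of 21.6892% implies an unrounded word count score of about 43.3784%.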
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The suburban cul-de-sac was silent, save for the muffled, frantic thumping of a bass-heavy version of Baby Shark drifting from the backyard of number forty-two. Molly adjusted her tactical balaclava, which was slightly itchy and smelled faintly of mothballs. Beside her, Bill was trying to cram an oversized bowling ball bag into a decorative hedge. He looked like an amateur gardener having a nervous breakdown. Through the gaps in the cedar fence, they could see the shimmering bounty: a mountain of glitter-wrapped boxes piled precariously on a folding table, guarded only by a distracted clown currently trying to twist a balloon into the shape of a giraffe. Check the perimeter again, Molly whispered, poking Bill in the ribs. Is the clown still distracted by the balloon disaster? Bill squinted through his binoculars. He is currently being assaulted by a toddler with a plastic lightsaber. We have a solid thirty-second window before he realizes the giraffe is just a lumpy sausage. Are you sure we are doing this? It feels wrong to rob a party with a bouncy castle present. Molly sighed and checked her watch. It is a house full of sugar-crazed six-year-olds, Bill. They have no concept of asset protection. They will forget these gifts exist by tomorrow morning when they find a cool stick outside. We are essentially doing them a favor by decluttering their living room. Bill grunted, pulling his mask down over his nose. I just feel like maybe we should have targeted a jewelry store, or a vault, or literally anywhere that does not involve the threat of stepping on a stray Lego. The moral implications are starting to weigh on me. Stop being soft, she hissed, nudging him toward the side gate. Think of the loot. I heard through the grapevine that there is a Limited Edition Laser-Blaster 5000 in those ruins. Bill perked up. The one with the infrared targeting system? Exactly. Now, follow my lead. We go in low, grab the premium offerings, and exit via the sandbox. 
If any kids try to stop us, just toss a handful of gummy bears in the opposite direction. It works every time. Bill nodded, his confidence returning as the prospect of high-end gadgets outweighed his guilt. They slid through the gate like shadows cast by a neon party light. The air grew thick with the smell of spilled juice and despair. The clown let out a high-pitched pop, followed by the wail of a child, providing the perfect cover. Molly surged forward, her gloves gripping the edge of the table. One by one, they slid the presents into the duffel bag, moving with the precision of professionals. Just as the clown turned, clutching a tragic, limp balloon, Molly whispered a final instruction. Run, she commanded, and they vanished into the darkness, leaving only the confused cries of children and a pile of unboxed glitter in their wake.
created_at
"2026-05-10T13:30:59.415Z"
5/10/2026, 1:30:59 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
544
moderation_latency
(null)
generation_time
3904
tokens_prompt
77
tokens_completion
723
native_tokens_prompt
67
native_tokens_completion
609
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00093025
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"k4gAauWIJuPNqtsPler4yAE"
is_byok
false
latency
544
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778419859-cP8lpyf4dwxZk8kef6qn"
session_id
(null)
api_type
"completions"
id
"gen-1778419859-oi9CxNmfmn4GcBh0zMJA"
upstream_id
"k4gAauWIJuPNqtsPler4yAE"
total_cost
0.00093025
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
43.38% | Matches word count | n/a | neededClean: false, words: 483
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 30.00% | neededClean: false, wordsTotal: 495, wordsDialogue: 0
21.6892% | Overall
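The model response contains no quotation marks, which would explain `wordsDialogue: 0` if the evaluator only counts words inside quotes. The evaluator's actual method is not shown on this page. The following is a minimal sketch under that assumption, with a hypothetical `dialogue_ratio` function and plain whitespace tokenisation (the real evaluators evidently count words differently, since they report 483 and 495 for the same text).

```python
import re


def dialogue_ratio(text, target=0.30):
    """Return (ratio, deviation) for words inside double quotes.

    Assumption: dialogue means text enclosed in straight or curly
    double quotes; words are whitespace-separated tokens.
    """
    total = len(text.split())
    # Each match is a tuple: (straight-quoted span, curly-quoted span).
    quoted = re.findall(r'"([^"]*)"|“([^”]*)”', text)
    dialogue = sum(len((a or b).split()) for a, b in quoted)
    ratio = dialogue / total if total else 0.0
    return ratio, abs(ratio - target)


# Unquoted speech, as in the model response, yields no dialogue words.
ratio, deviation = dialogue_ratio("Run, she commanded, and they vanished.")
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

With a 30% target, a ratio of 0.00% gives the 30.00% deviation reported above. The same sentence written as `"Run," she commanded.` would register one dialogue word out of three.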