NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8WP2GV2RHKSEE7Y35KH2YV
Run Details
Model:
Gemini 3.1 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4.2s
Cost:
$0.000932
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
93.65%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
46.8248%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The minivan idled at the curb, its engine wheezing like an asthmatic pug. Molly tapped the steering wheel, her fingers dancing in a nervous, rhythmic staccato against the cracked faux-leather. Beside her, Bill adjusted his oversized velvet bow tie, squinting at the glossy invitation he held with paws that looked far too large for delicate safecracking. The suburban driveway in front of them was teeming with pint-sized agents of chaos, all vibrating with high-fructose energy and wearing cardboard crowns. Are you absolutely sure the main stash is in the garage, Bill? Molly asked, her gaze fixed on a toddler attempting to eat a balloon. She adjusted her wig, which was itching with the intensity of a thousand tiny needles. Trust me, I cased the joint during the morning grocery run, Bill replied, checking his watch with a solemnity usually reserved for bank vaults. The birthday boy has an obsessive-compulsive hoard of LEGO sets. It is a goldmine waiting for a professional touch. Molly sighed, sliding the minivan into park just as a clown tripped over a tricycle. We are stealing toys from children, Bill. My mother always said I would end up in prison, but I never imagined my crime would involve stepping on plastic bricks in the dark. It is not just toys, it is high-value assets, Bill corrected, pulling a silk sack from under his seat. We are liberating them from a life of neglect and slobber. Plus, have you seen the size of that pirate ship set? We could fetch a premium on the black market by Tuesday. Molly unbuckled her seatbelt and leaned over to check her reflection in the rearview mirror. I just hope the clown does not try to engage me in conversation. I have zero patience for balloon animals or forced levity while I am working. Bill popped the door handle and stepped out, his oversized shoes clattering on the pavement. You just focus on the perimeter, he whispered, hunching his shoulders to mimic a stealthy predator. I will dive into the clutter, secure the loot, and meet you by the bouncy castle in ten minutes sharp. Do not leave me if things get sticky. I am not leaving you, you geriatric goon, Molly muttered, grabbing her velvet bag. Just keep your head down and do not trip over the bouncy house blower. Remember, we are invisible, we are swift, and we are absolutely pathetic for doing this. Bill offered a crisp thumbs-up, his bow tie sliding sideways toward his ear. Precision is our middle name, he promised, turning toward the backyard where a sea of cupcakes beckoned. Molly shook her head, feeling the absurdity of their situation sink into her bones. She checked her disguise one last time and exited the vehicle, ready to commit a larceny that would surely haunt their reputations for years to come. The neighborhood was quiet, save for the muffled shrieks of impending birthday joy. It was time.
created_at
"2026-05-10T12:09:39.619Z"
5/10/2026, 12:09:39 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
594
moderation_latency
(null)
generation_time
4075
tokens_prompt
77
tokens_completion
728
native_tokens_prompt
67
native_tokens_completion
610
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00093175
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"g3UAaraBMfqmmtkP7uKegAw"
is_byok
false
latency
594
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414979-8fr9ehjkOw3oWrt4laWN"
session_id
(null)
api_type
"completions"
id
"gen-1778414979-uk8mDUUSlpZ1AEL1bkMH"
upstream_id
"g3UAaraBMfqmmtkP7uKegAw"
total_cost
0.00093175
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
93.65%
Matches word count
n/a
neededClean
false
words
491
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
497
wordsDialogue
0
46.8248%