NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTYYCKMQKHAADSCCZSJ821N
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4.0s
Cost:
$0.000938
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
21.6892%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against the peeling wallpaper of the van, tapping her fingers against a velvet mask that didn't quite cover her skepticism. Bill was busy unfolding a blueprint of the suburban cul-de-sac as if he were planning a raid on the federal reserve instead of a birthday bash for a six-year-old named Timothy. Are you sure about the surveillance, Bill? Molly asked, eyeing the kid-sized backyard through binoculars. It is a fortress of bouncy castles and juice boxes, he muttered, tracing a line on the map. We strike when the clown starts his balloon animal routine. It creates a twelve-second window of total distraction. Are you kidding me? A clown is basically a sensory overload machine. How do you propose we navigate the swarm of sugar-crazed toddlers without being noticed? We wear the uniforms, Bill said, pulling two crumpled neon vests from under his seat. We are the hired help. Nobody questions the guy carrying a crate labeled party favors. Isnt that suspicious if you are not invited? Everyone is invited to a six-year-olds party if you look like you know where the extra cake is hidden. Just keep your head low. Molly sighed and pulled on her vest, which smelled faintly of stale popcorn. I still cannot believe we are hitting a party for a child. This is our lowest point,Bill. My parents thought I was going to be an architect. I am an architect of opportunity, Molly. Think of the haul. There are at least thirty boxes of premium Legos in that pile. Do you have any idea what those go for on the dark web? It still feels wrong. Maybe we just steal the cake? We are professionals, Bill snapped. We take the gifts, we leave the toddlers, and we vanish before they realize the piƱata was rigged to explode with confetti instead of candy. Do you have the grappling hook? I have the collapsible ladder, Bill. We are climbing a fence, not scaling a skyscraper. Precision matters, he said, adjusting his spectacles. If we wake the neighbor's golden retriever, the entire operation is compromised. Are you ready for the signal? I was ready five minutes ago, Molly replied, opening the van door. Just tell me you did not bring the glitter bombs again. I brought the glitter bombs, he admitted, stepping out into the sunlight. It is for dramatic effect. Molly stared at him, her composure breaking into a grimace of pure annoyance. If I get glitter in my tactical cargo pants, I am leaving you with the mascot costume. He patted his pocket, checking for his lockpicking kit. Deal. Now, approach on my mark. Remember, eyes on the presents, not the cake. Molly checked her watch, inhaling the suburban air. I hope the parents ordered extra napkins, because things are about to get messy. They moved toward the fence, silhouettes against the bright, chaotic energy of the lawn.
created_at
"2026-03-03T23:01:18.079Z"
3/3/2026, 11:01:18 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
541
moderation_latency
(null)
generation_time
3896
tokens_prompt
77
tokens_completion
728
native_tokens_prompt
67
native_tokens_completion
614
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00093775
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
541
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772578878-rVvfcyBc1A6pXSGoJLo8"
upstream_id
(null)
total_cost
0.00093775
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
483
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
492
wordsDialogue
0
21.6892%