NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTZY9ZYKCFQ54GBNDRBGGY3
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
3.5s
Cost:
$0.000915
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
2.01%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
1.0058%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon pink tutu and glared at the invitation pinned to the drywall. Bill was busy trying to jam a velvet party hat over his ears, his thick fingers tangling in the elastic strap. We are definitely going to hell for this, Bill muttered, snapping the strap against his chin. He hissed in pain, rubbing a red welt. Don’t be dramatic, Molly whispered, checking her watch. It is a four-year-old’s birthday bash, not a gold bullion vault. The security measures are basically a lukewarm ball pit and a stressed clown named Giggles. Giggles does not matter, Bill insisted, gesturing wildly with a half-inflated balloon. It is about the principle of the thing. We are stealing toys from toddlers. Think of the toys, Bill. The limited-edition action figures, the pristine building blocks, the remote-controlled cars. My niece wants a tablet and I am tired of being broke. Are we really doing this or are you just going to whine about your lack of moral compass until the cake is cut? Fine, Bill sighed, his shoulders slumping. But if I get a face-painting allergy, I am taking your share of the loot. You are allergic to nothing except commitment and hard work, Molly retorted. Now, memorize the floor plan. The gifts are staged by the giant inflatable slide. That is our extraction point. The parents will be distracted by the piñata ceremony, which is our window of opportunity. Exactly how are we going to haul an entire table of presents out through a suburban backyard crawling with hyperactive humans? We use the magician’s trunk, Bill. He has a false bottom. We hide the goods, wheel it to the van, and call it a day’s work. You are a terrible person, Molly. I know, but I look fantastic in this tutu. Do you think they will notice if I pocket a handful of juice boxes? For the road, Bill? For the road, Molly agreed, tucking a lock pick into her petticoat. Let us get those presents. Wait, Bill whispered, grabbing her arm. Do you hear that? Hear what? The sound of impending justice? No, the sound of a toddler army screaming the lyrics to that shark song. If we get caught, I am telling them you are my weird aunt. I am actually insulted you would suggest anything else, Molly said, smoothing her skirt. Remember, precision is key. We go in, we grab the haul, and we do not engage with the bouncy house. I cannot promise that, Bill said, eyes already gleaming with the prospect of an oversized trampoline. I have been wanting to test my vertical leap for years. Focus, Bill. The toys. Right. The toys. Just think of the resale value on those building blocks. We are basically entrepreneurs. We are absolutely criminals, Molly corrected. Keep your eyes on the slide. Let’s dance.
created_at
"2026-03-03T23:18:43.98Z"
3/3/2026, 11:18:43 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
496
moderation_latency
(null)
generation_time
3228
tokens_prompt
77
tokens_completion
679
native_tokens_prompt
67
native_tokens_completion
599
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00091525
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
496
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772579923-E3T3pxzJZ05kcpNa17Nk"
upstream_id
(null)
total_cost
0.00091525
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
2.01%
Matches word count
n/a
neededClean
false
words
475
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
481
wordsDialogue
0
1.0058%