NC Bench
Run: 2026-03-03-01KJTZ8EA79V4VGAQK13GJRP8A
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3.5s
Cost:
$0.000933
Provider:
Google AI Studio
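The reported cost can be cross-checked against the native token counts listed in the Meta section (native_tokens_prompt 67, native_tokens_completion 611). The following is a minimal sketch, not the provider's billing logic. The per-million-token rates are assumptions: they are not stated anywhere on this page and were chosen only because they reproduce the reported usage figure. The function name `estimate_cost` is hypothetical.

```python
# Assumed rates in USD per one million tokens (NOT taken from this page).
ASSUMED_INPUT_RATE = 0.25
ASSUMED_OUTPUT_RATE = 1.50


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate request cost in USD from native token counts."""
    input_cost = prompt_tokens * ASSUMED_INPUT_RATE / 1_000_000
    output_cost = completion_tokens * ASSUMED_OUTPUT_RATE / 1_000_000
    return input_cost + output_cost


# Token counts as reported in the Meta section of this run.
cost = estimate_cost(prompt_tokens=67, completion_tokens=611)
print(f"{cost:.8f}")  # prints 0.00093325
```

Under these assumed rates the estimate agrees with the usage and total_cost values of 0.00093325, which the Run Details display rounds to $0.000933.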
Evaluation summary
Matches word count: 51.93%
Dialogue to Total Word Ratio: 0.00%
Overall: 25.9628%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon green tutu, the scratchy tulle rasping against her thighs like a thousand angry miniature blades. She glared at Bill, who was currently attempting to wedge a glittery unicorn horn onto his sweat-beaded forehead. The suburban driveway was clogged with minivans, and the sound of a sugar-addled toddler choir shrieked from the backyard. Are you absolutely certain the target is undefended, Bill? It sounds like a war zone in there. Bill squinted through the hole in his felt mask, his mustache twitching with nervous energy. It is a birthday party, Molly. The primary defenses are juice boxes and structural instability disguised as a bouncy castle. We are elite, we are ghosts, and we are going to liberate those wrapped boxes before the cake cutting. I feel ridiculous wearing this fairy outfit, Bill. My tactical pouch is hidden under a literal lace ruffle. How am I supposed to pull a lockpick from a tutu while avoiding a clown? You look majestic. The clown is the variable, I admit. But the intel says he is an amateur hired by a frantic mother. He spent twenty minutes trying to inflate a balloon sword earlier. He is not a security threat. What about the perimeter? There are thirty six year olds with sticky fingers. If they catch us, they will swarm like piranhas. We have the ultimate distraction. Look in the bag. Molly peered into the duffel. She pulled out a small, pulsating speaker. Is this a recording of a lullaby? It is a recording of a magical fairy flute. When I activate it, every child under the age of forty-eight inches will be momentarily incapacitated by deep, rhythmic confusion. Perfect for a clean exit. You are a chaotic genius, Bill. I just hope the loot is worth the risk of being tackled by a miniature toddler in a dinosaur costume. The haul is estimated at three high-end tablet computers and at least four gaming consoles. Do you honestly think I would spend this much on a pair of fairy wings for anything less than a grand payday? 
Molly checked her watch, inhaling the faint scent of stale popcorn drifting from the garden fence. The flute melody better work, or I am leaving you to face the birthday girl’s wrath alone. I would expect nothing less. Now, adjust your wings. We strike when the clown turns his back to the punch bowl. She crouched low, trying to silence the jingle of the bells on her shoes. They crept toward the fence, the fairy lights above reflecting in their eyes like twin hungry predators. Do you have the exit route mapped? Yes. Through the kitchen, out the dog door, and over the fence by the shed. Just don't trip on the oversized cardboard castle. If I trip, I am taking you down with me. Deal. Now, ready? Three, two, one. Let the games begin, you fairy.
created_at
"2026-03-03T23:06:47.508Z"
3/3/2026, 11:06:47 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
625
moderation_latency
(null)
generation_time
3287
tokens_prompt
77
tokens_completion
718
native_tokens_prompt
67
native_tokens_completion
611
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00093325
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
625
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772579207-lKmRYdhRyDYimsgHsPQv"
upstream_id
(null)
total_cost
0.00093325
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Matches word count: 51.93%
  Details: n/a
  Meta Data: neededClean: false, words: 484
Dialogue to Total Word Ratio: 0.00%
  Details: Ratio: 0.00%, Deviation: 50.00%
  Meta Data: neededClean: false, wordsTotal: 488, wordsDialogue: 0
Overall: 25.9628%
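The zero dialogue count is what a quote-based counter would produce for this response, because the model wrote the spoken lines without quotation marks. The following is a minimal sketch of such a counter, assuming that dialogue means words enclosed in straight or curly double quotes and that words are whitespace-separated. The actual NC Bench evaluator is not shown on this page, so the function names and the counting rule here are hypothetical.

```python
import re

# Matches a span enclosed in straight or curly double quotes.
QUOTED_SPAN = re.compile(r'["\u201c]([^"\u201d]*)["\u201d]')


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a passage."""
    total_words = len(text.split())
    dialogue_words = sum(len(span.split()) for span in QUOTED_SPAN.findall(text))
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Quoted speech is counted as dialogue.
quoted = 'Molly sighed. "Are you sure, Bill?" she asked.'
print(dialogue_ratio(quoted))  # (4, 8, 0.5)

# The same kind of line without quotation marks yields no dialogue words.
unquoted = "Are you sure, Bill? It is a birthday party, Molly."
words, total, ratio = dialogue_ratio(unquoted)
print(words, ratio, deviation(ratio))  # 0 0.0 0.5
```

With wordsDialogue at 0 and a target of 50%, the deviation is the full 50.00% reported above. The Overall figure is consistent with an unweighted mean of the two evaluator scores (2 × 25.9628 = 51.9256, which rounds to 51.93), although the page does not state the formula.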