NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJV07GAQ6BJZ67QD0BBC0YM4
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
3.4s
Cost:
$0.000969
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
60.28%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
30.1376%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped her rhythmic fingers against the cold plastic of the minivan steering wheel. Bill sat beside her in the passenger seat, his face obscured by a knitted ski mask that appeared suspiciously like a festive snowman. Are you absolutely certain about the schedule, Bill? Molly asked, peering through the rain-streaked windshield. I checked the HOA newsletter three times, Bill replied, adjusting his wooly nose. The twins birthday bash kicks off at four. The gift heap will be unattended while the children are distracted by the aggressive pony rides. Aggressive ponies? Molly sighed. You really think those miserable beasts will keep twenty sugar-crazed toddlers busy for long enough? They are miniature Shetlands, Molly. They bite. That buys us at least twelve minutes of pure, unadulterated chaos, Bill insisted. Twelve minutes is tight for a haul of this magnitude. What if the parents are lurking near the gift table? The parents will be preoccupied with the organic gluten-free cake, Bill said, checking his watch. It is a known fact that those suburban mothers become catatonic when presented with artisanal frosting. And the presents are definitely wrapped? I cannot be seen carrying out raw toys, Bill. It ruins the aesthetic of the heist. Everything is wrapped in shiny, primary-colored foil, Bill whispered. You wanted sparkle, you get sparkle. Did you bring the industrial-grade garbage bags? I brought the reinforced ones, the kind that can hold a lawnmower or a very confused golden retriever, Molly said. Are we wearing the matching neon jumpsuits? They are the key to blending in with the decorations, Bill affirmed. If anyone spots us, we simply claim to be the hired entertainment. I am not doing a puppet show, Bill. I am a professional thief, not a wandering minstrel for preschoolers. You will do the arm gestures if I tell you to, Bill countered. Think of the haul. The sheer quantity of high-end Lego sets and battery-operated monstrosities. Imagine the black market value. Fine, but if a child pulls my jumpsuit, I am walking away, Molly muttered. They are children, Molly, not SWAT team snipers, Bill said. We slip in, fill the sacks, and vanish before the piƱata explodes. The gate creaked open, revealing a manic landscape of balloons and shrieking youth. A small pony trotted past, looking visibly hostile toward a toddler holding a cupcake. That is our cue, Bill said, patting his belt. Remember, keep your movements fluid and avoid the bouncy castle. It is a structural nightmare. I hate bouncy castles, Molly said, checking the lock on her bag. They are the enemy of stealth. Then move fast, Bill said, sliding his door open. Fortune favors the thieves who leave before the glitter cannons fire, Molly. If we survive this, you are buying the pizza, Molly warned. If we survive this, we are eating pizza off the back of a Lego-covered yacht, Bill grinned. Let us go earn our keep.
created_at
"2026-03-03T23:23:45.379Z"
3/3/2026, 11:23:45 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
540
moderation_latency
(null)
generation_time
3279
tokens_prompt
77
tokens_completion
765
native_tokens_prompt
67
native_tokens_completion
635
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00096925
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
540
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772580225-qKcIXWQ6FtpsC2fQEIPG"
upstream_id
(null)
total_cost
0.00096925
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
60.28%
Matches word count
n/a
neededClean
false
words
485
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
493
wordsDialogue
0
30.1376%