NC Bench
Run: 2026-03-03-01KJV01WENYC3D5FCJBP6PA4TD
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3.7s
Cost:
$0.000939
Provider:
Google AI Studio
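The cost and duration above can be cross-checked against the token counts listed in the response metadata (native_tokens_prompt 67, native_tokens_completion 615, generation_time 3580). The sketch below is only an illustration. The per-million-token prices are assumptions chosen because they reproduce the reported usage value; they are not stated on this page. Reading generation_time as milliseconds is likewise an assumption, and the function names are hypothetical.

```python
# Assumed prices in USD per million tokens; NOT stated on this page.
# They are picked because they reproduce the reported usage figure.
ASSUMED_INPUT_PRICE_PER_M = 0.25
ASSUMED_OUTPUT_PRICE_PER_M = 1.50


def estimate_cost(prompt_tokens, completion_tokens,
                  input_price_per_m=ASSUMED_INPUT_PRICE_PER_M,
                  output_price_per_m=ASSUMED_OUTPUT_PRICE_PER_M):
    """Return the estimated request cost in USD from native token counts."""
    return (prompt_tokens * input_price_per_m
            + completion_tokens * output_price_per_m) / 1_000_000


def tokens_per_second(completion_tokens, generation_time_ms):
    """Return output throughput, assuming generation_time is in milliseconds."""
    return completion_tokens / (generation_time_ms / 1000)


cost = estimate_cost(67, 615)
rate = tokens_per_second(615, 3580)
print(f"estimated cost: ${cost:.8f}")
print(f"throughput: {rate:.1f} tokens/s")
```

Under these assumed prices the estimate matches the reported usage of 0.00093925, which the Run Details block shows rounded as $0.000939.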
Evaluation summary
Score | Evaluator
60.28% | Matches word count
0.00% | Dialogue to Total Word Ratio
30.1376% | Overall
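The overall figure is consistent with an unweighted mean of the two evaluator scores. This is an inference from the numbers shown, not a documented formula, and the function name is hypothetical. Averaging the rounded 60.28% would give 30.14%, so the displayed 30.1376% implies an unrounded word-count score of 60.2752%.

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores (assumed aggregation rule)."""
    if not scores:
        raise ValueError("at least one evaluator score is required")
    return sum(scores) / len(scores)


# 60.2752 is the unrounded word-count score implied by the overall value.
print(f"{overall_score([60.2752, 0.0]):.4f}%")
```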
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly checked her watch, the neon tick of the second hand illuminated the cramped backseat of the getaway sedan. Outside, a colorful banner fluttered in the suburban wind, announcing Leo’s Sixth Birthday Extravaganza with obnoxious enthusiasm. She adjusted her rubber clown mask, which smelled faintly of mildew and sadness, and peered through the windshield at the sea of bouncy houses. Bill, why are we doing this again? I feel gross just standing near a glitter factory. Bill grunted, stuffing a handful of lockpicks into his oversized polka-dot trousers. Because, Molly, the market for artisanal Lego sets is booming. Plus, this kid Leo has the rarest Star Destroyer model ever produced. I have a buyer in Dubai who loses his mind over plastic bricks. Are you sure this is a secure location? I see a toddler eyeing our van like she knows we are professional burglars. She is three, Molly. Keep your eyes on the mission. Once the clown starts his balloon animal routine, we make our move. Ten minutes of chaos, two minutes of swiping the loot, and then we exit through the back hedge. I hate the hedge. It had spiders last time. The hedge is for the greater good of our retirement fund. Are you ready? Molly sighed, checking her pockets one last time. I have the heavy-duty trash bags. But if I get a glitter bomb to the face, I am taking an extra commission. Fair. Now look at them go. Cupcakes are being served. That is our signal. Bill popped the door open, his oversized shoes clattering on the asphalt. They sprinted toward the garage, ducking under a cluster of helium balloons that looked like aggressive grapes. The air was thick with the scent of sugary frosting and the high-pitched screams of hyped-up children. Molly slipped inside the side door, her lungs burning from the dash and the sheer anxiety of being around sticky fingers. Target acquired, Bill whispered, pointing toward a mountain of wrapped boxes sitting on the kitchen island. 
It looks like a king’s ransom of wrapping paper and bows. Grab the big ones first. The heavy boxes usually hide the sets with more pieces. Molly grabbed a rectangular box that rattled satisfyingly. Do you think they will notice the cake is missing? It’s right next to the presents. Leave the cake, you monster. We are thieves, not arsonists. We take the plastic, we leave the sugar. Focus. She hoisted the haul into her bag, her heart hammering against her ribs like a trapped bird. Outside, a chorus of voices began singing the birthday song, the melody straining against the quiet efficiency of their theft. They are getting to the candles, Bill. We have thirty seconds before the birthday boy starts unwrapping. Then we sprint, Molly. Move like a thief, not a confused aunt. They dove toward the exit, their bags clinking with the illicit bounty of childhood joy.
created_at
"2026-03-03T23:20:41.187Z"
3/3/2026, 11:20:41 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
617
moderation_latency
(null)
generation_time
3580
tokens_prompt
77
tokens_completion
738
native_tokens_prompt
67
native_tokens_completion
615
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00093925
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
617
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772580041-NDQt1IeHMWgjFxcEUxrT"
upstream_id
(null)
total_cost
0.00093925
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result | Evaluator | Details | Meta Data
60.28% | Matches word count | n/a | neededClean: false; words: 485
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 50.00% | neededClean: false; wordsTotal: 489; wordsDialogue: 0
30.1376% | Overall
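The model response contains no quotation marks, which would explain wordsDialogue: 0 if the evaluator detects dialogue by quoted spans. The benchmark's actual implementation is not shown on this page, so the following is a minimal sketch under that assumption. It counts words inside straight or curly double quotes and reports the ratio and its deviation from the 50% target. The function names and the quote-matching rule are hypothetical.

```python
import re

# Matches text inside straight ("...") or curly (“...”) double quotes.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def dialogue_ratio(text):
    """Return the fraction of whitespace-separated words inside double quotes."""
    total = len(text.split())
    if total == 0:
        return 0.0
    dialogue = sum(len((straight or curly).split())
                   for straight, curly in QUOTED.findall(text))
    return dialogue / total


def deviation_from_target(ratio, target=0.5):
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


unquoted = "I hate the hedge. It had spiders last time."
ratio = dialogue_ratio(unquoted)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation_from_target(ratio):.2%}")
```

With unquoted dialogue such as the sample above, the sketch reports a 0.00% ratio and a 50.00% deviation, the same values shown in the Details column. How the benchmark converts that deviation into a score is not stated here.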