NC Bench
Run: 2026-05-10-01KR8XQA3Q23Z4FP601AZ8X7NP
Run Details
Model: Gemini 3.1 Flash Lite
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 4.2s
Cost: $0.000924
Provider: Google AI Studio
Evaluation summary
Score | Evaluator
99.92% | Matches word count
0.00% | Dialogue to Total Word Ratio
49.9595% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon green tutu, the scratchy tulle rasping against her thighs like a thousand angry glittery ants. She stood under the flickering neon palm tree of the inflatable bounce house, staring at the prize table buried beneath a mountain of glossy, bow-topped boxes. Beside her, Bill was sweating profusely into his giant, foam-rubber mascot head, which was shaped unfortunately like a dyspeptic badger. The smell of stale popcorn and cheap industrial floor cleaner was thick enough to chew. Molly hissed, checking her watch. We have exactly four minutes before the cake cutting ceremony distracts the parents. Are you ready with the industrial trash bags or are you just planning to sweat the loot into submission? Bill muffled a groan behind the badger snout. My peripheral vision in this thing is nonexistent, Molly. I can barely see my own paws, let alone identify which of these presents contains the gold-plated gaming consoles and which is just a sad collection of organic modeling clay. Focus, Bill. We are not here for clay. We are here for the high-end electronics and the gift cards hidden inside those flimsy cards. Grab the silver one with the blue ribbon first. That looks like a Lego set that sells for three hundred on the secondary market. I look ridiculous, he whispered, shuffling forward as a toddler in a bumblebee costume sprinted past them. If anyone asks, I am not a thief. I am a badger who has clearly lost his moral compass and perhaps his primary occupation. If you don't stop whimpering, I am leaving you to face the wrath of thirty sugar-crazed eight-year-olds when they realize their loot has vanished. Grab the haul and move toward the side exit. The getaway van is parked behind the bouncy castle, disguised as a mobile cupcake station. Bill reached out, his foam claws fumbling with a stack of packages. His bulky head bumped into a tray of juice boxes, sending a crimson wave of fruit punch across the tablecloth. 
The sound of wet cardboard hitting the floor echoed like a gunshot in the high-ceilinged community center. Oh, brilliant, Molly muttered, watching a parent turn toward the commotion. You managed to sabotage our stealth mission with a structural juice failure in under ten seconds. Just grab the bags and run, you oversized woodland creature. Bill frantically scooped the presents into the plastic sack, the badger head wobbling precariously. I am a professional thief, he hissed, kicking a discarded party hat out of his way. I never agreed to the mascot costume, and I certainly never agreed to be part of a heist involving a crime scene of sticky strawberry nectar. Less complaining, more sprinting, Molly snapped, snatching the heavy bag from his shaking paws. If we make it out of here before the clown realizes his magic show is about to be canceled, the drinks are on me. Now, move your badger knees before we end up in jail wearing nothing but polyester fur.
created_at: "2026-05-10T12:27:48.734Z" (5/10/2026, 12:27:48 PM)
model: "google/gemini-3.1-flash-lite-20260507"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 605
moderation_latency: (null)
generation_time: 4100
tokens_prompt: 77
tokens_completion: 748
native_tokens_prompt: 67
native_tokens_completion: 605
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.00092425
router: (null)
provider_responses: 0
endpoint_id: "237af107-d2e9-46c7-b146-c74d75980dee"
id: "xHkAatGPPK2jqtsP5K6nCQ"
is_byok: false
latency: 605
model_permaslug: "google/gemini-3.1-flash-lite-20260507"
provider_name: "Google AI Studio"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1778416068-DOchJG2B5ITyy0CVoYUi"
session_id: (null)
api_type: "completions"
id: "gen-1778416068-IaXME6jLb13DXNv0APVu"
upstream_id: "xHkAatGPPK2jqtsP5K6nCQ"
total_cost: 0.00092425
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
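The recorded total_cost can be sanity-checked against the native token counts (67 prompt, 605 completion). The sketch below assumes per-million-token rates of $0.25 for input and $1.50 for output. These rates do not appear anywhere on this page; they are hypothetical values chosen because they happen to reproduce the recorded total, and the helper name `estimate_cost` is likewise illustrative.

```python
def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  usd_per_m_input: float, usd_per_m_output: float) -> float:
    """Return the estimated cost in USD for one request."""
    micro_usd = prompt_tokens * usd_per_m_input + completion_tokens * usd_per_m_output
    return micro_usd / 1_000_000


# Token counts are taken from the meta block above.
# The rates (0.25, 1.50) are ASSUMED, not stated by the source.
cost = estimate_cost(67, 605, usd_per_m_input=0.25, usd_per_m_output=1.50)
print(f"{cost:.8f}")  # prints 0.00092425
```

If the assumed rates are right, the cost is dominated by completion tokens: 605 output tokens account for roughly 98% of the total.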
Evaluation details
Result | Evaluator | Details | Meta Data
99.92% | Matches word count | n/a | neededClean: false, words: 497
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 50.00% | neededClean: false, wordsTotal: 505, wordsDialogue: 0
49.9595% | Overall
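The 0.00% dialogue result follows from wordsDialogue being 0 out of wordsTotal 505. The model response contains no quotation marks, so an evaluator that counts words inside quotes would find no dialogue at all. The sketch below shows one way such a ratio and its deviation from the 50% target could be computed. It is not NC Bench's actual evaluator: the function name `dialogue_ratio`, the quote-matching pattern, and the whitespace word splitting are all assumptions made for illustration.

```python
import re

# ASSUMPTION: dialogue is any text enclosed in straight or curly double quotes.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def dialogue_ratio(text: str, target: float = 0.5):
    """Return (words_total, words_dialogue, ratio, deviation) for a passage."""
    words_total = len(text.split())
    words_dialogue = sum(
        len((straight or curly).split())
        for straight, curly in QUOTED.findall(text)
    )
    ratio = words_dialogue / words_total if words_total else 0.0
    deviation = abs(ratio - target)
    return words_total, words_dialogue, ratio, deviation


# Unquoted speech, as in the recorded response: no dialogue is detected.
print(dialogue_ratio('Focus, Bill. We are not here for clay.'))
# prints (8, 0, 0.0, 0.5)

# The same kind of line with quotation marks is counted as dialogue.
print(dialogue_ratio('"Focus, Bill," Molly said.'))
# prints (4, 2, 0.5, 0.0)
```

Under this assumption, a response can read as dialogue-heavy to a human and still score 0% because the speech is never delimited. Note that the two evaluators above report different word counts (497 and 505), so their tokenization rules evidently differ, and this sketch is not expected to reproduce either figure exactly.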