NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8Y0NGMCSPX4YSRRXSHV27P
Run Details
Model:
Gemini 3.1 Flash Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4.0s
Cost:
$0.000927
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.49%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.2460%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly flattened herself against the prickly hedge, her sequined balaclava itching like a nest of angry fire ants. Beside her, Bill was trying to stuff a discarded clown wig into his tactical vest, his breathing wheezy and erratic. The backyard was a sprawling, suburban nightmare of primary-colored plastic castles, oversized balloons, and the distant, high-pitched shrieks of children fueled by pure, unadulterated sugar. Do we really have to wear the glittery masks, Bill hissed, rubbing at his nose where plastic sequins had started to flake off. It is humiliating. Molly rolled her eyes and adjusted her binoculars to scan the patio. It is a fairy-themed birthday party, Bill. If we look like bank robbers, someone calls the police. If we look like two very deranged, budget-strained party guests, we just look like a social train wreck. Blend in or go home. Why are we stealing presents from toddlers, he asked, his voice dripping with moral confusion. This feels like a new low even for us. They are spoiled, Molly whispered, pointing a gloved finger at a pile of gift-wrapped towers that looked suspiciously like imported electronic gadgets. That kid in the velvet vest has an iPad, a drone, and a robotic dog that costs more than my car. We are performing a wealth redistribution maneuver. Is it still redistribution if we are just buying rum with the proceeds, Bill muttered, shifting his weight as a stray soccer ball bounced off his shin. It is a lifestyle supplement, she corrected, glancing toward the sliding glass door where a frantic parent was currently wrestling a rogue glitter cannon. Listen, the distraction is happening now. When the clown slips on the cake, you go for the pile. I will cut the power to the bouncy house. You promised me the clown was not going to be involved, Bill groaned, gripping a duffel bag made of duct-taped trash sacks. I have a genuine phobia of oversized shoes. Just focus on the loot, not the footwear, Molly commanded, sliding lower into the dirt. Look, the caterer is distracted by the pony. This is our moment. If we get caught by a bouncy castle, I am telling them you forced me to do this, Bill said, though he was already hunching his shoulders for the crawl across the manicured lawn. If we get caught, I am telling them you are the birthday performer's long-lost, bitter twin, she shot back, checking her watch. Three, two, one. Go. They scrambled forward, bellies low to the grass, two shadows moving between the lawn chairs. A stray toddler stopped mid-tantrum to stare at them, but Molly silenced him with a stern finger to her lips. They reached the gift table, shielded by a giant, inflatable unicorn. Bill grabbed a stack of boxes and groaned under the weight. It was time for the exit.
created_at
"2026-05-10T12:32:55.323Z"
5/10/2026, 12:32:55 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
573
moderation_latency
(null)
generation_time
3853
tokens_prompt
77
tokens_completion
708
native_tokens_prompt
67
native_tokens_completion
607
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00092725
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"93oAaszCIJekqtsPsLnCwAU"
is_byok
false
latency
573
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778416375-EEkXung4PeIVDYgWdLzb"
session_id
(null)
api_type
"completions"
id
"gen-1778416375-xt7kC4jvnH9i6pvs7vaE"
upstream_id
"93oAaszCIJekqtsPsLnCwAU"
total_cost
0.00092725
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.49%
Matches word count
n/a
neededClean
false
words
473
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
481
wordsDialogue
0
0.2460%