NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR91ZZ343MG1AGRV11CE3GMC
Run Details
Model:
Gemini 3.1 Flash Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3.3s
Cost:
$0.000960
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
75.16%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
37.5778%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon green tutu, the plastic tulle scratching her thighs in a way that made her want to commit arson instead of petty theft. Across the cramped utility closet, Bill was currently trying to wrestle a balloon animal dinosaur into the pocket of his oversized polyester tuxedo. The air smelled of damp mop heads and stale birthday cake, a harbinger of the chaos to come. "Does this make me look like a festive threat or a man suffering a nervous breakdown?" Bill whispered, tugging at a bowtie that was rapidly cutting off his oxygen supply. "You look like a waiter who lost a bet with a clown," Molly replied, checking the timing on her stolen watch. "And stop fidgeting. We have twelve minutes before the sugar-crazed toddlers realize the bouncy castle has been sabotaged." Bill sighed, letting the dinosaur deflate against his thigh. "I still think the bouncy castle was overkill. Why did we have to put a slow leak in the entrance?" "Distraction is the architecture of success, Bill. That screams-per-minute count is going to be our symphony. When they swarm toward the deflating plastic, we make our move." "They have a professional entertainer, Molly. A magician. What if he sees us?" "If he sees us stealing wrapped boxes, we kick him in the shin and tell him he was part of the Prestige. Are you ready with the getaway wagon?" "The tricycle is outside the garage. It is a bit small for my knees, but I think I can make it work if I stand up to pedal," Bill said, adjusting the fake mustache that was peeling off his upper lip. "If you fall off that tricycle, I am leaving you to face the wrath of a glitter-covered birthday girl," Molly warned, checking the hall. "Are you absolutely sure you disabled the alarm on the gift table?" "I unscrewed the sensor and replaced it with a lollipop. It is foolproof, unless the sensors have a refined palate for artificial cherry," Bill muttered, checking his pockets once more. "Good. Remember, the gold-foiled one is definitely a Lego set. That is our priority. Leave the puzzles; they are a logistical nightmare to fence." "What about the cake? Can I at least grab a cupcake? I skipped lunch for this," Bill asked, peering through the crack in the door. "If you touch the food, I will tell the parents you are a spy. Focus on the haul, Bill. We are in and out before the piƱata explodes." "I am shaking, Molly. Not from nerves, but from the lack of glucose," Bill whispered, shifting his grip on the duffel bag. "Suck it up, buttercup. We have a heist to finish. On three, we sprint for the pile of presents. Do not trip over the clowns." "Three, two, one," Bill counted, his voice trembling. "Let us go become the nightmares of every suburban parent in this zip code."
created_at
"2026-05-10T13:42:26.668Z"
5/10/2026, 1:42:26 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
610
moderation_latency
(null)
generation_time
3169
tokens_prompt
77
tokens_completion
711
native_tokens_prompt
67
native_tokens_completion
629
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00096025
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"QosAasKMNJSK6dkP2r-ZwQE"
is_byok
false
latency
610
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778420546-03l68NhxeEAidFRfmYEA"
session_id
(null)
api_type
"completions"
id
"gen-1778420546-geCRWJaRQPrHMeSgqirM"
upstream_id
"QosAasKMNJSK6dkP2r-ZwQE"
total_cost
0.00096025
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
75.16%
Matches word count
n/a
neededClean
false
words
487
0.00%
Dialogue to Total Word Ratio
Ratio: 70.12%, Deviation: 20.12%
neededClean
false
wordsTotal
492
wordsDialogue
345
37.5778%