NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTXTFV3AXPD2T4BNES0RXWT
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3.5s
Cost:
$0.000984
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
60.28%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
30.1383%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon pink tutu and glared at the invitation stuck to the dashboard of their getaway van. Bill sat in the driver seat, struggling to squeeze his massive frame into a vest embroidered with dancing cupcakes. The mission was absurd, yet the potential haul was significant. They were targeting a birthday party for the son of a tech billionaire who insisted on gifting gold-plated gaming consoles and rare imported action figures. "Check the perimeter again Bill," Molly whispered, her voice tight with professional tension. "Do you see a clown?" "No clown," Bill said, adjusting his plastic party hat. "Just a trampoline and a very aggressive golden retriever." "We cannot have a dog barking mid-heist," Molly insisted. "Did you bring the sedative-laced hot dogs?" "I ate them," Bill admitted, looking sheepish. "They smelled like gourmet sausages, Molly. I am only human." Molly pinched the bridge of her nose. "You are an amateur. Fine, change of plan. You distract the parents with your terrible balloon animal skills while I slip into the pantry through the window. That is where they store the surplus boxes before the cake ceremony." "Why do I always have to be the balloon clown?" Bill grumbled. "People expect me to be funny. I have the personality of a brick wall." "Your suffering is part of the job description. Just inflate the long ones and look jolly. If a child asks for a poodle, make a lump and tell them it is a cloud. They will believe anything." "And if they ask for a refund?" "You tell them it is conceptual art," Molly snapped. "Focus. See that gate? It swings open when the catering truck enters. We ride in, grab the stash, and vanish before the piñata explodes." "What if the piñata is rigged with alarms?" "It is a cardboard donkey, Bill, not a bank vault. Stop overthinking the physics of festivities." Bill checked his watch. "The caterers are pulling up now. I suppose I should practice my clown face. Does this look like a cheery balloon artist?" He scrunched his eyes and bared his teeth in a terrifying imitation of a grin. Molly shuddered. "That looks like a homicidal woodland creature. Tone it down. Aim for vacant and confused." "Vacant and confused is my natural state," Bill noted. "Perfect. You are ready for the big leagues. Grab the nylon sack. We only have three minutes before the cake arrives, and I refuse to share the goods with a hoard of sticky-handed seven-year-olds." "Do we leave any presents behind for the kid?" Molly slapped his shoulder. "If you show mercy, you are out of the crew." "Understood," Bill sighed. "No mercy for the birthday boy." They stepped out into the bright lawn, two criminal masterminds draped in cheap glitter and polyester. The van doors creaked shut, and they marched toward the bouncy castle, ready to steal joy one box at a time.
created_at
"2026-03-03T22:41:41.744Z"
3/3/2026, 10:41:41 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
535
moderation_latency
(null)
generation_time
3394
tokens_prompt
77
tokens_completion
742
native_tokens_prompt
67
native_tokens_completion
645
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00098425
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
535
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772577701-zH6LMj8Trem77H0sm02m"
upstream_id
(null)
total_cost
0.00098425
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
60.28%
Matches word count
n/a
neededClean
false
words
485
0.00%
Dialogue to Total Word Ratio
Ratio: 64.97%, Deviation: 14.97%
neededClean
false
wordsTotal
491
wordsDialogue
319
30.1383%