NC Bench
Run: 2026-05-10-01KR8X6441QBJZEEQ3YATWBXGX
Run Details
Model: Gemini 3.1 Flash Lite (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 4.3s
Cost: $0.000971
Provider: Google AI Studio
Evaluation summary
Score | Evaluator
43.38% | Matches word count
0.00% | Dialogue to Total Word Ratio
21.6892% | Overall
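The overall figure is consistent with a plain average of the two evaluator scores. The sketch below assumes equal weighting, which the page does not state; the variable names are illustrative only.

```python
# Evaluator scores as displayed on the page, in percent.
scores = {
    "Matches word count": 43.38,
    "Dialogue to Total Word Ratio": 0.00,
}

# ASSUMPTION: the overall score is the unweighted mean of the evaluator scores.
overall = sum(scores.values()) / len(scores)

print(f"{overall:.2f}%")
```

Under that assumption the result agrees with the reported 21.6892% to two decimals, which suggests the unrounded word-count score is about 43.3784%.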
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon green tutu, the scratchy tulle prickling her waist. She glared at the invitation pinned to the wall of their van. Bill sat on a crate labeled Party Supplies, chewing a piece of bubblegum with rhythmic, annoying snaps. We are dressed like a pair of deranged clowns, Bill. This is the low point of my criminal career. It is a disguise, Molly. Camouflage. Nobody suspects a clown of grand larceny. Except maybe children, but children are easily distracted by balloons. The objective is the gift table. Why is it in the middle of a ball pit? Tactical placement. It forces us to immerse ourselves in the objective. It is practically a sensory deprivation tank built out of recycled plastic spheres. I hate ball pits. They are germ factories. If I catch a cold, I am putting it in your file. Focus. The cake ceremony is at three. That is when the clown performance begins, and the parents retreat to the patio for artisan gin. And the dog? The giant, drooling Saint Bernard guarding the entrance? I have the bacon-flavored tranquilizer darts. One tap, and he will sleep through a hurricane. You thought of everything, haven't you? I read the brochure for the venue. They serve gluten-free cupcakes. It is a high-stakes environment. I am not stealing these presents for the sheer joy of it. I want the vintage action figures I know are in that pile. The kid is ten. He does not need a mint-condition 1984 space voyager. He has been spoiled, Molly. We are merely redistributing wealth. It is civic-minded theft. I am just here for the heist, not the moral justification. When do we move in? The entertainer arrives in ten minutes. We slip in behind him, swap his red nose for a camera, and commence the operation. What if the parents try to tip us? Take the money. It pays for the gas for the getaway. You are a degenerate, Bill. And you are currently wearing a wig that looks like a tragic shedding poodle. Should we go? Fine. 
But if I have to endure one more balloon animal request, I am abandoning the haul and taking the ice cream machine instead. Deal. But let us prioritize the boxes wrapped in gold foil first. Those are the electronics. Grab the canvas bags. And Bill? Yes? If you trip in that ball pit, I am leaving you to be pelted by toddlers. Understood. I would expect nothing less. Molly checked her reflective nose in the vanity mirror, adjusted her oversized polka-dot tie, and opened the van door. The faint, high-pitched shrieks of sugared-up eight-year-olds drifted across the lawn. She took a deep breath, checked her lockpicks hidden beneath her clown glove, and stepped into the chaos. Bill followed, tripping slightly over his giant floppy shoes, looking every bit the criminal mastermind pretending to be a party entertainer.
created_at: "2026-05-10T12:18:25.545Z" (5/10/2026, 12:18:25 PM)
model: "google/gemini-3.1-flash-lite-20260507"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 594
moderation_latency: (null)
generation_time: 4134
tokens_prompt: 77
tokens_completion: 736
native_tokens_prompt: 67
native_tokens_completion: 636
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.00097075
router: (null)
provider_responses:
  0:
    endpoint_id: "237af107-d2e9-46c7-b146-c74d75980dee"
    id: "kXcAaqjaLtznz7IPpNyL4QI"
    is_byok: false
    latency: 594
    model_permaslug: "google/gemini-3.1-flash-lite-20260507"
    provider_name: "Google AI Studio"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1778415505-pdjkYCT0rNQPQQHNSEfv"
session_id: (null)
api_type: "completions"
id: "gen-1778415505-Al2XBIAlim4nIkWrCuaJ"
upstream_id: "kXcAaqjaLtznz7IPpNyL4QI"
total_cost: 0.00097075
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
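As a sanity check on the reported cost, the sketch below prices the native token counts at per-million-token rates. The two rates are assumptions, chosen only because they reproduce the reported total; the page does not state the provider's pricing, and the helper name is hypothetical.

```python
# Token counts reported in the metadata (native_* fields).
native_tokens_prompt = 67
native_tokens_completion = 636

# ASSUMPTION: USD per million tokens. These rates are not given on the page.
PROMPT_RATE = 0.25
COMPLETION_RATE = 1.50


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return an estimated cost in USD for one request."""
    weighted = prompt_tokens * PROMPT_RATE + completion_tokens * COMPLETION_RATE
    return weighted / 1_000_000


cost = estimate_cost(native_tokens_prompt, native_tokens_completion)
print(f"{cost:.8f}")
```

With these assumed rates the estimate equals the reported total_cost of 0.00097075. The same calculation on tokens_prompt and tokens_completion (77 and 736) would not.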
Evaluation details
Result | Evaluator | Details | Meta Data
43.38% | Matches word count | n/a | neededClean: false, words: 483
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 70.00% | neededClean: false, wordsTotal: 493, wordsDialogue: 0
21.6892% | Overall
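The zero dialogue count is what one would expect if the evaluator identifies dialogue by quotation marks, since the response contains none. The sketch below illustrates that idea; it is an assumption about the method, not the benchmark's actual implementation, and the function names and sample sentences are hypothetical.

```python
import re

# ASSUMPTION: dialogue is any text enclosed in straight or curly double quotes.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue words, total words, dialogue share of total)."""
    total = len(text.split())
    spoken = sum(
        len((m.group(1) or m.group(2) or "").split())
        for m in QUOTED.finditer(text)
    )
    ratio = spoken / total if total else 0.0
    return spoken, total, ratio


def deviation(ratio: float, target: float = 0.70) -> float:
    """Absolute distance from the requested dialogue share."""
    return abs(ratio - target)


# Unquoted speech, in the style of the model response above.
unquoted = "It is a disguise, Molly. Camouflage."
# The same kind of line with conventional quotation marks and a tag.
quoted = '"It is a disguise, Molly," said Bill.'

print(dialogue_ratio(unquoted))
print(dialogue_ratio(quoted))
```

For the unquoted sample the ratio is 0 and the deviation from the 70% target is 0.70, matching the shape of the reported "Ratio: 0.00%, Deviation: 70.00%".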