NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR92CY1X4TT5V7C29FA6E0P4
Run Details
Model:
Gemini 3.1 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4.4s
Cost:
$0.001004
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
45.2419%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the garden hedge and checked her watch. The backyard was a kaleidoscope of inflatable castles and frantic toddlers, all screaming in a frequency that rattled her teeth. Bill crouched beside her, struggling to adjust a felt fairy mask that kept sliding over his left eye. Did you secure the perimeter, Bill? Molly hissed, tapping the side of her headset. The slide is guarded by a six-year-old with a glitter-encrusted juice box, Bill whispered back, his voice thick with suppressed panic. He looks like a narc, Molly. A pint-sized, sticky-fingered narc. Ignore the toddler. Focus on the haul. Is the gift table still unattended? It is currently overseen by a clown who smells faintly of despair and cheap gin. He is staring at a balloon animal like it just insulted his mother. Perfect. The clown is the distraction. Why must we steal presents from literal infants, Molly? I feel like my soul is eroding. Can we not target a jeweler or perhaps a billionaire with a vault? Billionaires have laser grids and sharks, Bill. This party has a bowl of lukewarm punch and a bouncy castle. It is a victimless crime if you consider that these children have too many toys anyway. It is practically a community service. I am pretty sure stealing from a birthday party is a crime against humanity. My moral compass is spinning like a top. Your moral compass is taped to your wallet, so stow it. Do you have the grab bags? I have the sacks. They are pink with cartoon puppies on them. I feel ridiculous. The puppies keep the parents away. They assume we are eccentric caterers. Now, listen. When I give the signal, you create a diversion. Pop the bouncy castle. Pop it? Like, with a knife? That is a level of carnage I was not prepared for today. Use the safety scissors in your utility belt. Do it now. The castle will collapse in seconds, and there will be chaos. Are you sure about this? I am sure about the pile of designer boxes currently sitting behind the cake table. Those Legos are worth a fortune on the black market, Bill. Legos. We are risking prison for plastic bricks. We are risking prison for early retirement in the tropics. Move. Bill sighed, tucked his fairy wings under his vest, and crawled toward the castle. Molly watched as he popped the inflatable wall with the precision of a surgeon. A high-pitched screech erupted from the structure, followed by a tidal wave of wailing children. The clown jumped, dropping his balloon sword. Now, Bill! Go, go, go! Bill scrambled toward the table, shoveling boxed presents into the puppy bags. Molly sprinted from the hedge, grabbing a stack of gift cards and a particularly heavy-looking box. They ran for the fence, the sound of parental shouting rising behind them. Drop the extra teddy bears, Bill! They are dragging you down! I am keeping the bear! It is a high-quality plush! Leave the bear, you idiot! We are heading for the getaway tricycle!
created_at
"2026-05-10T13:49:31.588Z"
5/10/2026, 1:49:31 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
619
moderation_latency
(null)
generation_time
4255
tokens_prompt
77
tokens_completion
773
native_tokens_prompt
67
native_tokens_completion
658
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00100375
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"64wAatyaMauEz7IP15PxeA"
is_byok
false
latency
619
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778420971-nywn1wSJeILS71niMGjo"
session_id
(null)
api_type
"completions"
id
"gen-1778420971-3qdQTMPzsm5pDnX0KLXb"
upstream_id
"64wAatyaMauEz7IP15PxeA"
total_cost
0.00100375
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
510
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
518
wordsDialogue
0
45.2419%