NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-20-01KS25TJ2T4DM5ABN8QTDPN82Y
Run Details
Model:
Gemini 3.5 Flash (Reasoning, Minimal)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4.4s
Cost:
$0.005789
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
3.62%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
1.8117%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the strap of her tactical diaper bag and frowned at the floor plan of the bounce house. Bill was busy practicing his balloon animal technique, which so far had only resulted in squeaks of terror and several popped latex casualties. "Put the dog down, Bill," Molly whispered, her voice laced with the tension of a woman who had spent three weeks studying the nap schedules of twenty toddlers. "We have a four-minute window between the cake cutting and the pinata massacre. If we miss it, we are staring down the barrel of a sugar-laden riot." "I’m trying to make a giraffe," Bill muttered, his thumbs tangled in yellow rubber. "It’s my cover story. If a six-year-old questions my presence, I present the giraffe. Boom. Perfect assimilation." "You look like a giant thumb wearing a clown nose," Molly said, tapping her finger on the map. "Now focus. The target is the gift table in the northeast corner of the conservatory. It is guarded by a formidable barricade of helium balloons and a very alert grandmother named Beatrice." "Beatrice is eighty-two. I can take her," Bill scoffed, finally giving up on the balloon and tossing the deflated yellow lump onto the table. "What’s her weapon of choice? A walkers? A knitted shawl?" "A highly weaponized guilt trip and a purse full of hard strawberry candies," Molly warned. "Do not underestimate her. Last year at the charity bake sale, she tackled a man for taking the last lemon bar. Now, your job is the distraction. You are going to trigger the sprinkler system." "By pulling the fire alarm?" Bill asked, his eyes lighting up. "I love the classics." "No, you idiot. If the fire alarm goes off, the parents evacuate the building with the children, creating a stampede. We’ll be crushed under a wave of light-up sneakers." Molly leaned in closer, her eyes glittering with criminal genius. "You are going to feed the birthday boy, Timmy, a massive spoonful of red velvet cake. The kid is highly sensitive to food dye. Once the red forty hits his system, he will become a tornado of chaotic energy. He will dismantle the bouncy castle from the inside. That is your diversion." "That is sick," Bill said, staring at her with newfound respect. "It’s brilliant, but it’s sick. Where do we put the loot?" "The juice boxes," Molly replied, pointing to a wagon parked near the kitchen door. "We dump the juice, line the cooler with bubble wrap, and load the Lego sets first. They have the highest street value. Then we grab the Nintendo Switches and go." "What about the hand-knit sweaters from the aunts?" Bill asked. "Leave them. We have standards, Bill. We are thieves, not monsters." Molly checked her watch and gave a sharp nod. "It’s time. Put on your party hat and let’s secure the bag."
created_at
"2026-05-20T07:50:24.606Z"
5/20/2026, 7:50:24 AM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
639
moderation_latency
(null)
generation_time
4323
tokens_prompt
77
tokens_completion
713
native_tokens_prompt
67
native_tokens_completion
632
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0057885
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"wGcNaq-2Lfni_uMP_avumQI"
is_byok
false
latency
638
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263424-4J6s0uGGTt0pxi9TG8fv"
session_id
(null)
api_type
"completions"
id
"gen-1779263424-PMeMoTkE5LiDbBUm6bop"
upstream_id
"wGcNaq-2Lfni_uMP_avumQI"
total_cost
0.0057885
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
3.62%
Matches word count
n/a
neededClean
false
words
476
0.00%
Dialogue to Total Word Ratio
Ratio: 69.77%, Deviation: 39.77%
neededClean
false
wordsTotal
483
wordsDialogue
337
1.8117%