NC Bench
Run: 2026-05-20-01KS26189AX7DMSSP1RGEHS32H
Run Details
Model: Gemini 3.5 Flash (Reasoning, Minimal)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 4.6s
Cost: $0.005690
Provider: Google AI Studio
Evaluation summary
Score     Evaluator
0.00%     Matches word count
0.00%     Dialogue to Total Word Ratio
0.0000%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The smoke from Molly’s cheap cigar drifted into Bill’s face, making him blink rapidly. He adjusted his fake mustache, which was currently clinging to his upper lip by a single thread of spirit gum. They glared at the target across the street: a brightly decorated suburban house currently vibrating with the shrieks of twenty caffeinated eight-year-olds. It’s a fortress, Bill said, rubbing his temple. There’s a bouncy castle blocking the rear exit, three clowns on patrol, and a mother who looks like she’s wrestled crocodiles. We aren't fighting the mother, Bill, Molly replied, stabbing her map of the living room with a manicured fingernail. We’re hit-and-run artists. We slip in during the cake cutting. The cake cutting is a bloodbath of sticky fingers, Bill pointed out. We’ll get covered in frosting. It ruins the grip on my crowbar. You don't need a crowbar for a Barbie Dreamhouse, Molly snapped, her eyes narrowing. We need stealth. We need disguise. Hence, the lederhosen. I look ridiculous, Bill groaned, pulling at the tight leather straps. Why do I have to be the assistant entertainment? Because nobody questions a man carrying copper wire if he’s also carrying an accordion, Molly said. You play the polka, they look at you, and I empty the gift table into the laundry bags. What if they request a song I don’t know? Bill asked. My repertoire is strictly limited to ‘Weird Al’ parodies. Play those, Molly shrugged. Kids love parody. Just make sure you draw the crowd away from the foyer. That’s where the high-yield loot is. Timmy’s grandmother is loaded. I’m talking gold-plated Lego sets. Are we sure this is ethical? Bill muttered. Stealing from a child? He’s eight, Bill, Molly said, dismissing his guilt with a wave of her cigar. He doesn't need a third iPad. He needs to learn about disappointment. It’s character building. We’re doing him a favor. And the bouncy castle? Bill asked. What if it gets popped? 
That’s our distraction, Molly grinned, her eyes flashing with mischief. If things go south, you poke it with a toothpick. The sudden deflation will create a mass panic. We escape in the confusion. You’re a monster, Molly, Bill smiled, his doubts melting away. A beautiful, genius monster. I know, Molly said. Now, put on your balloon-animal apron. We go in on my mark. Wait, Bill yelled, grabbing her arm. What about the dog? The invite said they have a golden retriever. I brought premium deli ham, Molly whispered, revealing a greasy foil wrapper. He’ll be our accomplice within ten seconds. Bill adjusted his accordion straps with a dramatic sigh. Let’s go steal some childhood.
created_at: "2026-05-20T07:54:03.952Z" (5/20/2026, 7:54:03 AM)
model: "google/gemini-3.5-flash-20260519"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 736
moderation_latency: (null)
generation_time: 4500
tokens_prompt: 77
tokens_completion: 699
native_tokens_prompt: 67
native_tokens_completion: 621
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.0056895
router: (null)
provider_responses: 0
endpoint_id: "bd50061c-e1d8-469e-96b1-d322e77378f8"
id: "nGgNarqPBr-p_uMPzKyI2A4"
is_byok: false
latency: 736
model_permaslug: "google/gemini-3.5-flash-20260519"
provider_name: "Google AI Studio"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779263643-McYEc0HERvFlG4KYhBxk"
session_id: (null)
api_type: "completions"
id: "gen-1779263643-byrruWdrF1oLjLBODEBb"
upstream_id: "nGgNarqPBr-p_uMPzKyI2A4"
total_cost: 0.0056895
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
Evaluation details
Result    Evaluator                      Details                            Meta Data
0.00%     Matches word count             n/a                                neededClean: false, words: 439
0.00%     Dialogue to Total Word Ratio   Ratio: 0.67%, Deviation: 49.33%    neededClean: false, wordsTotal: 446, wordsDialogue: 3
0.0000%   Overall
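The ratio and deviation reported above can be reproduced from the recorded counts. The sketch below is not NC Bench's evaluator code: it assumes the ratio is wordsDialogue divided by wordsTotal, and that the deviation is the absolute distance from the 50% target named in the scenario. The function names and the 500-word target parameter are hypothetical, chosen for illustration. How either evaluator turns these figures into a 0.00% score is not shown on this page, so the sketch does not attempt it.

```python
def dialogue_ratio_report(words_dialogue: int, words_total: int,
                          target_pct: float = 50.0) -> tuple[float, float]:
    """Return (ratio_pct, deviation_pct), each rounded to two decimals.

    Assumption: deviation is |target - ratio| in percentage points.
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio_pct = 100.0 * words_dialogue / words_total
    deviation_pct = abs(target_pct - ratio_pct)
    return round(ratio_pct, 2), round(deviation_pct, 2)


def word_count_shortfall(words: int, target_words: int = 500) -> tuple[int, float]:
    """Return (difference in words, difference as a percent of the target)."""
    diff = words - target_words
    return diff, round(100.0 * diff / target_words, 2)


# Counts taken from the evaluation details on this page.
ratio, deviation = dialogue_ratio_report(words_dialogue=3, words_total=446)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 0.67%, Deviation: 49.33%"

print(word_count_shortfall(439))
# prints "(-61, -12.2)"
```

Note that the two evaluators record different totals for the same response (words: 439 versus wordsTotal: 446); the page does not say why, and the sketch simply uses each figure where it was reported.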