NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-19-01KS0TMX69VXST494EGQVS9B9K
Run Details
Model:
Gemini 3.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 41s
Cost:
$0.242542
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
99.36%
Dialogue to Total Word Ratio
(details)
99.5543%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her wool ski mask, even though the summer heat inside the rusted station wagon was quickly approaching triple digits. She smoothed out a crumpled napkin on the dashboard, which featured a detailed layout of the suburban backyard drawn in pink glitter gel pen. Outside, the target loomed. It was a giant, brightly colored bouncy castle shaped like a dragon, vibrating violently under the relentless force of twelve hyperactive children. "We have a three minute window when the magician starts. That is when the parents will be distracted by the terrible card tricks. We slip in and grab the cargo." Molly whispered, her voice tight with the false confidence of a seasoned mastermind. "Do you think they have pigs in a blanket? I cannot work with an empty stomach." Bill sighed deeply, chewing on a plastic straw, his eyes locked on the buffet table visible through the fence. "Focus, Bill! We are here for the loot. Timmy is spoiled, his wealthy grandparents bought him a miniature electric ATV." Molly slapped the dashboard, sending a shower of loose pennies into the dark footwell. She glared through her poorly cut out mask. "What if the dinosaur wrapping paper is too loud? That kind of noise travels fast." Bill asked, his stomach letting out an incredibly loud growl as he nervously rubbed his gloved hands together. "We are clearing the entire gift table. Lego sets, video games, cash envelopes, everything goes into the sacks." Molly insisted, gesturing wildly at the drawing. "There is a clown. You did not mention a clown. I have rules." Bill adjusted his rusty binoculars, squinting at a terrifying figure in giant, oversized shoes making balloon animals by the patio. "He is a teenager named Kevin. Throw glitter in his eyes if he gets close." Molly said, waving her gloved hand dismissively. "Glitter is a permanent hazard under international law. It never truly washes out." Bill shook his large head solemnly, his black mask shifting awkwardly on his nose. "The magician is starting. Strap on your juice box holster and get moving." She checked her cheap wristwatch as a fancy man in a tall top hat stepped onto the lush green lawn. Molly declared, nudging his shoulder to prompt immediate action. Bill grumbled but reached into the cooler, strapping a heavy velcro belt of apple juice pouches across his chest to maintain their hydration. He grabbed his primary weapon, a brightly colored plastic bubble blower shaped like a friendly dolphin designed to create a tactical smoke screen if they were cornered by angry moms. Molly opened her squeaky door, keeping extremely low behind the overgrown, prickly bushes. The warm air smelled heavily of cheap sunscreen, charred beef hot dogs, and impending sweet triumph. They crept toward the wooden fence, their black turtlenecks already dripping with sweat under the relentless afternoon sun. This was destined to be their biggest score yet, or at least, the only one that did not involve a security guard. They reached the gate, and Molly raised her hand.
created_at
"2026-05-19T19:15:50.605Z"
5/19/2026, 7:15:50 PM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1595
moderation_latency
(null)
generation_time
100888
tokens_prompt
77
tokens_completion
7781
native_tokens_prompt
67
native_tokens_completion
26938
native_tokens_completion_images
0
native_tokens_reasoning
26309
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.2425425
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"5rYMau_zLfWvjrEP7cOHgQ0"
is_byok
false
latency
1582
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779218150-mlsOBUOMXy5hahhtY2lr"
session_id
(null)
api_type
"completions"
id
"gen-1779218150-cRISSj9QLeM2dMvhnHKI"
upstream_id
"5rYMau_zLfWvjrEP7cOHgQ0"
total_cost
0.2425425
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
504
99.36%
Dialogue to Total Word Ratio
Ratio: 30.36%, Deviation: 0.36%
neededClean
false
wordsTotal
504
wordsDialogue
153
99.5543%