NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-19-01KS0TP5ZX8QG6MR6WG24STXPP
Run Details
Model:
Gemini 3.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 24s
Cost:
$0.207901
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The cramped back of the rusty surveillance van smelled heavily of stale cheese puffs and ambition. Molly adjusted her heavy green dark vision goggles, even though it was three o'clock in the sunny afternoon. She dramatically pointed a cracked telescoping plastic pointer at a crude drawing of a massive striped bouncy castle. "This is the drop zone," Molly whispered. "Timmy’s sixth birthday basin of greed." "Why are we whispering?" Bill asked. "We are parked three blocks away." "Tactical discipline, Bill. If we lose focus, the clown wins." "I still do not like this," Bill sighed. "Stealing from children feels wrong. What about our code?" "Our code says we steal valuables. Have you seen the price of lego sets lately? It is practically liquid gold." "But a bouncy castle is a logistical nightmare to transport." "We do not steal the castle, idiot. We steal the loot pile." Molly furiously tapped the glossy map with her pointer. Bill nervously chewed on a sweet red licorice rope, his sweating face severely pinched with moral anxiety. He uncomfortably adjusted his ridiculously itchy wool ski mask, which was far too hot for this sweltering July weather. "How do we bypass the suburban mothers?" Bill asked. "We use the juice box distraction," Molly replied. "Moms cannot resist recycling issues." She proudly pulled out a massive blueprint. It was extensively decorated with colorful crayon drawings because she had cleverly stolen it from a local kindergarten art class earlier that morning just to save money on drafting paper. "That is a picture of a duck," Bill said. "It is a disguise," Molly sharply snapped. "The duck represents the gift table." "And the sun with a smiley face?" "That is the perimeter guard, a golden retriever named Buster." "Can we bribe him?" Bill asked. "With premium beef jerky," Molly nodded. "I have three sticks of jerky ready." Bill squinted anxiously through the dusty, tinted window of the cramped van. A bright yellow minivan pulled up to the driveway, and a tall clown carrying a massive balloon pump stepped out. Bill gasped in absolute, genuine terror. "We did not prepare for a professional balloon artist," Bill whimpered. "Calm down," Molly ordered. "He is just a man in makeup." "He has a squeaky nose, Molly. That is psychological warfare." "We stick to the plan. At four, the cake arrives." "And that is our window?" Bill asked. "Yes. Sugar shock hits. The kids go wild, the parents get distracted trying to stop the screaming, and we sweep the pile." Molly sharply zipped up her black canvas duffel bag. She offered Bill a small, strange weapon. It was a neon orange water gun filled with sticky grape juice. She smiled wickedly, her teeth flashing in the dim light. "Is this absolutely necessary?" Bill asked, nervously hefting the bulky plastic toy. "If anyone blocks our exit, you ruin their expensive silk clothes," Molly said. "That is incredibly evil," Bill whispered with deep admiration. "I know. Now pull your mask down. It is play time."
created_at
"2026-05-19T19:16:32.393Z"
5/19/2026, 7:16:32 PM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1451
moderation_latency
(null)
generation_time
84282
tokens_prompt
77
tokens_completion
6062
native_tokens_prompt
67
native_tokens_completion
23089
native_tokens_completion_images
0
native_tokens_reasoning
22414
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.2079015
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"ELcMaqzUIJfF-sAP467JgQs"
is_byok
false
latency
1451
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779218192-zfTtHFzomYr5UZvuCEuY"
session_id
(null)
api_type
"completions"
id
"gen-1779218192-Eww3DDrP8R287JxnNJ79"
upstream_id
"ELcMaqzUIJfF-sAP467JgQs"
total_cost
0.2079015
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 50.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
250
100.0000%