NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-19-01KS0V174XDNKTMQ1GR97XY3VT
Run Details
Model:
Gemini 3.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 38s
Cost:
$0.237412
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The surveillance van was a rusted 1998 Dodge Caravan that smelled intensely of stale onion rings and desperation. Molly pressed her face against the passenger window, green spy binoculars fogging up with her rapid breaths. Across the street, a suburban fortress was adorned with inflatable unicorns and a banner reading Happy Birthday. It was a terrifying gauntlet of domestic bliss. Under her heavy black boots, the floorboards vibrated with distant juvenile pop music. Bill asked, "How is the perimeter?" He was busy dusting powdered sugar off his black tactical turtleneck, looking too relaxed for a man about to commit a felony. Molly muttered, "Hostile. We have twelve targets under age seven, armed with red food dye and kazoos. The clown looks like he has seen some things." She adjusted the focus ring on her magnifying optics, tracking a frantic mother who was currently wrestling a screaming toddler into a dinosaur suit. Nearby, the hired clown leaned against a fencepost, looking like a man who had lost his soul in a tragic balloon modeling accident. Bill asked, "Do we have a blueprint?" Molly quickly unfolded a greasy Map of the World placemat. With a thick silver marker, she had drawn a desperate series of tactical arrows leading toward the sunroom. Molly whispered, "The target is the gift table in the sunroom. We strike during the pinata incident, right when the candy drops." She tapped the laminated placemat for emphasis, her dark fingernails making a sharp clicking sound. She knew the sugar rush from the upcoming candy shower would create a localized riot, completely blinding parental supervision for at least ninety seconds. It was their only opening. Bill asked, his voice trembling slightly, "What about Brenda? She has eyes like a hawk and a clipboard." He was referring to Timmy’s mother, a legendary figure in suburban crime prevention. Molly said, "If Brenda spots us, we run. That woman managed a PTA budget with terrifying efficiency; she has no mercy." Bill reached into his black tactical vest, slowly rummaged past a coil of heavy rope, and pulled out a juice box. Bill said, "Ready. I also brought extra large trash bags for the giant Lego sets." Molly sighed, pinching the bony bridge of her nose to stave off an impending headache. Molly warned, "No glitter, Bill. If you get glitter in this van again, I swear I will leave you in the ball pit." Bill muttered, "It was an accident. The musical birthday cards exploded." Molly commanded, "Put on your ears, check your sugar levels, and prepare for breach." Bill muttered, "This is grape flavor. It is vital for maximum tactical stamina." They both reached into a cardboard box and fished out two pairs of plush, oversized Mickey Mouse ears. Bill strapped his set securely under his chin, adjusting the elastic band over his tactical balaclava. Molly did the same, carefully angling her plush black ears to a jaunty, highly professional angle. They opened the sliding side door and stepped into the sunshine.
created_at
"2026-05-19T19:22:34.02Z"
5/19/2026, 7:22:34 PM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1588
moderation_latency
(null)
generation_time
97430
tokens_prompt
77
tokens_completion
18891
native_tokens_prompt
67
native_tokens_completion
26368
native_tokens_completion_images
0
native_tokens_reasoning
25727
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.2374125
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"ergMau6lCoqq_uMP876xiQY"
is_byok
false
latency
1588
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779218554-2L9Sesl2Y1A3r6nmwjkG"
session_id
(null)
api_type
"completions"
id
"gen-1779218554-D0V9wNdRayha46lVXJQF"
upstream_id
"ergMau6lCoqq_uMP876xiQY"
total_cost
0.2374125
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 30.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
150
100.0000%