NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-19-01KS0TT29N79D4FM6N5JY407F8
Run Details
Model:
Gemini 3.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 34s
Cost:
$0.236053
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
99.81%
Dialogue to Total Word Ratio
(details)
99.9040%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The beaten-up transit van smelled of stale doughnut glaze, cheap damp carpets, and desperation. Molly peered through her high-powered binoculars, her eyes tracking the suspicious movement of a clown juggling dangerously sharp plastic bowling pins. Beside her, Bill was struggling to squeeze his enormous frame into a neon-green spandex bodysuit that made him look like a severely depressed, overgrown garden vegetable. "Remind me why I am wearing this, Molly," Bill wheezed, adjusting the waistband. "Because the theme is alien invasion, Bill. You are our distraction. If you do not blend in, we do not get the loot. Have you visualized the prize?" Molly whispered. "I am thinking of the Legos. They are beautiful." Molly tapped the laminated, slightly stained schematic of the suburban living room layout. The critical point of entry was the dog flap in the kitchen, currently guarded by an exceptionally lazy, elderly Golden Retriever who could be bribed with a single slice of premium ham. The target vault was a folding table covered in shiny paper, situated dangerously close to the bounce house. Sugar-crazed children represented their primary threat. "Is the getaway vehicle fully prepped for our escape?" Molly asked. "I double-checked the sliding door. And I bought those juice boxes." Bill said. "Focus, Bill. This is the big leagues. Twenty toddlers. One massive pile of gifts. We need to be surgical. No mistakes." Molly snapped. "Do we have to steal the bouncy castle too?" Molly sighed deeply, slowly rubbing her aching temples. She sometimes wondered how Bill had survived this long in the cutthroat criminal underworld, let alone how he managed to parallel park. She checked her watch. It was exactly two. On the lawn, high-pitched screams indicated that the pinata had finally been breached, releasing a massive frenzy of wild sweets-deprived children. This was the chaotic window they needed. She gripped her empty, stolen burlap sack and checked her flashlight. "They are distracted now. Go, my green alien." Molly hissed. "I will do my best, boss. I am ready." Bill mumbled. "Remember, no survivors left empty-handed. We want every single box from that table." Molly insisted. "Even the wool socks from their grandmas?" "Especially the socks. Brand new apparel has very high resale value. Now, move before they eat the cake." Bill nodded solemnly, his weird cucumber-like silhouette bracing for immediate, high-stakes action. Molly slipped her black sleeping mask down over her eyes, realized she could no longer see absolutely anything at all, and pushed it back up onto her forehead with an irritated grunt. She slammed her palm against the dented van dashboard, signaling the start of the operation. The sliding door creaked open, exposing them to the sweet, chaotic scent of frosting, grass, and imminent backyard crime. In her mind, she was already counting the proceeds from the stolen plastic toy trains and action figures. She followed Bill out cautiously into the blinding afternoon sun, ready to make absolute criminal heist history or, at the very least, secure some cold, high-quality juice boxes.
created_at
"2026-05-19T19:18:39.674Z"
5/19/2026, 7:18:39 PM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1615
moderation_latency
(null)
generation_time
93995
tokens_prompt
77
tokens_completion
6419
native_tokens_prompt
67
native_tokens_completion
26217
native_tokens_completion_images
0
native_tokens_reasoning
25547
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.2360535
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"j7cMavSXNMu4sOIPxqq6yQ8"
is_byok
false
latency
1614
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779218319-9dZPMscA7vt2XPtSoum0"
session_id
(null)
api_type
"completions"
id
"gen-1779218319-6jBxrqPzddHXU5Oq5WCP"
upstream_id
"j7cMavSXNMu4sOIPxqq6yQ8"
total_cost
0.2360535
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
99.81%
Dialogue to Total Word Ratio
Ratio: 29.80%, Deviation: 0.20%
neededClean
false
wordsTotal
510
wordsDialogue
152
99.9040%