NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-19-01KHVP2JKRQRW885TK8N0RCWSG
Run Details
Model:
Gemini 3.1 Pro (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
3m 9s
Cost:
$0.193022
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled a schematic of a suburban backyard across the warm hood of their rusted surveillance van. She adjusted her dark tactical sunglasses and tapped a heavy red marker against the paper. Bill rubbed his temples and stared at the childish drawing with a look of profound disgust. He hated domestic jobs. "The primary target is the folding patio table directly next to the giant inflatable bounce castle. The wealthy parents have foolishly piled all the unopened presents in one completely unsecured centralized location." Bill crossed his arms and let out a sigh before speaking. "The bounce castle? You know that specific zone is heavily guarded by toddlers fueled by pure refined sugar and unbridled chaos. I completely hate sugar toddlers. They have absolutely no fear of death or personal injury. They will bite my ankles and call me bad names." Molly pointed the red marker directly at his chest to emphasize her strict operational authority. "Which is precisely why you are going to wear the oversized clown suit. It provides the perfect tactical distraction. When you start twisting the balloon dogs, the primary hostiles will naturally swarm your position right away. That leaves the birthday gift table entirely exposed for exactly forty seconds." Bill leaned against the van door and nervously chewed on his lower lip. "Forty seconds is simply not enough time to gather twenty heavily taped cardboard boxes. Plus I am a seasoned master of midnight stealth. I am a legendary vault virtuoso. I absolutely refuse to squeeze the stupid squeaky red rubber nose. It ruins my entire professional mystique." Molly checked the digital countdown timer strapped tightly to her left wrist. "You will squeeze the red nose because you need your cut of this massive payout. The big prize is a limited edition space laser station worth thousands on the collector black market. I need both of my hands to secure that lucrative package into our motorized getaway stroller." Bill shook his head. The sheer indignity of the situation weighed heavily upon his seasoned criminal soul. "What about the maternal grandmother? She stays relentlessly near the frosted sheet cake to guard the dessert perimeter. She is a retired gym teacher with terrible hearing and she carries a very loud silver whistle all the time." Molly packed up the paper map and handed him a brightly colored costume bag. "If she blows the athletic whistle, we will immediately abort the mission without hesitation. We drop the plastic building blocks, deploy the dense smoke grenade, and vault the wooden privacy fence behind the barbecue grill." Bill grabbed the bag with a very deep scowl and opened the rusty broken van door. "I cannot vault a tall wooden privacy fence while wearing incredibly oversized bright yellow clown shoes. We are currently planning to steal the carefully wrapped presents right from a little boy at his seventh birthday party. We crossed the final ethical line hours ago. This entire suburban operation is completely pathetic and beneath our immense criminal talents."
created_at
"2026-02-19T19:29:50.558Z"
2/19/2026, 7:29:50 PM
model
"google/gemini-3.1-pro-preview-20260219"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3535
moderation_latency
(null)
generation_time
189146
tokens_prompt
77
tokens_completion
5557
native_tokens_prompt
67
native_tokens_completion
16074
native_tokens_completion_images
0
native_tokens_reasoning
15501
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.193022
router
(null)
provider_responses
0
endpoint_id
"2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok
false
latency
3535
model_permaslug
"google/gemini-3.1-pro-preview-20260219"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771529390-TEL2Xb9AgnMaf0Aq0WYs"
upstream_id
(null)
total_cost
0.193022
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 70.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
350
100.0000%