NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-19-01KHVPB0ZDWZVV12D3Z88PN5N8
Run Details
Model:
Gemini 3.1 Pro (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 44s
Cost:
$0.173222
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"The blueprints indicate a heavy perimeter. Two inflatable bouncy castles flank the patio, and a crazed mob of second graders is currently patrolling the backyard. What is the exact status on our primary payload?" Molly tapped a red circle onto the stolen map while adjusting the collar of her tactical sweater. She squinted through the windshield. "The gift table is situated precisely between the juice boxes and the petting zoo. We are looking at an unprecedented haul this weekend. Legos, video games, highly armed action figures, and at least four expensive remote controlled drones. The host mother strictly enforced a physical merchandise only rule on the invitation." Bill wiped an anxious bead of sweat from his forehead and peered intently through his binoculars at the wrapped boxes glimmering under the sun. "That is incredibly lucrative. What is the infiltration strategy? Those suburban mothers have hawkeye vision and the neighborhood watch is notoriously ruthless." Molly pulled a deflated rubber chicken from her bag and proudly tossed it onto the dashboard. "I am going in disguised as the hired entertainment. Specifically, Chuckles the clown. You will be concealed inside my oversized prop trunk. Once the children gather around for my juggling routine, you slide out the trapdoor, low crawl beneath the folding tables, and silently bag the wrapped merchandise." Bill stared blankly at the sad toy before shifting his amazed gaze back to his confident partner. "Are you completely out of your mind? Have you ever performed for an eight year old? If you drop a single juggling ring, those kids will tear you limb from limb. They are absolute savages when denied quality entertainment." She slowly rolled her eyes in extreme annoyance and crossed her arms tightly over her protective tactical vest. "I have been practicing for three whole days. You just focus on securing all the birthday presents. I will expertly handle the crowd control." He nervously chewed on his left thumb while heavily sighing. "I still strongly think we should initiate a tactical rappel from the large oak tree directly onto the gift pile. A quick snatch and grab." Molly slapped her gloved hand against the dark leather steering wheel. "Absolutely not. There is far too much aerial exposure. If one parent catches a glimpse of a grown man descending from the branches in full stealth gear, they will call the police immediately. We stick to the clown routine. Now, carefully review the extraction protocol." Bill sighed deeply, surrendering to the terrible plan, and quickly pulled a crumpled piece of notebook paper from his vest pocket. "When the festive donkey is finally breached by the birthday boy, the candy will scatter. We initiate protocol candy rush. Using the ensuing chaotic riot as our primary distraction, I will deploy the confetti smoke grenade. We grab the massive sack of toys, sprint through the side gate, and vault the fence." Molly smiled a wicked grin and turned the steel ignition key. "Perfect. Let us go literally ruin a magical suburban childhood."
created_at
"2026-02-19T19:34:27.405Z"
2/19/2026, 7:34:27 PM
model
"google/gemini-3.1-pro-preview-20260219"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3236
moderation_latency
(null)
generation_time
163960
tokens_prompt
77
tokens_completion
4767
native_tokens_prompt
67
native_tokens_completion
14424
native_tokens_completion_images
0
native_tokens_reasoning
13819
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.173222
router
(null)
provider_responses
0
endpoint_id
"2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok
false
latency
3236
model_permaslug
"google/gemini-3.1-pro-preview-20260219"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771529667-JvrPWCZfwP5F56kQBhYT"
upstream_id
(null)
total_cost
0.173222
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 70.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
350
100.0000%