NC Bench
Run: 2026-02-19-01KHVPB8CC7ZD90RYG9P1RXG79
Run Details
Model: Gemini 3.1 Pro (Preview)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 4m 37s
Cost: $0.295226
Provider: Google AI Studio
Evaluation summary
Matches word count: 100.00%
Dialogue to Total Word Ratio: 100.00%
Overall: 100.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Explain the secret entry point again," Molly said, wiping sweat from her forehead inside their van. "We breach through the bouncy castle," Bill whispered, adjusting his plastic binoculars. "There is a mesh flap by the slide. I watched the yard guards." "You mean the parents?" Molly asked, tapping her fingers against the steering wheel. "Yes, the parent sentries," Bill confirmed, keeping his vigilant eyes locked onto the suburban backyard. "They are distracted. The father is struggling to open a collapsed folding table. It is a severe security vulnerability. We move when they bring out the cake." "What about the target package?" Molly tapped the dashboard impatiently. "The gift table is located in the splash zone of the hanging piƱata. Once the blindfold goes onto the dizzy birthday boy, chaos reigns. We strike." "Are you confident you packed the right tactical gear for this specific operation?" Molly asked quietly. "I have smoke grenades, heavy duty plastic zip ties, and a professional grade balloon animal arrangement kit," Bill stated proudly to his serious partner. "Then why the balloon kit, Bill?" Molly tilted her head sideways, expressing her evident confusion. "Plausible deniability. If cornered by a toddler, I fashion a sharp poodle." "Focus," Molly commanded, glaring at him. She hated when he brought unpredictable craft supplies to a heist. "We are walking away with every single building block set and motorized scooter today. Understand?" "I understand we are robbing a seven year old named Timothy," Bill sighed, leaning heavily back into his stained fabric seat. The springs creaked loudly. "Did you see the size of that spectacular corner pile? Our fence is going to weep when he sees this haul." "Exactly. So we absolutely need precision. Walk me fully through the complex extraction." "I secure the perimeter using juice boxes," Bill explained confidently. "I scatter them to create a sticky trap for the pursuing adults. 
You sweep the table, bagging the presents." "The dirty sack that looks exactly like a common household garbage bag?" Molly asked drily. "It is a camouflage device. To the untrained eye, we are just the party cleanup crew. To the trained eye, we are ghost operatives." "What if the hired party clown interferes in our escape route?" Molly asked, looking at the aggressively colorful figure entertaining screaming children. The clown honked a red horn. "We take him out," Bill stated firmly. "No half measures." "With brutal physical violence?" Molly raised one dark eyebrow suspiciously. "With a classic whoopee cushion firmly planted to the painted face. It is psychological warfare. He will never fully recover his professional dignity." Bill proudly mimed the proposed nonlethal trap assault. "That is coldly brilliant, Bill. Now, review the primary threat." Molly nodded, genuinely impressed. She deeply respected a fellow opportunistic criminal who knew how to dismantle a man's fragile soul. "Sugar crash. In twelve minutes, the cupcakes hit their chaotic bloodstreams. The children will possess the terrifying speed and endless ferocity of cornered wolverines. We must be gone before the frosting frenzy begins."
created_at: "2026-02-19T19:34:34.988Z" (2/19/2026, 7:34:34 PM)
model: "google/gemini-3.1-pro-preview-20260219"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 3578
moderation_latency: (null)
generation_time: 276556
tokens_prompt: 77
tokens_completion: 8779
native_tokens_prompt: 67
native_tokens_completion: 24591
native_tokens_completion_images: 0
native_tokens_reasoning: 23929
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.295226
router: (null)
provider_responses:
0:
endpoint_id: "2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok: false
latency: 3577
model_permaslug: "google/gemini-3.1-pro-preview-20260219"
provider_name: "Google AI Studio"
status: 200
api_type: "completions"
id: "gen-1771529674-UVTKqJqU05MLa8drKJ2W"
upstream_id: (null)
total_cost: 0.295226
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Matches word count: 100.00%
Details: n/a
Meta Data: neededClean: false, words: 500
Dialogue to Total Word Ratio: 100.00%
Details: Ratio: 70.00%, Deviation: 0.00%
Meta Data: neededClean: false, wordsTotal: 500, wordsDialogue: 350
Overall: 100.0000%
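The page does not show how the Dialogue to Total Word Ratio evaluator counts words, so the following is only a sketch of one plausible approach. It assumes that any text between a pair of double quotation marks is dialogue and that words are whitespace-separated tokens. The function names dialogue_ratio and deviation are hypothetical, and the benchmark's real evaluator may tokenize differently.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a prose scene."""
    # Normalise curly quotes so a single pattern covers both quote styles.
    normalised = text.replace("\u201c", '"').replace("\u201d", '"')
    # Assumption: every stretch between a pair of double quotes is dialogue.
    spoken = re.findall(r'"([^"]*)"', normalised)
    dialogue_words = sum(len(chunk.split()) for chunk in spoken)
    total_words = len(normalised.split())
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation(ratio: float, target: float = 0.70) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


sample = '"Hello there," Molly said. "Go now."'
spoken_words, all_words, ratio = dialogue_ratio(sample)
print(spoken_words, all_words)  # 4 6
```

With the counts reported above, 350 dialogue words out of 500 total gives a ratio of 0.70 and a deviation of 0.00 from the 70% target, which is consistent with the evaluator's reported result.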