NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-19-01KHVPNB909WFJ58ZF4TF0A00D
Run Details
Model:
Gemini 3.1 Pro (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 49s
Cost:
$0.230594
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The planning table was covered in crayon scribbles. Molly tapped her index finger against the colorful paper schematic. "We are infiltrating little Timmy's seventh birthday party tomorrow afternoon. The potential loot haul is massive, and security is looking surprisingly light for a suburban target." Bill scratched his shaved chin. "I thought we were doing a high stakes jewelry heist today. A neighborhood children's party feels below our standards." Molly smoothed out the blueprint. "You are ignoring the ridiculous street value of a pristine plastic robot dinosaur, Bill. Timmy's unbelievably rich grandmother is bringing one. It will easily fetch at least five hundred bucks." Bill nodded, pulling a notepad from his jacket. "That is incredibly solid intel, Molly. What exactly are we facing out there for perimeter security?" Molly leaned across the card table. "It is a severely hostile environment. We have fifteen screaming children hopped up on cake, plus a tight perimeter of exhausted parents drinking wine. The primary gift stash is heavily piled up right inside the dining room, located directly behind the massive inflatable bounce castle." Bill immediately dropped his plastic pen in shock. "A literal bounce castle? That is a total tactical nightmare. The structural integrity relies entirely on a loud electrical air pump. Taking off my shoes will ruin my traction." Molly let out a very loud, dramatic sigh. "You are clearly not entering the bounce castle, you absolute moron. You are working major distraction duty in the backyard." Bill wiped sweat from his forehead. "Thank goodness for that. I really despise static electricity. What is my brilliant strategy?" She forcefully tossed a heavy, colorful plastic bag over at him. "You dress up as Chuckles the Clown. When they cut the cake, you start making balloon animals. That draws the parents outside. I slip through the dog door, bag the presents, and meet you at the van." Bill stared blankly down at the ominous clown bag. "I never learned how to twist balloon animals." Molly began violently rubbing both of her temples in frustration. "Just bend them until they pop, and tell the crying toddlers they are noisy snakes." He finally cracked a bright, genuine smile. "That is clever. What if the hostiles swarm me, though?" Molly confidently scooped a thick glass jar directly off the dusty shelf. "You deploy the glitter bombs. Total visual chaos. Escape while they are blinded." Bill jotted the instructions down furiously. "Bribe them with snakes, blind them with glitter. Will the family dog be an issue?" She maintained a highly arrogant, confident smirk upon her face. "A golden retriever. I packed premium sausages. He will beg for treats before I touch the wrapping paper." Bill carefully tucked his tiny notebook safely away. "Should I actually wear the big red squeaky nose?" Molly deliberately checked the heavy silver diver watch tightly strapped to her wrist. "Obviously you wear the squeaky nose. Nobody questions a clown. We breach at exactly fourteen hundred hours, immediately after the colorful piƱata takes a dirt nap."
created_at
"2026-02-19T19:40:05.625Z"
2/19/2026, 7:40:05 PM
model
"google/gemini-3.1-pro-preview-20260219"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3466
moderation_latency
(null)
generation_time
169145
tokens_prompt
77
tokens_completion
5992
native_tokens_prompt
67
native_tokens_completion
19205
native_tokens_completion_images
0
native_tokens_reasoning
18553
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.230594
router
(null)
provider_responses
0
endpoint_id
"2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok
false
latency
3466
model_permaslug
"google/gemini-3.1-pro-preview-20260219"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771530005-bD2F7Vlczqk8E3EPOU8D"
upstream_id
(null)
total_cost
0.230594
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 70.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
350
100.0000%