NC Bench
Run: 2026-02-19-01KHVPMENGEPWPN6JCEXS6KSCG
Run Details
Model: Gemini 3.1 Pro (Preview)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3m 18s
Cost: $0.199766
Provider: Google AI Studio
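The Duration and Cost figures can be cross-checked against the raw metadata fields recorded for this run (generation_time, native_tokens_completion, native_tokens_reasoning, total_cost). The sketch below is only a rough consistency check: it assumes generation_time is reported in milliseconds, which the page does not state, and the variable names are illustrative.

```python
# Values copied from this run's metadata fields.
generation_time_ms = 197_549      # assumption: the unit is milliseconds
native_tokens_completion = 16_636
native_tokens_reasoning = 15_973
total_cost_usd = 0.199766

# Round to whole seconds, then split into minutes and seconds.
minutes, seconds = divmod(round(generation_time_ms / 1000), 60)
duration = f"{minutes}m {seconds}s"

# Completion tokens that are not counted as reasoning tokens.
visible_tokens = native_tokens_completion - native_tokens_reasoning
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Effective cost per million native completion tokens (prompt cost included).
cost_per_million = total_cost_usd / native_tokens_completion * 1_000_000

print(duration)
print(visible_tokens, f"{reasoning_share:.1%}")
print(f"${cost_per_million:.2f} per million native completion tokens")
```

Under that assumption the rounded generation time agrees with the reported duration, and most of the native completion tokens are reasoning tokens rather than visible output.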
Evaluation summary
Score      Evaluator
100.00%    Matches word count
100.00%    Dialogue to Total Word Ratio
100.0000%  Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his oversized rainbow wig while peering out the tinted windows of their surveillance minivan. The suburban lawn across the street was currently swarming with screaming children. "Are you sure about the bouncy castle?" Molly kept her field binoculars perfectly steady. "It is the only blind spot. The parents are stationed by the grill, facing the inflatable slide. If we breach the mesh netting on the west wall, we bypass the cooler entirely." Bill scratched his painted face, smearing a thick layer of white greasepaint across his cheek. "I still think we should walk through the front door. I am dressed as a party clown." She gave his ridiculous neon green suspenders a critical, disapproving stare. "A clown with neck tattoos. They will ask for a balloon animal, and we know what happened last time." He crossed his massive arms over his chest. "I made a realistic snake." She lowered the binoculars. "You tied a knot in a green tube and threw it at a toddler. This is a very important job. The intel says Timmy is getting a titanium scooter. We are not blowing this payday because you refuse to crawl." He slumped back into the passenger seat. "Fine. So we breach the bouncy castle. Then what?" A wicked grin slowly formed upon her lips as she reviewed the tactical map sitting on her lap. "We wait for the piñata deployment. When the father blindfolds the birthday boy, chaos will reign. Candy hits the grass, every child dives for chocolate, and the parents pull out their phones to film the carnage." "The perfect distraction. I hit the gift table." She handed him a large black canvas sack. "Exactly. You sweep the wrapped boxes into the duffel bag. Shake them first." "What if I shake a box and it contains a hamster?" "Timmy is allergic to pet dander. No hamsters. If it rattles, it is a building set. Take it. If it is squishy, it is clothes. Leave it." "What about the envelope basket?" She tapped a tiny glass vial sitting in the cupholder. 
"I will grab the envelope basket. Grandma Florence uses hundred dollar bills, and she tape seals her cards." He let out a heavy sigh, sounding completely exhausted. "This is complicated. Do they even have a decent cake?" "We are not here to eat the cake." "If I am risking a sugar stampede, I want chocolate buttercream. Have you seen a little boy hyped up on fruit punch?" "Stick to the plan. We are in and out. If anyone questions you, honk your nose and aggressively throw confetti." "I ate the confetti." She stopped rummaging through her equipment bag and stared at him. "You ate the paper confetti?" He shrugged apologetically. "I thought it was sprinkles. It was in a shaker." She simply shook her head and pointed strictly toward the yard. "Get the bag ready. The piñata is going up into the tree. Put your mask on." "Red nose engaging. Let us steal some toys."
created_at: "2026-02-19T19:39:36.33Z" (2/19/2026, 7:39:36 PM)
model: "google/gemini-3.1-pro-preview-20260219"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 3327
moderation_latency: (null)
generation_time: 197549
tokens_prompt: 77
tokens_completion: 4723
native_tokens_prompt: 67
native_tokens_completion: 16636
native_tokens_completion_images: 0
native_tokens_reasoning: 15973
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.199766
router: (null)
provider_responses: 0
endpoint_id: "2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok: false
latency: 3327
model_permaslug: "google/gemini-3.1-pro-preview-20260219"
provider_name: "Google AI Studio"
status: 200
api_type: "completions"
id: "gen-1771529976-p8TjRfxABNsbLTumIqXV"
upstream_id: (null)
total_cost: 0.199766
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Result     Evaluator                     Details                          Meta Data
100.00%    Matches word count            n/a                              neededClean: false; words: 500
100.00%    Dialogue to Total Word Ratio  Ratio: 70.00%, Deviation: 0.00%  neededClean: false; wordsTotal: 500; wordsDialogue: 350
100.0000%
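The ratio and deviation reported above follow from the two word counts (350 dialogue words out of 500). The benchmark's actual evaluator is not shown on this page, so the following is a minimal sketch of one plausible way to derive such numbers. It assumes words are whitespace-separated and that dialogue is whatever sits between straight double quotes. The function name dialogue_ratio and the sample text are hypothetical.

```python
import re

def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a prose passage."""
    total_words = len(text.split())
    # Assumption: dialogue is the text enclosed in straight double quotes.
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio

# Short illustrative passage, not the evaluated response.
sample = 'Bill sighed. "I ate the confetti." She stared. "You ate the paper confetti?"'
words_dialogue, words_total, ratio = dialogue_ratio(sample)

# The counts reported for this run, checked against the 70% target.
target = 0.70
reported_ratio = 350 / 500
deviation = abs(reported_ratio - target)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {deviation:.2%}")
```

How the evaluator turns a non-zero deviation into a score is not described here, so the sketch stops at the ratio and the deviation.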