NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-19-01KHVPCGDJGXHGYSTEQZ6E092Z
Run Details
Model:
Gemini 3.1 Pro (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4m 44s
Cost:
$0.208658
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
99.98%
Dialogue to Total Word Ratio
(details)
99.9905%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Are you absolutely certain the hired clown is compromised?" Molly leaned over the grease stained blueprints scattered across the hood of their rusted getaway van, tapping a gloved finger against the schematic of the community center. "I paid him off this morning. He will fake a stomach bug precisely as the chocolate fudge cake is cut. That creates our primary diversion window." Bill adjusted his dark aviator sunglasses, checking his wristwatch with a completely serious and grim expression. "And the local event security presence?" Molly frowned and dragged a red marker across the venue map. "Two profoundly bored teenagers monitoring the petting zoo, plus one hypervigilant grandmother stationed by the gift tables. Our infiltration route brings us straight through the indoor ball pit." Bill tapped another section of the complex diagram, his voice steady and low. "The ball pit? Bill, that is professional suicide. Do you know how deep those plastic spheres go? We will lose all foot traction and flounder directly into the line of sight of the roaming toddlers." Molly crossed her arms, staring at her partner with intense and unvarnished skepticism. "I anticipated that specific logistical nightmare. We are wearing modified gripping footwear. Specifically, golf cleats." Bill proudly lifted a foot to display his heavily spiked and brightly polished leather shoes. "Golf cleats inside a toddler ball pit. Brilliant. So we traverse the pit, avoid the crazed sugar children, and then what? How do we bag the loot?" Molly sighed heavily, rubbing her temples to stave off a suddenly blooming headache. "The presents are stacked on a flimsy plastic table next to the juice boxes. I estimate thirty uniquely wrapped targets. We deploy the canvas laundry sacks, prioritize the shiny boxes, and exfiltrate rapidly through the inflatable bouncy castle." He confidently gestured toward a tiny square marked with a crudely drawn bow. "What about the actual birthday boy?" Molly raised one single, highly questioning eyebrow. "Timmy is an unpredictable variable. Last year he bit his uncle. If he spots us securing the package, immediately offer him the designated decoy tablet." Bill slowly pulled the glowing device from his tactical utility belt to demonstrate. "You bought an expensive tablet just to bribe a seven year old child?" "It displays an endlessly looping video of a digitally animated singing hippopotamus. It will neutralize his awareness for exactly three minutes." "Tactical genius. What exactly is the designated time for our initial breach?" Molly nodded slowly, slapping her hands together in preparation. "Fourteen hundred hours exactly. Our strike happens just after the rainbow animal bursts. The resulting pediatric sugar rush will blind every single attendee to our covert movements. We target outsized boxes. Seek the ones rattling like expensive interlocking building blocks. I already promised our fence three untouched starship sets." "I still worry about the grandmother. Has she been effectively neutralized?" "Decaffeinated coffee was covertly swapped into her thermos. She will be soundly asleep in an uncomfortable folding chair by thirteen forty five." "Perfect. Let us go steal some childish joy."
created_at
"2026-02-19T19:36:25.85Z"
2/19/2026, 7:36:25 PM
model
"google/gemini-3.1-pro-preview-20260219"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4919
moderation_latency
(null)
generation_time
214395
tokens_prompt
77
tokens_completion
5404
native_tokens_prompt
67
native_tokens_completion
17377
native_tokens_completion_images
0
native_tokens_reasoning
16752
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.208658
router
(null)
provider_responses
0
endpoint_id
"2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok
false
latency
4919
model_permaslug
"google/gemini-3.1-pro-preview-20260219"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771529785-Q4Huqfjef9uTtKWiqiTo"
upstream_id
(null)
total_cost
0.208658
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
501
99.98%
Dialogue to Total Word Ratio
Ratio: 70.06%, Deviation: 0.06%
neededClean
false
wordsTotal
501
wordsDialogue
351
99.9905%