NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-19-01KHVPHX4K8CW1ZA9MMTXE0XP8
Run Details
Model:
Gemini 3.1 Pro (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 36s
Cost:
$0.153530
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The surveillance van smelled strongly of stale potato chips and cheap latex. Molly adjusted her binoculars, staring through the tinted windshield at the sprawling suburban lawn across the street. A gigantic inflatable castle dominated the yard, trembling beneath the weight of a dozen hyperactive children. Bill sat in the passenger seat, sweating profusely in his oversized rented clown suit. He poked at a wildly inaccurate blueprint Molly had drawn on a greasy napkin. Here was the target. Timmy Henderson. Age eight. Heir to a mountain of brightly wrapped boxes sitting defenseless on a patio table. "I still do not understand why I have to wear these ridiculous, oversized, obnoxiously squeaky clown shoes." Bill shifted his weight, causing an agonizing rubbery screech to echo off the metal walls of their getaway vehicle. He scratched at his bright red foam nose and aggressively adjusted his polka dot suspenders. Molly did not lower her binoculars as she violently shushed him. "Because you are the primary distraction. You go in, twist some pink balloons into questionable animals, and keep those frantic, sticky little monsters completely busy." She pointed a flashlight at the greasy napkin, highlighting a blotch of ketchup that represented the gift table. "While you do that, I bypass the slumbering grandfather, infiltrate the wooden gazebo, and secure the package." Bill scoffed, crossing his massive arms and nearly popping a button off his oversized yellow vest. He glared at the suburban fortress, completely baffled by the tactical genius of his partner. "The package is a massive mound of loose Legos and a gigantic remote control monster truck. How are you going to carry all that out naturally?" Molly finally lowered her binoculars and reached into the back of the van. She pulled out an absurdly large, spectacularly garish canvas sack adorned with cartoon ponies. It was a masterpiece of suburban camouflage, guaranteed to blend in perfectly amidst the chaotic debris of a frantic birthday celebration. She patted the monstrous bag with pure professional pride. "I am going to cheerfully walk right out the front gate looking like an exhausted aunt carrying presents to the minivan. It is the perfect crime." A sudden explosion of cheering erupted from across the street. A blindfolded child was ruthlessly beating a paper donkey with a baseball bat, sending a shower of cheap candy raining down upon the manic lawn. The chaos had reached its absolute peak, signaling their window of opportunity. Molly checked her tactical stopwatch. "Clock is ticking. Do you remember the extraction signal?" Bill sighed, honking his foam nose with a look of utter defeat. "I have to scream terribly and pretend to faint directly into the birthday frosting." Molly grinned, slipping a lockpick into her pocket. "Exactly. Now get out there and make them laugh." The heavy van door slid open, spilling a giant sad clown onto the pavement. "Frankly, this is a very terrible idea." Molly ignored him completely, entirely focused on the mountain of stolen loot that awaited beneath the wooden gazebo.
created_at
"2026-02-19T19:38:12.845Z"
2/19/2026, 7:38:12 PM
model
"google/gemini-3.1-pro-preview-20260219"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3360
moderation_latency
(null)
generation_time
95733
tokens_prompt
77
tokens_completion
4196
native_tokens_prompt
67
native_tokens_completion
12783
native_tokens_completion_images
0
native_tokens_reasoning
12168
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.15353
router
(null)
provider_responses
0
endpoint_id
"2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok
false
latency
3360
model_permaslug
"google/gemini-3.1-pro-preview-20260219"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771529892-SmA0bCbneg70JjA97g3B"
upstream_id
(null)
total_cost
0.15353
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 30.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
150
100.0000%