NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-19-01KHVPQKMV98D75R10HWVYFV64
Run Details
Model:
Gemini 3.1 Pro (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 45s
Cost:
$0.162566
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
20.19%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
60.0948%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The interior of the surveillance van smelled intensely of cheap garlic powder and professional anxiety. Through the cleverly tinted windshield of the nondescript plumbing vehicle, the suburban fortress loomed in the afternoon daylight. It was a viciously manicured split level house currently harboring the most lucrative stash of unrecorded physical assets in the entire metropolitan area. The lush backyard was heavily fortified by a giant inflatable dragon. Molly adjusted the focus ring on her tactical binoculars, her breath fogging the glass slightly as she leaned forward. She aggressively scanned the perimeter, mentally noting the choke points and escape paths. A massive hoard of sticky fingered hostiles swarmed the backyard patio, high on heavily frosted cupcakes and unbridled youth. The primary target was a folding banquet table draped in a polka dot plastic cloth. It sagged delightfully under the weight of brightly wrapped electronics, rare collectible building sets, and festive envelopes thick with untraceable cash. Beside her in the passenger seat, Bill rigorously tapped a pen against his clipboard. He was currently dressed in the uniform of a regional cable inspector, though his tactical combat boots somewhat ruined the casual blue collar illusion. He adjusted his earpiece and studied a hand drawn schematic of the lawn. "Give me the tactical breakdown of the patio area." Bill lowered the clipboard and cleared his throat. "The perimeter is surprisingly soft, but the hostiles are highly erratic. They are hopped up on pure processed sugar and lack any predictable patrol routes." Molly kept her eyes pressed to the binoculars. "And the primary stash?" "Located under the patio awning. It is poorly guarded by an unattended cooler of juice boxes, but the parental overwatch is active near the grill." She slowly processed this information. A father wearing cargo shorts was currently flipping burgers with alarming vigilance. "We need a localized distraction. Something that pulls the adults away from the patio." "I could deploy the smoke grenades." Molly lowered her optics and stared at her partner with profound disappointment. "We are not teargassing a bunch of seven year olds. Think smaller." He reached into the duffel bag resting between the seats. "I have a rogue ferret and three laser pointers." A slow grin spread across her face. The plan was coming together. "That is adequate. While the beast creates chaos near the bounce house, I will infiltrate the gift registry." "What if the clown spots you?" Molly checked the action on her grappling hook. "He makes balloon animals for a living. I think I can take him." "Just remember to grab the large silver box first. It is the limited edition hoverboard." Bill checked the metal latches on the animal carrier. Inside, a frantic scraping sound suggested their biological weapon was fully primed. He gave his partner a solemn nod of solidarity. They had pulled off casino vaults and museum galleries, but infiltrating a suburban backyard required a special brand of courage. Molly pulled a black beanie over her hair. She unlocked the heavy side door and stepped out into the humid air. The greatest heist of their entire professional careers was about to officially begin.
created_at
"2026-02-19T19:41:19.729Z"
2/19/2026, 7:41:19 PM
model
"google/gemini-3.1-pro-preview-20260219"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3547
moderation_latency
(null)
generation_time
164873
tokens_prompt
77
tokens_completion
4095
native_tokens_prompt
67
native_tokens_completion
13536
native_tokens_completion_images
0
native_tokens_reasoning
12893
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.162566
router
(null)
provider_responses
0
endpoint_id
"2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok
false
latency
3547
model_permaslug
"google/gemini-3.1-pro-preview-20260219"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771530079-zu7QKEo0DVKK4wg3Ytf7"
upstream_id
(null)
total_cost
0.162566
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
20.19%
Matches word count
n/a
neededClean
false
words
520
100.00%
Dialogue to Total Word Ratio
Ratio: 30.00%, Deviation: 0.00%
neededClean
false
wordsTotal
520
wordsDialogue
156
60.0948%