NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-19-01KHVP09Y69E9NH6VPZ5W7SXV8
Run Details
Model:
Gemini 3.1 Pro (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
3m 32s
Cost:
$0.306338
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped the laminated blueprint on the steering wheel of their rented van. She checked her silver watch. "Okay, let's review the floor plan one more time. The primary target is Timmy's eighth birthday bash. He specifically asked his rich grandparents for the new virtual reality headset, which means there is extremely high value loot sitting right on that folding table." Bill adjusted his black binoculars, peering through the buggy windshield at the crowded front lawn. "I still think we should have brought a smaller getaway car, Molly. A minivan blends in, sure, but it completely lacks the torque to evade angry, athletic soccer moms." Molly forcefully smacked his left shoulder. "We need the cargo space, Bill! Do you have any idea how much volume forty wrapped building block sets occupy? Not to mention the foam dart blasters and the remote control trucks." Bill rubbed his arm with a pained grimace. "Fair point. So, what is the main entry vector?" She traced a thick red line across the paper. "We breach straight through the inflatable bounce castle. It provides excellent visual cover from the parental units stationed on the back patio who are busy sipping their mimosas." Bill squinted at the towering pneumatic structure. "Are we taking out the clown first? He looks terribly shifty." Molly shook her head from side to side. "No, the clown is strictly a civilian. Plus, I read his Yelp reviews online. He makes a mean balloon giraffe. We leave him be." Bill shifted uncomfortably, quickly lowering the heavy binoculars. "What if he spots us stuffing the presents into the duffel bags? Clowns are notoriously huge snitches." She reached into the back and grabbed a pile of green fabric. "We will mitigate that risk by wearing extreme camouflage." Bill groaned loudly, letting his head hit the glass. "You mean the giant plush dinosaur suits again? I sweat entirely too much in those." Molly tossed the surprisingly heavy plush dinosaur costume directly onto his lap. "Exactly. Nobody questions a sweating dinosaur at an eight year old's backyard party. We lumber in, eat a slice of funfetti cake to establish our solid alibi, sweep the entire gift table into our synthetic bellies, and lumber right back out. It is foolproof." Bill poked the floppy snout with one hesitant finger. "What about the sugar high? Those kids have been eating blue cotton candy all afternoon. They are going to be acting like feral wolves. If they swarm us, I am definitely kicking." Molly grabbed him firmly by the collar of his shirt. "There will be absolutely no kicking of the children, Bill! We are professional thieves, not monsters. If they swarm, you throw a handful of glitter and roll away." He sighed heavily and began slowly unlacing his tennis shoes. "Fine. But I definitely get to keep any expensive action figures that we score today." Molly smiled, pulling her giant reptilian mask into place. "Deal. Now zip it up. It is officially time to completely ruin a birthday."
created_at
"2026-02-19T19:28:36.435Z"
2/19/2026, 7:28:36 PM
model
"google/gemini-3.1-pro-preview-20260219"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3798
moderation_latency
(null)
generation_time
210564
tokens_prompt
77
tokens_completion
6174
native_tokens_prompt
67
native_tokens_completion
25517
native_tokens_completion_images
0
native_tokens_reasoning
24865
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.306338
router
(null)
provider_responses
0
endpoint_id
"2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok
false
latency
3641
model_permaslug
"google/gemini-3.1-pro-preview-20260219"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771529316-qhDPGNWWFJXAOGCBfyKh"
upstream_id
(null)
total_cost
0.306338
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 70.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
350
100.0000%