NC Bench
Run: 2026-02-19-01KHVP09Y6TK8ABTT9ZCXZZ4W0
Run Details
Model: Gemini 3.1 Pro (Preview)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 3m 39s
Cost: $0.218834
Provider: Google AI Studio
Evaluation summary
Matches word count: 100.00%
Dialogue to Total Word Ratio: 100.00%
Overall: 100.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The surveillance van smelled entirely of stale pepperoni and nervous sweat. Molly peered through the heavy plastic binoculars. She adjusted the focal lens until the inflatable medieval bounce castle resolved into a shockingly bright neon monstrosity. Bill sat completely immobilized beside her. A half eaten slice of pizza dangled loosely from his right hand. The target location was a seemingly innocent suburban backyard. It was heavily fortified by pastel paper streamers and a terrifyingly cheerful clown twisting balloons. Molly lowered her visual optics. "Are you absolutely certain the primary payload is located just past the inflatable dragon?" Bill slowly chewed his slice. He wiped a dollop of bright orange grease from his black tactical sweater. He gestured toward the property. "Positive. My preliminary recon is always flawless. The nervous host mother relocated the entire gift table inside the glass sunroom to protect the expensive wrapping paper from the lawn sprinklers. It is an extremely vulnerable position just waiting to be exploited." Bill picked up a detailed schematic of the house. He tapped a red circle indicating the rear patio doors. "And what exactly is our strategy for bypassing the hostiles?" Molly watched a tiny boy aggressively assault the hired entertainment with a green pool noodle. The clown fell backward into a plastic kiddie pool. "You mean the chaotic swarm of sugar crazed seven year olds? We deploy the ultimate diversion. You will wear the giant plush purple dinosaur costume. Once you breach the rear perimeter, you will initiate your synchronized hip hop dance routine. The children will immediately swarm you. While they are thoroughly distracted by your clumsy prehistoric flailing, I will quickly scale the wooden trellis." Bill stared blankly at the complex architectural blueprints carelessly scattered across the dirty dashboard of the van. He shuddered quite visibly. 
"I still believe we are vastly underestimating the sheer destructive power of these toddlers. Did you see what they did to the unicorn piƱata earlier? It was a massacre. It took them less than twelve seconds to completely obliterate a majestic papier mache creature into mere confetti." Molly grabbed a grappling hook from the metal floorboard and shoved it directly into her utility belt. She offered her partner a deeply reassuring nod. "You must remain extremely brave. Remember the massive payday. There is a rumored limited edition robotic puppy sitting inside that sunroom. It retails for hundreds on the black market." Bill tossed his pizza crust out the window. He placed a giant foam dinosaur head over his face. He checked his reflection in the mirror. "If I do not make it back to the getaway car, tell my mother I love her." Molly unlocked the heavy side doors of the cargo vehicle, glancing toward the affluent neighborhood. She double checked the tactical smoke pellets lining her belt. She took a deep breath. "Just focus exclusively on doing the moonwalk correctly. Their simple minds will be fully captivated by your sick dance moves. I will bypass the lock and grab the presents."
created_at
"2026-02-19T19:28:36.432Z"
2/19/2026, 7:28:36 PM
model
"google/gemini-3.1-pro-preview-20260219"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3590
moderation_latency
(null)
generation_time
218422
tokens_prompt
77
tokens_completion
8467
native_tokens_prompt
67
native_tokens_completion
18225
native_tokens_completion_images
0
native_tokens_reasoning
17626
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.218834
router
(null)
provider_responses
0
endpoint_id
"2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok
false
latency
3590
model_permaslug
"google/gemini-3.1-pro-preview-20260219"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771529316-ZyYXIVMqh5iQHqcN5nrY"
upstream_id
(null)
total_cost
0.218834
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result: 100.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean false, words 500

Result: 100.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 50.00%, Deviation: 0.00%
Meta Data: neededClean false, wordsTotal 500, wordsDialogue 250

Overall: 100.0000%
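The page does not show how the evaluators count words, so the following is a minimal sketch of one plausible approach, not NC Bench's actual implementation. It assumes words are whitespace-separated tokens and that dialogue is any text between straight double quotes; the function name dialogue_ratio and the sample text are hypothetical.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (total_words, dialogue_words, ratio) for a prose passage."""
    # Assumption: a "word" is any whitespace-separated token.
    total_words = len(text.split())
    # Assumption: dialogue is whatever sits between straight double quotes.
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return total_words, dialogue_words, ratio


# Hypothetical sample: three narration words, three dialogue words.
sample = 'Molly nodded twice. "Grab the presents."'
total, dialogue, ratio = dialogue_ratio(sample)
deviation = abs(ratio - 0.5)  # distance from the 50% target in the scenario
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 50.00%, Deviation: 0.00%"
```

A real evaluator would also need to decide how to treat curly quotes, hyphenated words, and unbalanced quotation marks, none of which this sketch handles.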