NC Bench
Model
Gemini 3 Flash (Preview)
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYWXK8P5TDF37G1EE5EPYC8
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
The map was sprawled across the grease-stained table of the Tip-Top Diner, though to the untrained eye, it looked suspiciously like a placemat featuring a maze for toddlers. Molly circled a drawing of a bouncy castle with a red felt-tip marker, her eyes narrowed with the intensity of a diamond cutter. I’m telling you, the bouncy castle is tactical high ground, Bill. If we secure the perimeter of the inflatable dragon, we control the flow of the entire patio. Bill leaned back, nursing a lukewarm coffee. His face looked like a catcher’s mitt that had been through a car wash. It’s a five-year-old’s birthday party, Molly. We aren't storming the beaches of Normandy. We’re swiping Lego sets and a miniature electric Jeep. Perspective, Bill, Molly hissed, leaning in so close she nearly dipped her ponytail into his fried eggs. This is the Jenkins estate. We aren’t talking about generic building blocks. We’re talking about limited edition sets with resale values that would make a Swiss banker weep. The Jeep has leather seats. Actual leather seats. How are we even getting through the gate? The security is a kid named Tyler who wears a vest with too many pockets. He’s an amateur. I’ve already secured the Trojan horse. Molly reached into her oversized duffel bag and pulled out a matted, neon-pink fur suit. Bill stared at the googly eyes of the costume, which seemed to be judging his life choices. You want me to go in as Sparkles the Dysfunctional Unicorn? You’re the only one who fits the height requirements for the rental, Bill. Also, unicorns are statistically less likely to be tackled by overzealous suburban dads. While you’re distracting the herd with balloon animals—which you’ve been practicing, right? I made a dog this morning, Bill grumbled, staring at the ceiling. It looked like a cluster of grapes with a tail, but the neighbor’s kid didn't cry, so I’m counting it as a win. Good enough. While you’re being mobbed for latex poodles, I’ll be in the sunroom. 
The gift table is situated right under the central chandelier. I’ve got a modified leaf blower and a series of industrial-strength magnets. A leaf blower? Isn’t that a bit loud for a stealth operation? Not if I time the suction bursts with the singing of Happy Birthday. It’s the perfect sonic camouflage. We have a forty-second window between 'Long to reign over us' and the blowing out of the candles. Bill sighed, rubbing his temples. We’re really doing this. We’re really going to be the people who ruin little Timmy’s big day for a high-end toy car and some plastic bricks. His name is Arthur, and he has a private tutor for Mandarin. He’ll be fine, Bill. He’s got equity. We’ve got a van with a sliding door that doesn't close all the way and a dream. Fine. But if I get kicked in the shins by a sugar-crashing toddler, I’m taking the Jeep for myself. Deal. Now, put on the horn. We need to see if you can see out of the nostrils.
created_at
"2026-02-08T15:11:36.32243+00:00"
Sunday, February 8th, 2026 at 3:11:36 PM GMT+00:00
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
670
moderation_latency
(null)
generation_time
6716
tokens_prompt
61
tokens_completion
664
native_tokens_prompt
67
native_tokens_completion
703
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0021425
router
(null)
provider_responses
0
status
200
is_byok
false
latency
670
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
provider_name
"Google AI Studio"
model_permaslug
"google/gemini-3-flash-preview-20251217"
api_type
"completions"
id
"gen-1770563489-H21mLPJ8gK4iA6el2lWy"
upstream_id
(null)
total_cost
0.0021425
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
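The token counts, timing, and cost above can be cross-checked with a little arithmetic. What follows is a minimal sketch, not the router's billing logic. The per-million-token prices (0.50 for prompt, 3.00 for completion) are assumptions that are not stated on this page, and `generation_time` is assumed to be in milliseconds. The helper names are hypothetical. Under those assumptions, the native token counts reproduce the reported `usage` value.

```python
# Values copied from the metadata above.
NATIVE_TOKENS_PROMPT = 67
NATIVE_TOKENS_COMPLETION = 703
GENERATION_TIME_MS = 6716  # assumed unit: milliseconds

# Hypothetical prices per million tokens; not stated on this page.
PRICE_PROMPT_PER_M = 0.50
PRICE_COMPLETION_PER_M = 3.00


def estimate_cost(prompt_tokens, completion_tokens):
    """Estimate the request cost from token counts and per-million prices."""
    weighted = (prompt_tokens * PRICE_PROMPT_PER_M
                + completion_tokens * PRICE_COMPLETION_PER_M)
    return weighted / 1_000_000


def tokens_per_second(completion_tokens, generation_time_ms):
    """Completion throughput, assuming the time is given in milliseconds."""
    return completion_tokens / (generation_time_ms / 1000)


cost = estimate_cost(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION)
throughput = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)

print(f"estimated cost: {cost:.7f}")
print(f"throughput: {throughput:.1f} tokens/s")
```

The sketch uses the native token counts (67 and 703) rather than the normalized ones (61 and 664), because only the native counts match the reported figure under the assumed prices.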
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 9.6081% | Matches word count | n/a | neededClean: false; words: 522 |
| 0.0000% | Dialogue to Total Word Ratio | Ratio: 2.30%, Deviation: 27.70% | neededClean: false; wordsTotal: 522; wordsDialogue: 12 |
4.8041%
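The ratio and deviation reported for the dialogue evaluator follow directly from the recorded counts: 12 dialogue words out of 522 is about 2.30%, which is 27.70 percentage points away from the 30% target. A minimal sketch of that arithmetic is shown here. It assumes that dialogue is identified as text inside double quotation marks; the evaluator's actual rule is not shown on this page, and the function names are hypothetical. How the deviation is mapped to the percentage score is not documented here, so it is left out.

```python
import re

TARGET_DIALOGUE_PERCENT = 30.0  # from the scenario: "30% dialogue"

# Assumption: dialogue is text inside straight or curly double quotes.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def count_words(text):
    """Count whitespace-separated words."""
    return len(text.split())


def count_dialogue_words(text):
    """Count words that fall inside double-quoted spans."""
    total = 0
    for straight, curly in QUOTED.findall(text):
        total += count_words(straight or curly)
    return total


def ratio_and_deviation(words_dialogue, words_total):
    """Return the dialogue percentage and its distance from the target."""
    ratio = 100.0 * words_dialogue / words_total
    return ratio, abs(TARGET_DIALOGUE_PERCENT - ratio)


# Counts taken from the Meta Data of the evaluation above.
ratio, deviation = ratio_and_deviation(12, 522)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

With a quote-based rule like this one, speech written without quotation marks would not be counted as dialogue, which may be one reason a response scores a low ratio.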