NC Bench
Overview
About
Tests
Models
Model
Gemini 3 Flash (Preview)
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYXJ8KNJA8DHF0GXEEAAN1A
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"What are we wearing, Bill?" "I was thinking tactical black." "It’s a bouncy castle rental store, Bill, not the Louvre. If we go in there looking like ninjas, we’re going to stand out against the primary colors and the glitter glue. We’re wearing the suits." "The velvet ones? I look like a depressed blueberry in mine." "You look like a professional children's entertainer. We are the Great Gazoo and his assistant, Sparkle-Toes. I’ve already printed the business cards." "I am not going by Sparkle-Toes. I have a reputation in the tri-state area." "Your reputation is for tripping over a laser grid at the history museum. Today, your reputation is making balloon poodles. Now, tell me the layout." "Right. The target is a six-year-old named Tyler. He’s a Leo. High energy, low attention span." "I don't need his horoscope. Where are the gifts?" "In the Living Room Zone. It’s a high-traffic area, Molly. We’re talking twelve toddlers, three exhausted suburban parents, and a golden retriever that hasn't been fed since breakfast. It’s a gauntlet." "And the cake?" "Scheduled for two o'clock. That’s our window. When the singing starts, everyone looks at the kid. Nobody looks at the pile of Lego sets and overpriced plushies." "Exactly. The Happy Birthday diversion. It’s a classic for a reason. Do we have the extraction vehicle?" "The Honda Odyssey is gassed up. I even put a ‘Baby on Board’ sticker on the back so the cops won't pull us over if I take a corner too fast." "Good thinking. What about the gift bags?" "I’ve got fake ones. We swap the real loot for bags filled with those little plastic whistles and sugar-free gummies. By the time they realize they’ve been robbed, we’ll be halfway to the pawn shop with twelve vintage-edition Barbie dream houses." "Bill, it’s a six-year-old’s birthday. There aren't vintage Barbies. It’s mostly going to be Roblox credits and slime kits." "Is slime liquid? Can we fence slime?" "We aren't fencing it. We’re selling it back to the black market moms on Facebook Marketplace. It’s a closed-loop economy." "I don't like the dog, Molly. He looked at me funny when I did the reconnaissance." "You did reconnaissance? At a Chuck E. Cheese?" "I had to know the vibe! The dog is a wild card. He knows I’m not Gazoo. He smells the larceny on me." "He smells the ham sandwich you had for lunch. Just stay focused. When the candles are lit, we bag the goods and exit through the patio. If anyone asks, we’re going to the van to get more glitter." "I hate glitter. It stays in the carpet for years. It’s the ultimate forensic evidence." "Then don't spill it. Do you have the nose?" "The red foam one?" "The red foam one." "I have it. But I’m still not being Sparkle-Toes." "Fine. You can be Captain Cupcake. Now get in the van. We’ve got a heist to pull."
created_at
"2026-02-08T15:22:54.119843+00:00"
Sunday, February 8th, 2026 at 3:22:54 PM GMT+00:00
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1323
moderation_latency
(null)
generation_time
7174
tokens_prompt
61
tokens_completion
659
native_tokens_prompt
67
native_tokens_completion
742
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0022595
router
(null)
provider_responses
0
status
200
is_byok
false
latency
1323
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
provider_name
"Google AI Studio"
model_permaslug
"google/gemini-3-flash-preview-20251217"
api_type
"completions"
id
"gen-1770564166-zScyT9EJoZ8arZzkiek9"
upstream_id
(null)
total_cost
0.0022595
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9990%
Matches word count
n/a
neededClean
false
words
499
0.0000%
Dialogue to Total Word Ratio
Ratio: 100.00%, Deviation: 30.00%
neededClean
false
wordsTotal
499
wordsDialogue
499
49.9995%