NC Bench
Overview
About
Tests
Models
Model
Gemini 3 Flash (Preview)
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYX8W75RBFEP28WJ87N8Q6Q
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Check the perimeter again," Bill whispered, adjusting a clip-on tie that was several shades too neon for a man of his criminal aspirations. "I checked it, Bill. It’s a bouncy castle. The perimeter is made of air and vinyl," Molly snapped. She was currently wedging a tactical earpiece into place, though it was mostly being used to block out the sound of a nearby kazoo solo. "Intel says the target is in the sunroom. The ‘Gift Mountain,’ they’re calling it. Very high altitude. Very dangerous," Bill said, tapping a diagram drawn in crayon on the back of a pizza box. "It’s a pile of LEGO sets and Paw Patrol merchandise, Bill. We aren't infiltrating the Louvre. We’re infiltrating a seven-year-old’s backyard." "But the security, Molly! Think of the security!" "You mean the Golden Retriever?" "He has a very suspicious bark! And he’s been eyeing my ham sandwich since we parked the van." Molly groaned, checking her watch. "The magician starts in five minutes. That’s our window. When he pulls the rabbit out, the demographic will be distracted. We slip through the sliding glass door, bag the loot, and vanish before the cake is served." "And if we get cornered by the mother?" Bill asked, his voice trembling slightly. "She looks like she does CrossFit, Molly. I saw her lift a cooler with one hand." "If she spots us, we pivot to Plan B." "Which is?" "We pretend to be the face painters who were supposedly stuck in traffic." "I can’t paint faces! The last time I tried to draw a cat, the police thought it was a topographical map of the Andes." "Then you’re the balloon animal guy. Just twist a long one into a circle and call it a donut. Kids are idiots, Bill. They have no standards." "I don't know," Bill said, peering through the binoculars. "Wait. Movement at twelve o'clock. It’s a toddler. He’s got a juice box. He looks like a snitch." "He’s sticky, Bill. He’s not a federal agent. Put the binoculars down before someone sees the reflection." "I’m just saying, the stakes are high. If we get caught, it’s not just jail. It’s social ostracization. We'll be the people who robbed little Timmy." "It’s little Kevin, actually. And Kevin has three iPads. He’s a hoarder. We’re practically Robin Hoods, but instead of giving to the poor, we’re giving to our overdue rent." "Do you think there’s a remote-control drone in there? I’ve always wanted one with a camera." "Focus! The rabbit is out! Go, go, go!" Bill scrambled out of the bushes, tripping over a plastic lawn flamingo. "My ankle! I think I’ve tripped a silent alarm!" "That was a bird feeder, you moron! Get to the door!" "I’m going! I’m going! Should I put the mask on now?" "No! It’s a birthday party! Put on the party hat and try to look like you’re having a mid-life crisis, not a felony!"
created_at
"2026-02-08T15:17:46.409908+00:00"
Sunday, February 8th, 2026 at 3:17:46 PM GMT+00:00
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
746
moderation_latency
(null)
generation_time
7051
tokens_prompt
61
tokens_completion
665
native_tokens_prompt
67
native_tokens_completion
724
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0022055
router
(null)
provider_responses
0
status
200
is_byok
false
latency
746
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
provider_name
"Google AI Studio"
model_permaslug
"google/gemini-3-flash-preview-20251217"
api_type
"completions"
id
"gen-1770563858-rqHkzxjid9ZfIV16lflO"
upstream_id
(null)
total_cost
0.0022055
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
97.6276%
Matches word count
n/a
neededClean
false
words
493
0.1281%
Dialogue to Total Word Ratio
Ratio: 81.54%, Deviation: 11.54%
neededClean
false
wordsTotal
493
wordsDialogue
402
48.8778%