NC Bench
Overview
About
Tests
Models
Model
Gemini 3 Flash (Preview)
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYXJRJHG9ZX4GDKKYJFVW21
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Check the perimeter again," Bill whispered, adjusting his overly tight tuxedo. "It’s a bouncy castle, Bill. Not a missile silo," Molly retorted. She was currently disguising a heavy-duty industrial crowbar inside a hollowed-out baguette. "Sun Tzu said to know your enemy. Our enemy is currently eating paste and vibrating from a juice-box-induced sugar high. We need tactical precision." "We need to get into the sunroom, grab the pile of boxes wrapped in shiny dinosaurs, and leave before the clown starts his balloon animal set. That’s the plan." "The clown is a variable. He has shifty eyes. I don't trust a man who can fit that much primary color on one face." Molly rolled her eyes and clipped a walkie-talkie to her belt, which was hidden under a flowery summer dress. "Is the getaway vehicle primed?" "The tricycle is positioned behind the hydrangea bushes," Bill said with a firm nod. "We agreed on the sedan, Bill. Why is there a tricycle?" "Low profile, Molly. Plus, it has a bell. For intimidation." "Give me the floor plan." Bill pulled a crumpled napkin from his pocket. "Okay, here. Entry point A is the dog door. It’s tight, but if you exhale and I use the butter from the buffet, we’re golden." "I am not sliding through a dog door covered in Kerrygold," Molly snapped. "We’re using the patio entrance like civilized burglars." "And the security?" "You mean the Golden Retriever?" "He’s a sentry, Molly! He’s been trained in the art of the belly rub to lower our guard. It’s a psychological trap." "I brought a bag of high-end beef jerky. He’ll be our accomplice within ten seconds." Bill peered through his binoculars, which were actually just two toilet paper rolls taped together for 'urban camouflage.' "Look at the loot, Molly. That rectangular one in the blue foil? That’s definitely a Lego Death Star. Black market value is through the roof." "Focus. What’s the status of the target's parents?" "Occupied. The father is currently failing to light a charcoal grill, and the mother is trapped in a conversation about gluten-free cupcakes with a woman named Brenda. We have a four-minute window before the 'Happy Birthday' song begins." "That’s our cue," Molly said, checking her watch. "Once the singing starts, the acoustic interference will mask the sound of the bag zipping." "Wait! Look at the perimeter. We have a problem." "What now?" "Toddlers. Three of them. They’re patrolling the sandbox." "They're three years old, Bill. They're not patrolling, they’re looking for things to put in their ears." "They have sticky hands, Molly! If they touch the tuxedo, the forensic evidence will be everywhere. We'll be linked to the crime scene by strawberry jam fingerprints." "Pull yourself together. Put on the party hat. It’s our only hope for blending in." Bill solemnly placed a cone-shaped hat with glittery tassels on his head. "I feel like a fool." "You look like a guest. Now, grab the baguette. We go in on my signal." "On three?" "On 'cake'."
created_at
"2026-02-08T15:23:10.115142+00:00"
Sunday, February 8th, 2026 at 3:23:10 PM GMT+00:00
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
791
moderation_latency
(null)
generation_time
6836
tokens_prompt
61
tokens_completion
689
native_tokens_prompt
67
native_tokens_completion
735
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0022385
router
(null)
provider_responses
0
status
200
is_byok
false
latency
791
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
provider_name
"Google AI Studio"
model_permaslug
"google/gemini-3-flash-preview-20251217"
api_type
"completions"
id
"gen-1770564182-Szxhn9Evq32dGU2FjTfu"
upstream_id
(null)
total_cost
0.0022385
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
93.6496%
Matches word count
n/a
neededClean
false
words
509
7.4321%
Dialogue to Total Word Ratio
Ratio: 77.21%, Deviation: 7.21%
neededClean
false
wordsTotal
509
wordsDialogue
393
50.5409%