NC Bench
Overview
About
Tests
Models
Model
Gemini 3 Flash (Preview)
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYXDBHWW04A7QMJ9557YYCR
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The map was a crayon drawing on the back of a soggy Chuck E. Cheese placemat. Molly tapped a greasy smudge near the ball pit icon. This is the rendezvous point, she whispered, her eyes darting around the minivan. We hit the juice box cooler first to create a distraction. Bill adjusted his clip-on tie and looked in the visor mirror. Do I look like a birthday magician? I feel like a funeral director for hamsters. You look like a professional, Bill. Now, focus. The target is a pile of rectangular boxes wrapped in Elsa and Spider-Man paper. We estimate a net worth of four hundred dollars in LEGO sets and at least one remote-controlled drone. Is the drone the one with the camera? Bill asked, checking his pockets for the smoke bombs they had fashioned out of baking soda and glitter. Because I really want that drone, Molly. My nephew has one, and he mocks me from the driveway. Focus on the mission, not your family trauma, Molly snapped. You take the north perimeter by the bouncy castle. I’ll infiltrate the gift table using the 'Auntie from Out of Town' maneuver. Does that involve the cheek pinching? Bill shivered. I hate the cheek pinching. It’s a low blow, even for us. It’s a tactical necessity, Bill. If a toddler asks who I am, I tell them I’m Great Aunt Gertrude. They’re usually too sugared up to check the genealogy. What about the clown? Bill pointed a trembling finger toward the driveway. There’s a guy named Bubbles out there making balloon giraffes. He looks like he’s seen things. He’s got shifty eyes. Bubbles is a civilian, Molly said, checking her watch. We ignore him unless he tries to block the exit. If he pulls a balloon sword, you use the glitter bomb. And the cake? Bill licked his lips. Is the cake part of the haul? I heard it’s red velvet. The cake is a secondary objective only, Bill! We are here for the high-value cardboard. If you get distracted by frosting again like the Omaha job, I’m leaving you in the ball pit. That ball pit was a tactical retreat! Bill protested. I was blending in with the primary colors! You were crying because you lost your shoe, Molly countered. Now, put on your spectacles. The bouncy castle is inflating. That’s our signal. I’m going in, Bill said, gripping the door handle. If I don’t make it back, tell my cat I loved her. Your cat hates you, Bill. Just get the LEGOs. Now move! On three? Bill asked. On three, Molly confirmed. One, two, three. For the plastic gold! For the red velvet! Bill shouted, tumbling out of the minivan and immediately tripping over a lawn sprinkler.
created_at
"2026-02-08T15:20:13.308858+00:00"
Sunday, February 8th, 2026 at 3:20:13 PM GMT+00:00
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
910
moderation_latency
(null)
generation_time
7035
tokens_prompt
61
tokens_completion
595
native_tokens_prompt
67
native_tokens_completion
612
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0018695
router
(null)
provider_responses
0
status
200
is_byok
false
latency
910
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
provider_name
"Google AI Studio"
model_permaslug
"google/gemini-3-flash-preview-20251217"
api_type
"completions"
id
"gen-1770564005-r6cTlGwYlJwWUzGrEfl5"
upstream_id
(null)
total_cost
0.0018695
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
463
0.0000%
Dialogue to Total Word Ratio
Ratio: 1.30%, Deviation: 48.70%
neededClean
false
wordsTotal
463
wordsDialogue
6
0.0000%