NC Bench
Run: 2026-05-19-01KS0V5FVPB7XWS81F03GRE0J8
Run Details
Model: Gemini 3.5 Flash (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 1m 24s
Cost: $0.204598
Provider: Google AI Studio
Evaluation summary
Score | Evaluator
99.98% | Matches word count
99.93% | Dialogue to Total Word Ratio
99.9557% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
The smell of stale french fries and cheap adhesive tape filled the cramped back of the rusted transit van. Molly adjusted a pair of oversized plastic night vision goggles that were entirely unnecessary for a bright Tuesday afternoon, squinting at a hand drawn blueprint of the suburban house. Beside her on the carpeted floor, Bill was busy trying to peel a sticky green cherry sucker off his corduroy trousers, looking far more concerned with fuzzy lint than grand larceny. He sighed deeply, giving up on the sticky candy and looking up at his intense partner in crime. "We have a three minute window when the magician starts his rabbit act." Molly tapped her squeaky red marker against the blueprint, tracing an intricate path through the living room. Bill frowned and leaned closer to the paper. "Are you absolutely sure? What if he does card tricks instead?" "He won't. My inside contact, who is seven, says the rabbit is the finale." She pointed a sharp, manicured finger at the back patio, where a cheerful cluster of helium balloons drifted very gently in the breeze. "The entire suburban perimeter is heavily fortified." "We face a giant bounce house, lawn sprinklers, and toddlers who will bite." Bill shuddered at those terrifying children, vividly remembering their previous encounter with a sugar crazed toddler during their bakery job. He adjusted the tight black ski mask, which unfortunately had fluffy cartoon puppy ears sewn directly onto the top because it was the single final mask remaining in stock at the suburban party supply store down by the old highway exit ramp. "Do we have any sort of tactical distraction?" "I bought organic juice pouches. If things go sideways, we dump them." "The health nut parents will stampede, creating a beautiful diversion." Bill nodded slowly, deeply impressed by the immense tactical brilliance of the psychological leverage she held over modern parenting anxieties. "That is incredibly devious, but what about the birthday cake?" "Focus. 
We are here for the toys, not the dairy." Molly hissed, tapping a tiny hand drawn map of the formal dining room where the target gift table was located. "Once I grab the pile, you throw the tactical smoke screen." "Is it an actual tactical smoke bomb?" She reached deep into the pocket of her bulky tactical vest and carefully pulled out a white plastic squeeze bottle. "It is baby powder, but it creates a decent cloud if you squeeze." "I am ready. For honor, glory, and free plastic race cars." Molly rolled her eyes and sighed, looking down at her cheap plastic digital watch. Outside the parked vehicle, the distant, tragic squeak of a wet rubber clown shoe and a terrible accordion rendition of Happy Birthday echoed across the brightly manicured lawn. The perfect target was ripe. She grabbed a giant canvas laundry sack decorated with silver glitter and slid open the squeaky side door of the van, ready to execute the most ridiculous backyard caper of her professional thief career.
created_at: "2026-05-19T19:24:54.012Z" (5/19/2026, 7:24:54 PM)
model: "google/gemini-3.5-flash-20260519"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1567
moderation_latency: (null)
generation_time: 83934
tokens_prompt: 77
tokens_completion: 5873
native_tokens_prompt: 67
native_tokens_completion: 22722
native_tokens_completion_images: 0
native_tokens_reasoning: 22108
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.2045985
router: (null)
provider_responses:
  0:
    endpoint_id: "bd50061c-e1d8-469e-96b1-d322e77378f8"
    id: "BrkMaoLKC8nhugfN3qjpDA"
    is_byok: false
    latency: 1567
    model_permaslug: "google/gemini-3.5-flash-20260519"
    provider_name: "Google AI Studio"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779218694-1GDvrnGg5t3ad5rxxB4x"
session_id: (null)
api_type: "completions"
id: "gen-1779218694-X6Y2OlWq2MTEiI7KTFQD"
upstream_id: "BrkMaoLKC8nhugfN3qjpDA"
total_cost: 0.2045985
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
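The token and cost fields in the run metadata above can be combined into a few derived figures. The sketch below is illustrative only and rests on assumptions that the page does not confirm: that native_tokens_reasoning is counted inside native_tokens_completion, that generation_time is in milliseconds, and that dividing total_cost by completion tokens gives a rough per-token figure (it ignores the 67 prompt tokens). The variable names are chosen for this example.

```python
# Values copied from the run metadata; units are assumed, not documented here.
native_tokens_completion = 22722
native_tokens_reasoning = 22108
generation_time_ms = 83934      # assumed to be milliseconds
total_cost_usd = 0.2045985

# Share of completion tokens spent on reasoning
# (assumes reasoning tokens are a subset of completion tokens).
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Tokens left over for the visible answer under the same assumption.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Rough generation throughput in native tokens per second.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

# Crude cost per 1,000 completion tokens; prompt tokens are ignored.
cost_per_1k_completion = total_cost_usd / native_tokens_completion * 1000

print(f"reasoning share: {reasoning_share:.1%}")
print(f"visible tokens: {visible_tokens}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
print(f"cost per 1k completion tokens: ${cost_per_1k_completion:.4f}")
```

Under these assumptions, almost all of the completion budget for this run went to reasoning rather than to the roughly 500-word scene itself.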
Evaluation details
Result | Evaluator | Details | Meta Data
99.98% | Matches word count | n/a | neededClean: false, words: 498
99.93% | Dialogue to Total Word Ratio | Ratio: 30.12%, Deviation: 0.12% | neededClean: false, wordsTotal: 498, wordsDialogue: 150
99.9557% | Overall | |
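The reported ratio and deviation can be reproduced from the word counts in the evaluation metadata. This is a minimal sketch, not the evaluator's actual code: the 30% target comes from the scenario description, the deviation is assumed to be the absolute difference in percentage points, and the overall score is assumed to be a plain mean of the two evaluator scores. How each evaluator maps its deviation to a percentage score is not shown on the page and is not modelled here.

```python
# Word counts copied from the evaluation metadata.
words_total = 498
words_dialogue = 150
target_ratio = 0.30  # from the scenario: 30% dialogue

# Dialogue share of the text and its distance from the target.
ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Assumed aggregation: arithmetic mean of the two displayed scores.
displayed_scores = [99.98, 99.93]
mean_score = sum(displayed_scores) / len(displayed_scores)
print(f"Mean of displayed scores: {mean_score:.4f}%")
```

The mean of the displayed scores comes out at about 99.955%, slightly below the reported 99.9557%. That gap would be consistent with the page averaging unrounded scores, though this is a guess.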