NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-20-01KS25X80CJFE3RA251550HQSY
Run Details
Model:
Gemini 3.5 Flash (Reasoning, Minimal)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4.8s
Cost:
$0.005555
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.08%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0424%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
You want me to what, Molly? Just listen to me, Bill, and stop looking at your GPS like it’s a bomb about to go off. It is the perfect score. It is a bouncy castle, Molly. We are parked outside a bouncy castle in suburbia. There is a giant inflatable giraffe staring at me. Exactly. And inside that backyard is a mountain of pristine, untaxed cardboard boxes wrapped in shiny cartoon paper. They are toys. We are international art thieves. We stole a Matisse from the Louvre using nothing but a fishing rod and a very cooperative pigeon. And we got chased by Interpol for six months and couldn't buy groceries. Do you know what Lego sells for on the black market? It is practically gold bullion but easier to carry. Plus, no serial numbers. We grab the loot, we sell it to crooked toy collectors, and we retire to Mallorca. I am not going to prison for stealing a Paw Patrol Lookout Tower, Molly. First of all, the shame would kill me. Second of all, how do we even get in? I have a foolproof plan. You always say that right before I get bitten by a guard dog or stuck in a ventilation shaft. This time is different. Look in the back seat. I see a pastel pink overalls set. And a giant balloon pump. Molly, tell me those are not for me. You are Sparkles the Clown. I am not Sparkles the Clown. I am William ‘The Whisper’ Vance. I can crack a safe in forty seconds using only my teeth and a stethoscope. And today, William, you will crack a helium tank. You go in, you distract the screaming children by blowing up balloon giraffes, and I sneak into the living room to bag the presents. What about the parents? They are going to notice a middle-aged man sweating through his face paint in ninety-degree heat while his partner drags a heavy-duty trash bag down the hallway. The parents will be completely incapacitated. How? Laser grid? Sleeping gas? Chardonnay and juice boxes. I did my recon. It’s a six-year-old’s birthday. By hour two, the mothers will be staring blankly into the distance and the fathers will be huddled around the grill arguing about charcoal. We could steal the actual house and they wouldn’t notice until Tuesday. What is the escape route? The bouncy castle is our cover. If things go south, we jump onto the inflatable slide, bounce over the cedar fence, and scramble into the getaway van. Is that safe? I have a bad knee. You will be wearing oversized clown shoes, Bill. They have excellent traction. Now, put on the red nose. It smells like cheap plastic and regret. That is the smell of economic freedom, my friend. Now, let’s go make some balloon animals.
created_at
"2026-05-20T07:51:52.594Z"
5/20/2026, 7:51:52 AM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
925
moderation_latency
(null)
generation_time
4657
tokens_prompt
77
tokens_completion
692
native_tokens_prompt
67
native_tokens_completion
606
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0055545
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"GGgNauu3L7bTjMcP-82ruQM"
is_byok
false
latency
925
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263512-YadukO1yfdyfKTG37snz"
session_id
(null)
api_type
"completions"
id
"gen-1779263512-VEn3mKv3BQ4OYaT5bNQ6"
upstream_id
"GGgNauu3L7bTjMcP-82ruQM"
total_cost
0.0055545
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.08%
Matches word count
n/a
neededClean
false
words
471
0.00%
Dialogue to Total Word Ratio
Ratio: 0.63%, Deviation: 69.37%
neededClean
false
wordsTotal
476
wordsDialogue
3
0.0424%