NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-20-01KS25YB63Y3WV1WM82QNCM1PG
Run Details
Model:
Gemini 3.5 Flash (Reasoning, Minimal)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4.5s
Cost:
$0.005464
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.08%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0424%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly patted her pockets and checked the straps of her oversized tactical vest. Bill, why am I wearing camouflage when the target venue is painted hot pink and smells of cotton candy? Because we are professionals, Molly, and professionals respect the terrain. Bill adjusted his fake moustache, which was already peeling off at the left corner. Now, do you have the blueprints? You mean the crayon drawing of the bouncy castle? Molly pulled a crumpled napkin from her pocket and smoothed it against the hood of their dented getaway hatchback. Yes, I have them. The cake is scheduled for three o'clock. The sugar high will peak at three-fifteen. That is our window of maximum chaos. Brilliant. Bill nodded solemnly, squinting through a pair of plastic binoculars. The guards are formidable, though. I count at least twelve toddlers hopped up on apple juice. One of them is wielding a giant plastic bubble wand. He looks like a loose cannon. We can bypass the bubble guard if we use the diversion. Molly pointed at a cardboard box in the back seat. Is the decoy ready? The decoy is primed. Bill reached into the box and pulled out a stuffed bear that looked like it had survived a lawnmower accident. I call him Sergeant Snuggles. When we throw him into the sandbox, they will swarm. It will be like piranhas on a pork chop. And what about the mark? Molly tapped her fingers on the map. Where are they keeping the loot? The pile is located in the solarium, adjacent to the clown station. Bill shuddered. We must avoid the clown at all costs, Molly. His balloons are loud, and his face holds no joy. Agreed. No clowns. We grab the pile, bag the goods, and make a run for the hedge. Molly leaned closer, her eyes narrowing. What about the parents? They are completely distracted by the artisan mimosa bar. Bill smirked, tapping his temple. I hacked their playlist earlier. At exactly three-twenty, I am dropping a heavy metal track. The ensuing parental confusion will give us the cover we need. You are a criminal mastermind, Bill. Molly zipped up her duffel bag with a sharp metallic hiss. Let us review the rules. No pony rides, no face painting, and we do not touch the bouncy castle under any circumstances. But what if the bouncy castle is unoccupied? Bill pleaded, his eyes shining. It has a slide, Molly. A giant inflatable slide. Focus, Bill! If you get stuck in the castle again like the Henderson job, I am leaving you behind. That was a structural collapse, and you know it. Bill sniffed, straightening his vest. Besides, I rescued three juice boxes. Just get your potato sack ready. Molly opened her door, the scent of vanilla frosting drifting through the air. It is showtime.
created_at
"2026-05-20T07:52:28.615Z"
5/20/2026, 7:52:28 AM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
707
moderation_latency
(null)
generation_time
4374
tokens_prompt
77
tokens_completion
696
native_tokens_prompt
67
native_tokens_completion
596
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0054645
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"PGgNapPiLpa__uMP29DIqQE"
is_byok
false
latency
707
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263548-568DRkRIqKnF2KnGqtmO"
session_id
(null)
api_type
"completions"
id
"gen-1779263548-tGvFwG7owCOa6qfZYw1s"
upstream_id
"PGgNapPiLpa__uMP29DIqQE"
total_cost
0.0054645
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.08%
Matches word count
n/a
neededClean
false
words
471
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
473
wordsDialogue
0
0.0424%