NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-19-01KS0TV5CX1X05G3Q7W4DRBMK6
Run Details
Model:
Gemini 3.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 20s
Cost:
$0.194897
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her brass binoculars, her brow furrowed as she stared through the wet windshield of their rusted pest control van. Across the quiet suburban street, a massive green inflatable dragon was slowly rising in the manicured backyard of house forty. Dozens of helium balloons bobbed against the wooden fence, and the sharp screams of twenty wild children sent shivers of anxiety down her spine. This was an incredibly dangerous arena, heavily populated by unpredictable miniature humans and their tired parents. She slowly turned to her partner, who was currently distracted by a powdered pastry, leaving a thick white mustache on his upper lip. He was chewing loudly, completely unaware of the looming tactical challenge. "What is our status on the perimeter?" "The dragon is inflated, and there is a blind spot behind the tail." "Clowns are wildcards, Bill. We cannot risk a direct confrontation." "I can distract him with a balloon animal challenge." "No bribes, we only want clean stealth." "Understood, boss." Bill dabbed his sticky fingers on his trousers and pointed toward the rear hedge. He had spent several hours studying the property layout, though his short attention span often drifted toward the snacks. He retrieved a map drawn in colorful crayon, tapping a red circle over the living room where the big mountain of glossy gift boxes sat on a mahogany table. Molly leaned in close, examining his crude sketches with intense, professional focus. To anyone else, this was just a noisy celebration, but to them, it was a vault. "Can we handle the golden retriever?" "I bought three pounds of prime rib to cause a food coma." "Is that actually safe?" "It is basic biology, Bill. How do we haul the heavy cargo?" "We can use the yellow plastic escape slide." "We cannot slide down with forty heavy Lego boxes, Bill." Molly rubbed her temples, feeling a headache coming on. Bill possessed the physical strength of a bear but the tactical mind of a toddler. She looked back through the lens, focusing on the prize. The finest possession inside was a rare, unopened set of vintage space building blocks, gifted by a wealthy uncle, worth thousands on the collector market. There were also brand new gaming consoles, tablets, and small toy cars ripe for the picking. They just needed clean execution and absolute silence, two things Bill struggled with even on his best days. He was already laced into his heavy leather boots. "We must grab the big spaceship set first." "Can we also steal the chocolate cake?" "If you carry the sack properly, yes." "I will hold it like a newborn baby." "Just do not eat it before we escape." "No promises, Molly, it looks delicious." "Focus, Bill!" "Fine, I will behave." Bill grinned, pulling a black ski mask over his face, completely oblivious to the white sugar still coating his nose. Molly sighed, checked her wristwatch, and unlocked her side door as the first notes of happy birthday drifted across the quiet suburban street.
created_at
"2026-05-19T19:19:15.619Z"
5/19/2026, 7:19:15 PM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1358
moderation_latency
(null)
generation_time
79756
tokens_prompt
77
tokens_completion
5273
native_tokens_prompt
67
native_tokens_completion
21644
native_tokens_completion_images
0
native_tokens_reasoning
21011
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.1948965
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"s7cMarHzLqKyjrEP8NzzkAc"
is_byok
false
latency
1358
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779218355-MFa5yDjxkDlKdi8nVddA"
session_id
(null)
api_type
"completions"
id
"gen-1779218355-9AbojU8EBXe7Clbf1ka6"
upstream_id
"s7cMarHzLqKyjrEP8NzzkAc"
total_cost
0.1948965
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 30.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
150
100.0000%