NC Bench
Run: 2026-05-20-01KS25VP82Y3Z1DBKYYP0Y91E8
Run Details
Model: Gemini 3.5 Flash (Reasoning, Minimal)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 3.5s
Cost: $0.003908
Provider: Google AI Studio
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The greasepaint is itching, Bill, Molly rasped, adjusting her oversized polka-dot bowtie. I feel like an absolute idiot. You look like an absolute idiot, Molly, but that is the price of tactical genius, Bill replied, squeezing a large red squeaker onto his nose. If anyone asks, we are Barnaby and Bumbles, the last-minute balloon animal emergency response team. A nine-year-old’s birthday is not an emergency, Bill, she sighed. It is a breeding ground for sticky fingers and high-pitched screaming. Precisely, and that is our cover, Bill said, peering through the hedge at the chaotic backyard. Parents are distracted by sugar crashes and existential dread. The target is the gift table on the patio. Have you scanned the perimeter security? Molly asked. A hyperactive Golden Retriever and a clown who looks like he’s practicing for his mugshot, Bill whispered. I’ll distract the clown with a fierce debate about balloon folding techniques. You slip past the cake station and bag the loot. What if the birthday boy spots me? Molly inquired. Tell him you’re doing a magic trick where his Xbox disappears, Bill said. He won’t even blink. We only have a four-minute window before the pinata drops, she warned, checking her watch. Once the papier-mache donkey breaks, it is total mob rule out there. I’ve factored in the candy-frenzy trajectory, Bill assured her, pulling a giant velvet sack from his pocket. We strike when the first piece of taffy hits the grass. Are the smoke bombs ready? These are glitter bombs, Bill, Molly corrected, holding up two small plastic eggs. You bought glitter bombs. Same thing, but more festive, he grinned. If I get glitter in my eye, I am leaving you behind to face the bouncy castle, she threatened. Fair play, Bill agreed. Let’s move. Remember, keep your knees high and your squeaker ready.
created_at: "2026-05-20T07:51:01.656Z" (5/20/2026, 7:51:01 AM)
model: "google/gemini-3.5-flash-20260519"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 786
moderation_latency: (null)
generation_time: 3279
tokens_prompt: 77
tokens_completion: 485
native_tokens_prompt: 67
native_tokens_completion: 423
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.0039075
router: (null)
provider_responses[0]:
    endpoint_id: "bd50061c-e1d8-469e-96b1-d322e77378f8"
    id: "5WcNasD9Nayb_uMPioHnmQY"
    is_byok: false
    latency: 786
    model_permaslug: "google/gemini-3.5-flash-20260519"
    provider_name: "Google AI Studio"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779263461-b6Pr8rm3TTvHqoLnFI7v"
session_id: (null)
api_type: "completions"
id: "gen-1779263461-xU0xcQxCc5b9pcYTStNQ"
upstream_id: "5WcNasD9Nayb_uMPioHnmQY"
total_cost: 0.0039075
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
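The generation_time and token counts in the metadata above can be combined into a rough throughput figure. The sketch below assumes that generation_time is reported in milliseconds, which the page does not state; the function name tokens_per_second is hypothetical and not part of any API shown here.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Rough decode throughput, assuming generation_time is in milliseconds."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the metadata of this run.
native_rate = tokens_per_second(423, 3279)      # native_tokens_completion
normalized_rate = tokens_per_second(485, 3279)  # tokens_completion

print(f"native: {native_rate:.1f} tok/s, normalized: {normalized_rate:.1f} tok/s")
```

Under that assumption the run works out to roughly 129 native completion tokens per second, or about 148 using the normalized tokens_completion count.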
Evaluation details
Matches word count
    Result: 0.00%
    Details: n/a
    Meta data: neededClean: false, words: 304
Dialogue to Total Word Ratio
    Result: 0.00%
    Details: Ratio: 0.00%, Deviation: 50.00%
    Meta data: neededClean: false, wordsTotal: 312, wordsDialogue: 0
Overall
    Result: 0.0000%
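The Dialogue to Total Word Ratio result above reports a ratio and a deviation from the 50% target. The following is a minimal sketch of how such a metric might be computed. It is not the benchmark's actual evaluator, whose counting rules are not shown on this page (the two evaluators report 304 and 312 words for the same response). The sketch assumes that dialogue means text enclosed in straight or curly double quotation marks and that words are whitespace-separated; the function name dialogue_ratio and the 0.5 default target are illustrative.

```python
import re

# Assumption: dialogue is whatever sits between straight or curly double quotes.
_DIALOGUE = re.compile(r'"([^"]*)"|“([^”]*)”')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_ratio(text: str, target: float = 0.5) -> dict:
    """Return word counts, the dialogue ratio, and its distance from target."""
    total = count_words(text)
    spoken = sum(count_words(a or b) for a, b in _DIALOGUE.findall(text))
    ratio = spoken / total if total else 0.0
    return {
        "wordsTotal": total,
        "wordsDialogue": spoken,
        "ratio": ratio,
        "deviation": abs(ratio - target),
    }


# The same sentence without and with quotation marks.
unquoted = dialogue_ratio("The greasepaint is itching, Bill, Molly rasped.")
quoted = dialogue_ratio('"The greasepaint is itching, Bill," Molly rasped.')
print(unquoted["wordsDialogue"], quoted["wordsDialogue"])
```

With a quote-based rule like this one, a response that omits quotation marks around speech yields zero dialogue words and a deviation equal to the full target, which is consistent with the wordsDialogue: 0 and Deviation: 50.00% values reported for this run.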