NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95JG1FVA7TJYKYGRXQDPS5
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
30.8s
Cost:
$0.019290
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
99.50%
Dialogue to Total Word Ratio
(details)
49.7493%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The smeared party invitation, a garish beacon of childhood glee, lay on the worn oak table. Molly, a vision of sleek intent, tapped a perfectly manicured nail against Teddy 'the Terrible' Thompson's crayon-drawn sun. Her sharp mind, however, saw not innocent whimsy, but a vault of unguarded treasures, gleaming with potential. Bill, her perpetually weary partner, merely squinted at the colorful tableau. "Fifth birthday bash. A haul bigger than most bank vaults, Bill." "Loot, Molly? For a five-year-old? Plastic fantastic. Chewed teddies," he grumbled, rubbing his chin. "Don't be naive. High-net-worth parents. Bespoke action figures, artisanal play-doh, a real diamond pacifier. This is an *art* heist." Her grin practically peeled paint. He foresaw sticky chaos. "Toddlers. Fizzy the clown. Security is projectile cake." "Exactly! Mayhem is our cover. Fizzy distracts, you hit the kitchen. Oldest trick." "And your diversion?" "Puppies in the bouncy castle." "Puppies and fire trucks," Bill groaned, utterly deflated. "My life's pinnacle." "Think miniature Tesla," Molly countered with a wink.
created_at
"2026-02-25T01:10:48.63Z"
2/25/2026, 1:10:48 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1622
moderation_latency
(null)
generation_time
30776
tokens_prompt
77
tokens_completion
2973
native_tokens_prompt
67
native_tokens_completion
7708
native_tokens_completion_images
0
native_tokens_reasoning
7455
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0192901
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1622
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981848-YZaDv9ELu4a4KOZPte4U"
upstream_id
(null)
total_cost
0.0192901
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
163
99.50%
Dialogue to Total Word Ratio
Ratio: 52.66%, Deviation: 2.66%
neededClean
false
wordsTotal
169
wordsDialogue
89
49.7493%