NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94XRTT8VGGKYC51PWGEM9H
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
12.0s
Cost:
$0.006793
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
36.79%
Dialogue to Total Word Ratio
(details)
18.3940%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Seriously, Bill? A bouncy castle? How do we get fifty pounds of LEGO through that without looking suspicious?" "Suspicious? Moll, we're aiming for strategic chaos. Think distraction! A rogue balloon artist?" "Your side. Not causing a scene. We blend." "So I'm the disillusioned clown. You're the one-legged pirate." "I'm 'the overwhelmed mum, wandering off with her 'gift bag' containing *all* the presents.' You, Bill, need a *subtle* diversion." "Subtle? But the cake table! The frosting! The sprinkles! Glorious!" "No, Bill. Ferret incident? Tiny Bartholomew tried to steal a toddler's tiara. Not subtle." "He wanted a souvenir." "He wanted a felony. Look, you cause a *mild* kerfuffle. Juice box spill, tripping hazard. I grab the loot. Operation Sugar Rush. We'll be swimming in plastic and glitter." "But what *kind* of glitter? Biodegradable?" "The kind that doesn't get us arrested, dolt! And don't make eye contact with any five-year-olds, Bill. They see *everything*."
created_at
"2026-02-25T00:59:29.505Z"
2/25/2026, 12:59:29 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1382
moderation_latency
(null)
generation_time
11931
tokens_prompt
77
tokens_completion
1182
native_tokens_prompt
67
native_tokens_completion
2709
native_tokens_completion_images
0
native_tokens_reasoning
2456
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0067926
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1382
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981169-2i94lvwgP24qYdUjkCRl"
upstream_id
(null)
total_cost
0.0067926
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
151
36.79%
Dialogue to Total Word Ratio
Ratio: 100.00%, Deviation: 10.00%
neededClean
false
wordsTotal
154
wordsDialogue
154
18.3940%