NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2025-11-18-01KACP54650SJQPA2AW05QG744
Run Details
Model:
Gemini 2.5 Pro
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
35.6s
Cost:
$0.046314
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
81.27%
Matches word count
(details)
99.68%
Dialogue to Total Word Ratio
(details)
90.4759%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly jabbed a greasy fork at the napkin spread across their peeling booth table. The surface was a chaotic blueprint of a child’s birthday party, complete with a terrifyingly large sun drawn in ketchup. Bill leaned in, the gurgle of his milkshake the operation’s only soundtrack. “Okay, the bouncy castle is here,” she stated. “It’s our visual obstruction and primary exit.” Bill squinted, tracing a dotted line with his finger. “And this?” “That’s the Splash Zone. High-powered sprinkler. Galoshes required.” “Galoshes. Got it.” He nodded gravely. “So the score is piled on the patio?” “Presents, Bill. They’re called presents.” Her patience was wearing visibly thin. “The parents are the main obstacle until three.” “What happens at three?” “The magician. His finale uses flash paper. That’s our diversion.” A slow grin spread across Bill’s face. “Using a children’s entertainer as an unwitting accomplice. Diabolical.” “He works for cake.” Molly’s fork hovered over a furious scribble. “Security is one agent: a teacup poodle named Fifi.” Bill paled. “Not Fifi. Her bark is pure psychological warfare.” “Don’t worry,” Molly said flatly, a glint in her eye. “I have a squeaky steak.”
created_at
"2025-11-18T23:53:20.883432+00:00"
11/18/2025, 11:53:20 PM
model
"google/gemini-2.5-pro"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2047
moderation_latency
(null)
generation_time
35568
tokens_prompt
68
tokens_completion
1914
native_tokens_prompt
67
native_tokens_completion
4623
native_tokens_completion_images
0
native_tokens_reasoning
4342
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.04631375
api_type
"completions"
id
"gen-1763509965-dYmXGObrI2ouKV2sYiKf"
upstream_id
(null)
total_cost
0.04631375
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
81.27%
Matches word count
n/a
neededClean
false
words
188
99.68%
Dialogue to Total Word Ratio
Ratio: 52.38%, Deviation: 2.38%
neededClean
false
wordsTotal
189
wordsDialogue
99
90.4759%