NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ954491GD96PMWZKV3TRZDA
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
6.7s
Cost:
$0.003813
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
99.92%
Matches word count
(details)
99.96%
Dialogue to Total Word Ratio
(details)
99.9370%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned closer to the walkie-talkie, the faint scent of plasticine and party cake wafting from Bill's end. Her own vantage point from the neighbour’s trimmed hedge offered an excellent view of the mayhem unfolding in the backyard. Balloons bounced with a mind of their own, and the saccharine melodies of a children's sing-along blared, providing surprisingly good cover. This was it. The children, utterly oblivious, would become unwitting pawns in the most ethically questionable, yet lucrative, operation of their careers. Bill, disguised poorly as a gardening enthusiast, was already blending in, or rather, standing out. “Okay, so you’ve got eyes on the bouncy castle?” Molly whispered. “Affirmative, Molly. Operation ‘Birthday Bonanza Bust’ is a go. Big inflatable obstacle, small potential witnesses.” “Witnesses, Bill? They're five. Their biggest worry is if Elsa sings ‘Let It Go’ again.” “A legitimate concern. But the clown looks shifty. Definitely ex-military, seen that stare before.” “He’s holding a balloon dog, not a rifle. Our target is the gift table. Confirm layout.” "Rectangular, laden. Mostly Paw Patrol, some unicorns. Security? Nonexistent. Unless Mrs. Higgins' prize-winning petunias are a perimeter defense." “They are if you're allergic. So, the piñata explodes, that's our window.” “Chaos diversion. Brilliant. Pure, unadulterated chaos.”
created_at
"2026-02-25T01:02:57.831Z"
2/25/2026, 1:02:57 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1514
moderation_latency
(null)
generation_time
6679
tokens_prompt
77
tokens_completion
897
native_tokens_prompt
67
native_tokens_completion
1517
native_tokens_completion_images
0
native_tokens_reasoning
1196
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0038126
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1514
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981377-AlF9WMi9i14BqFHWSxOc"
upstream_id
(null)
total_cost
0.0038126
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
99.92%
Matches word count
n/a
neededClean
false
words
203
99.96%
Dialogue to Total Word Ratio
Ratio: 51.46%, Deviation: 1.46%
neededClean
false
wordsTotal
206
wordsDialogue
106
99.9370%