NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94YSGN513YVRBQ3RZRJYHZ
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
9.5s
Cost:
$0.000773
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
60.28%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
30.1378%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped a smudged diagram drawn on the back of a Pizza Palace menu. "Operation: Birthday Bonanza is a go. Timmy's seventh. Max present accumulation window opens precisely at three PM." Bill squinted, his perpetually hopeful eyes tracing the greasy lines. "So, we crash the party, snag the loot, and vanish like a fart in the wind?" "Subtlety, Bill. We're ninjas in disguise, not rampaging toddlers." Molly adjusted her ill-fitting fake glasses. "My intel suggests the prime loot is near the sofa, piled high. We need a distraction. I've procured props." "Ooh, props!" Bill bounced in his seat. "Am I a giant, menacing bunny rabbit?" "You're a clown. A balloon-twisting, face-painting terror. While you're busy terrifying the small ones and promising them cheap plastic trinkets, I'll be liberating the wrapped treasures. Focus on anything shimmering." Molly ignored Bill’s sudden attempt to juggle two sugar packets, which promptly exploded. "No juggling. Just clowning." Bill looked crestfallen. "But what about the cake?" Molly sighed, already calculating the escape route. "Priorities, Bill. Presents first, cake second. Or maybe not at all. It's a heist, not a five-star buffet."
created_at
"2026-02-25T01:00:02.975Z"
2/25/2026, 1:00:02 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1226
moderation_latency
(null)
generation_time
9442
tokens_prompt
77
tokens_completion
1054
native_tokens_prompt
67
native_tokens_completion
1917
native_tokens_completion_images
0
native_tokens_reasoning
1635
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0007735
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1226
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981202-PcGoC4unkjXVlUIp8e1Q"
upstream_id
(null)
total_cost
0.0007735
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
60.28%
Matches word count
n/a
neededClean
false
words
185
0.00%
Dialogue to Total Word Ratio
Ratio: 68.78%, Deviation: 18.78%
neededClean
false
wordsTotal
189
wordsDialogue
130
30.1378%