NC Bench
Run: 2026-02-25-01KJ94HX81907WA40Z371BDE89
Run Details
Model: Gemini 2.5 Flash Lite (Reasoning)
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 9.4s
Cost: $0.001016
Provider: Google AI Studio
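The reported cost can be roughly cross-checked against the native token counts listed in the run metadata further down this page. The sketch below is only an illustration: the per-token rates are assumed list prices that this page does not state, and `estimate_cost` is a hypothetical helper, not part of NC Bench or of any provider API.

```python
# Assumed list prices in USD per one million tokens. These rates are NOT
# stated on this page; they are an assumption used for the cross-check.
ASSUMED_INPUT_USD_PER_MTOK = 0.10
ASSUMED_OUTPUT_USD_PER_MTOK = 0.40


def estimate_cost(prompt_tokens, completion_tokens,
                  input_rate=ASSUMED_INPUT_USD_PER_MTOK,
                  output_rate=ASSUMED_OUTPUT_USD_PER_MTOK):
    """Estimate the USD cost of one request from its token counts."""
    return (prompt_tokens * input_rate + completion_tokens * output_rate) / 1_000_000


# Values copied from the run metadata on this page.
native_tokens_prompt = 67
# Presumably includes the 2145 reasoning tokens reported separately.
native_tokens_completion = 2524

estimated = estimate_cost(native_tokens_prompt, native_tokens_completion)
print(f"${estimated:.7f}")
```

Under those assumed rates the estimate comes out at about $0.0010163, which lines up with the `total_cost` value in the metadata. Most of that cost comes from completion tokens, which here are dominated by reasoning rather than by the visible response.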
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0018%
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly traced a finger over the crudely drawn floor plan, a smudge of what looked suspiciously like jam marring the diagram of the Henderson residence. Bill leaned closer, his breath smelling faintly of lukewarm tea and questionable biscuits. "So, this is it? Operation: Birthday Bonanza?" It was a surprisingly ambitious name for stealing presents from a six-year-old's party. "Precisely," Molly whispered, pointing to a bold 'X' near a drawing of a cake. "The treasure hoard is situated in the living room, unguarded during the crucial 'Pass the Parcel' phase. We slip in, grab the loot, and vanish like a fart in the wind, leaving only baffled parents and a distinct lack of joy." Her eyes gleamed with professional avarice. Bill scratched his chin. "But aren't the little ankle-biters usually, you know, *everywhere*? And what if they've got party poppers? Those things are loud enough to rupture eardrums." "That's where your role as Chief Distraction comes in," Molly explained, tapping a picture of a clown. "You go in dressed as 'Jolly Jangles', complete with a menacingly cheerful balloon animal. While they focus on whatever horror you conjure, I'll make my move on the presents. Simple, effective, and deeply regrettable for future therapy bills." "My balloon animal skills are rusty, Molly. Last time I made a 'dog', it looked more like a deflated giraffe having a seizure. And 'Jolly Jangles' sounds utterly terrifying to a small child." Molly sighed dramatically, pushing a stray curl from her face. "Nonsense. It’s foolproof. Just try not to scare the kiddies *too* much, alright? We need them crying over missing toys, not primeval terror."
created_at: "2026-02-25T00:53:00.806Z" (2/25/2026, 12:53:00 AM)
model: "google/gemini-2.5-flash-lite"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1416
moderation_latency: (null)
generation_time: 9393
tokens_prompt: 77
tokens_completion: 1430
native_tokens_prompt: 67
native_tokens_completion: 2524
native_tokens_completion_images: 0
native_tokens_reasoning: 2145
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0010163
router: (null)
provider_responses:
0:
endpoint_id: "ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok: false
latency: 1416
model_permaslug: "google/gemini-2.5-flash-lite"
provider_name: "Google AI Studio"
status: 200
api_type: "completions"
id: "gen-1771980780-84wbHfMMh4hxqsng8nFz"
upstream_id: (null)
total_cost: 0.0010163
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean: false, words: 268
Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 67.90%, Deviation: 17.90%
Meta Data: neededClean: false, wordsTotal: 271, wordsDialogue: 184
Overall: 0.0018%
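The ratio and deviation figures for the dialogue evaluator follow directly from the reported word counts (184 dialogue words out of 271 total, against the 50% target in the scenario). The sketch below reproduces that arithmetic and shows one possible way to count dialogue words. It is an assumption, not NC Bench's actual evaluator: `dialogue_ratio` and `deviation_from_target` are hypothetical helpers, words are split on whitespace, and only text inside straight double quotes is treated as dialogue, so counts on real text may differ from the evaluator's.

```python
import re


def dialogue_ratio(text):
    """Return (dialogue_words, total_words, ratio) for a passage.

    Assumption: dialogue is whatever sits between straight double quotes,
    and a word is any whitespace-separated token.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio, target=0.5):
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Reproduce the reported figures from the evaluator's own word counts.
reported_ratio = 184 / 271
reported_deviation = deviation_from_target(reported_ratio)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {reported_deviation:.2%}")
# prints "Ratio: 67.90%, Deviation: 17.90%"
```

For example, `dialogue_ratio('"Hi there," she said.')` returns `(2, 4, 0.5)` under these assumptions. How the evaluator maps a 17.90% deviation to a 0.00% score is not shown on this page, so the sketch stops at the deviation.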