NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95EB1CK6CDY8TXX2TFHT47
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
5.6s
Cost:
$0.000513
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
12.95%
Dialogue to Total Word Ratio
(details)
28.1665%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her goggles, the glitter-strewn chaos of ‘Princess Sparkle’s Third Birthday Bash’ unfolding below. Tiny humans shrieked with glee near a deflated unicorn. Bill, dressed inexplicably as a giant crayon, shifted his weight nervously. "You really think this stunt is gonna work, Mol? Stealing from a gaggle of toddlers?" Bill whispered, tugging at his felt hat. "Seems a tad unsporting, even for us." "Nonsense, Bill," Molly replied smoothly, her gaze fixed on a suspiciously large pile of wrapped boxes under a cherry tree. "It's not stealing, it's an early Christmas redistribution. Think of them as underdeveloped entrepreneurs. Our job is to give their capital a more experienced manager. You create the diversion when the cake comes out. I’ll charm the gift bags." "Diversion? What kind of diversion?" Bill asked, eyes wide. "I’m a crayon, Mol. My options are limited. Do I… draw attention to myself?" Molly winked, producing a miniature smoke bomb disguised as a party favour. "Exactly. A rather colourful distraction, one might say. Now, remember the signal. When you hear three ‘Happy Birthdays,’ we move. And try not to melt."
created_at
"2026-02-25T01:08:32.435Z"
2/25/2026, 1:08:32 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1549
moderation_latency
(null)
generation_time
5512
tokens_prompt
77
tokens_completion
757
native_tokens_prompt
67
native_tokens_completion
1265
native_tokens_completion_images
0
native_tokens_reasoning
999
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0005127
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1549
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981712-WdA1UzuKshrT21Y457wM"
upstream_id
(null)
total_cost
0.0005127
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
183
12.95%
Dialogue to Total Word Ratio
Ratio: 61.96%, Deviation: 11.96%
neededClean
false
wordsTotal
184
wordsDialogue
114
28.1665%