NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94T8DQTRT924CHSZ6E4NAF
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
22.0s
Cost:
$0.002178
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
77.92%
Dialogue to Total Word Ratio
(details)
38.9624%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The booth was a shrine to the deep-fried, smelling faintly of forgotten dreams and chili oil. Molly, all sharp angles and contained energy, spread a crumpled, tear-stained napkin across the sticky table. It wasn't evidence of emotional distress, but rather the detailed blueprint of her latest caper, the edges softened by countless hours of desperate planning. The dim, flickering fluorescent lights of the diner cast long shadows, highlighting the clutter on their table: half-eaten toast, an empty jam packet, and Molly’s ever-present lock-picking set, polished to a dull sheen. Her fingers traced a crudely drawn rectangle labelled 'Party Zone'. Across from her, Bill, a man built like an overstuffed armchair, was attempting to juggle three sugar packets, dropping them with embarrassing frequency. He paused, watching one packet skitter under the table, his brow creased in thought. The air hung heavy with the mingled scents of frying bacon and unspoken anxieties. This wasn't their usual turf, far from the gleaming banks and hushed auction houses. This was… fluff. But fluff with potential. Big, brightly wrapped potential. Molly tapped the napkin. "Alright, Bill. The target's secured." Bill blinked, his focus shifting from the floor to her. "Secured? Like, triple-locked vault secured?" Molly sighed, a familiar sound of weary exasperation. "No, Bill. The *event*. A child’s birthday party. We're going in for the presents.” He brightened considerably, his scattered sugar packets forgotten. "Presents! You mean like, toys? And maybe a decoder ring?" "Precisely. And the method of entry is key. We'll be posing as entertainers. 'Happy Hamster Havoc'." She pointed a sharp fingernail at a section of the napkin labelled 'Stage Area'. "You're the hamster handler." Bill leaned forward, intrigued. "Do I get a tail?" "You get a full-body hamster suit, Bill. Your job: create extreme, overwhelming cuteness, plus a thirty-minute puppet show. While the tykes are captivated by Mr. Nibbles' existential crisis, I'll be extracting anything that looks expensive enough to pawn. The plan is simple: distraction and extraction." He nodded slowly, picturing himself as the giant rodent. "So, I just gotta be a fuzzy distraction while you do the sneaky bits. Got it." Molly tapped the napkin again, her gaze distant. The sheer volume of potential loot, all gathered conveniently in one place, was staggering. It was the kind of haul that could fund them for months.
created_at
"2026-02-25T00:57:34.402Z"
2/25/2026, 12:57:34 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1214
moderation_latency
(null)
generation_time
21960
tokens_prompt
77
tokens_completion
3274
native_tokens_prompt
67
native_tokens_completion
5429
native_tokens_completion_images
0
native_tokens_reasoning
4894
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0021783
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1214
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981054-DoV8Mtf0INm9rW1we8BT"
upstream_id
(null)
total_cost
0.0021783
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
386
77.92%
Dialogue to Total Word Ratio
Ratio: 32.23%, Deviation: 2.23%
neededClean
false
wordsTotal
394
wordsDialogue
127
38.9624%