NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYAXPAG0RT6PRDHCZ1CW99S
Run Details
Model:
Gemma 3 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
22.0s
Cost:
$0.000316
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The chipped Formica of the diner booth stuck to Molly’s elbows. She’d chosen this place specifically for its sticky surfaces and the lingering aroma of regret and burnt coffee. It felt…appropriate for the operation. Across from her, Bill was meticulously dismantling a sugar packet, arranging the tiny granules into a disturbing miniature fort. “You’re building a rampart against existential dread again, aren’t you?” Molly asked, stirring her lukewarm coffee. Bill didn’t look up. “It’s strategic. Think about it. Parallel lines of defense. A core reinforcement zone. This represents the laser grid.” “There is no laser grid, Bill.” “There could be. It’s a high-end party, Molly. Little Timmy Hargrove is the grandson of Bartholomew Hargrove, the shipping magnate. They probably have retinal scanners protecting the pin the tail on the donkey.” He finally looked up, his eyes glinting with manic energy. “The point is, we need to anticipate everything.” Molly sighed. “The point is, we’re stealing presents from a bunch of eight-year-olds.” “High-value presents. Think Lego Millennium Falcons, remote control robots, possibly even a pony. A miniature, ethically sourced pony, naturally. Hargrove wouldn't be caught dead with a non-organic pony.” “A pony?” Molly actually choked on her coffee. “Just a possibility. We need to be prepared for all eventualities. Okay, let's review. Operation: Unwrap Rage. Phase one: infiltration. You’re on distraction duty.” “And what, pray tell, is my distraction supposed to be? Juggling flaming marshmallows?” Molly was already imagining the headline: ‘Granny Arrested for Arson at Birthday Bash’. “Better. You’re a face painter.” Molly stared at him. “A face painter? Seriously?” “Think about it! Access to the children, covering our tracks amongst the chaos, and a plausible reason to be hovering around the gift table. You’re surprisingly good at butterflies, remember that street fair?” “That was one time, Bill! And those butterflies looked more like moths suffering an existential crisis.” “Details, details. You’ll be ‘Sparkles the Magnificent’. I’ll be ‘Barry, the Helpful Uncle’. I’ve grown a truly magnificent beard for the occasion.” “You always have a magnificent beard, Bill.” “This one is specifically magnificent for helpful uncle-ing.” He paused, tapping the sugar fort. “Timing is crucial. The magic show is at two-thirty. That’s our window. Everyone's focused on doves and disappearing scarves. We go in, grab the goods, and vanish before little Timmy realises his inheritance has been…reallocated.” “And then what? We fence a pile of kids' toys?” “There's a collector, a very discerning collector, who specializes in high-end children’s merchandise. Apparently, original packaging is key. He operates out of a laundromat in Little Italy.” Molly rubbed her temples. “A collector. Of course. This is officially the most ridiculous heist I’ve ever been involved in.” Bill beamed. “Isn’t it glorious?”
created_at
"2026-02-20T20:12:39.643Z"
2/20/2026, 8:12:39 PM
model
"google/gemma-3-27b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
227
moderation_latency
(null)
generation_time
21937
tokens_prompt
93
tokens_completion
832
native_tokens_prompt
75
native_tokens_completion
688
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
16
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0003156
router
(null)
provider_responses
0
endpoint_id
"2f608ade-87b1-46ed-8ae7-2714dddf2abb"
id
"cmpl-ae6698f3834f4f18aa86c30bfca5b6ff"
is_byok
false
latency
227
model_permaslug
"google/gemma-3-27b-it"
provider_name
"Parasail"
status
200
api_type
"completions"
id
"gen-1771618359-7HH7B5Z8RxrvVNHBPDIc"
upstream_id
"cmpl-ae6698f3834f4f18aa86c30bfca5b6ff"
total_cost
0.0003156
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Parasail"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
456
0.00%
Dialogue to Total Word Ratio
Ratio: 77.41%, Deviation: 47.41%
neededClean
false
wordsTotal
456
wordsDialogue
353
0.0000%