NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYAV48E5C2RM6D40YX35B1F
Run Details
Model:
Gemma 3 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
22.6s
Cost:
$0.000311
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The chipped Formica of the diner booth stuck to Bill’s elbows. He was meticulously dissecting a sugar packet, creating a miniature, geometrically-questionable pyramid. Molly watched him, stirring her lukewarm coffee with the kind of intensity usually reserved for bomb disposal. “You’re going to ruin the structural integrity of Big Sugar, Bill. Focus.” Bill didn’t look up. “It’s a metaphor, Molly. A metaphor for the house. Precarious foundations. Easily toppled.” “It’s a sugar packet.” “Everything is a metaphor if you squint hard enough.” He finally glanced up, a smear of sugar dusting his upper lip. “So, Operation ‘Gift Grab’ is a go, right? Because I’m having second thoughts about the bouncy castle.” “Second thoughts? We discussed the bouncy castle! It’s integral to the distraction!” Molly practically hissed. “Think of it, Bill. A swarm of sugar-fueled six-year-olds bouncing off the walls while we delicately liberate the spoils.” “Delicately liberate? It’s stealing presents from children, Molly. It feels…low.” “Low? We once stole a prize-winning gnome from a garden! This is practically philanthropy. Think of the resale value on a vintage My Little Pony collection.” Bill sighed, abandoning his sugar pyramid. “Okay, okay, ponies. Look, I’m just saying, little Timmy's face when he realises his robot dinosaur is missing? That’s brutal.” “Timmy will get another robot dinosaur. Probably a better one. His parents are clearly flush with cash if they're throwing a party with a llama.” “A llama?” Bill’s eyes widened. “A fully-fledged, spitting llama. Apparently, it’s a trend. The parents got it specifically for the ‘Instagrammable moments’." Molly rolled her eyes. “Which, frankly, is just peak human ridiculousness. Adds to the chaos, though. Perfect for us.” “Right. Chaos. Good. Llama. Bad. Presents. Good.” He rubbed his chin theatrically. “So, you grab the board games, I go for the tech. Avoid the glitter. Definitely avoid the glitter. It clings like regret.” “Agreed. Glitter is the bane of my existence. And focus on the pre-teen stuff. The Xbox controllers, the headphones…leave the princess dresses to the professionals.” “You think there’ll be a security system?” “For a children's party? Bill, please. The greatest security threat is probably a rogue piñata.” Molly took a long swig of her coffee. “The real challenge is avoiding being identified. No bright colours, blend in. I’m thinking ‘disgruntled uncle’. You?” Bill considered this. “I can do ‘overly enthusiastic magician’s assistant’. I’ve still got the sparkly waistcoat.” Molly choked on her coffee. “Absolutely not. You look like a rejected Elvis impersonator in that thing. Think…dad. Middle-aged, slightly awkward, desperately trying to connect with his niece. Beige is your friend.” Bill groaned. “Beige. My life is officially beige.”
created_at
"2026-02-20T20:11:15.609Z"
2/20/2026, 8:11:15 PM
model
"google/gemma-3-27b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
355
moderation_latency
(null)
generation_time
22517
tokens_prompt
93
tokens_completion
801
native_tokens_prompt
75
native_tokens_completion
678
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0003111
router
(null)
provider_responses
0
endpoint_id
"2f608ade-87b1-46ed-8ae7-2714dddf2abb"
id
"cmpl-23b9bb5e24854de7b7c237c0c6d18765"
is_byok
false
latency
279
model_permaslug
"google/gemma-3-27b-it"
provider_name
"Parasail"
status
200
api_type
"completions"
id
"gen-1771618275-pUF8ATpukcJEgTWc0mbf"
upstream_id
"cmpl-23b9bb5e24854de7b7c237c0c6d18765"
total_cost
0.0003111
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Parasail"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
442
0.00%
Dialogue to Total Word Ratio
Ratio: 78.28%, Deviation: 48.28%
neededClean
false
wordsTotal
442
wordsDialogue
346
0.0000%