NC Bench
Run: 2026-02-20-01KHY9W4F867VNH4VNFYXJJEH2
Run Details
Model: Gemma 3 27B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 22.0s
Cost: $0.000144
Provider: Novita
Evaluation summary
Evaluator: Score
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The chipped Formica of the diner booth felt cold under Molly’s elbows. She traced the floral pattern with a sugar packet, avoiding Bill’s increasingly frantic gaze. “Okay, so let me get this straight,” Bill said, his voice a strained whisper despite the lunchtime bustle around them. “We’re… we’re robbing a birthday party.” Molly finally looked up, a small, entirely unapologetic smile playing on her lips. “Technically, Bill, we’re *reallocating* resources. Think of it as a festive redistribution of wealth.” “It's presents, Molly! For kids! Little Timmy's seventh birthday!” “Little Timmy has parents with disposable income, Bill. And frankly, a pirate-themed party is asking for trouble. Too many potential trip hazards for a sophisticated operation like ours.” Bill groaned, running a hand through his thinning hair. “Sophisticated? We’re going in dressed as… clowns.” “Strategic camouflage! Everyone expects clowns at a kid’s party. No one suspects a clown of grand larceny.” Molly tapped the sugar packet against the table. “Plus, I found these amazing oversized shoes. Seriously, they’re fabulous.” “Fabulous doesn't help me explain to the judge why I was apprehended with a bouncy castle in my van.” “Details, details. Look, the intel is solid. Mrs. Henderson, Timmy’s mother, is a notorious over-giver. The guest list is exclusively the children of the town’s one-percenters. We’re talking designer toys, gaming consoles, potentially even a miniature pony.” “A pony? You didn’t mention a pony!” Bill’s eyes widened. “A potential asset. Assess on entry. Now, the plan. I create a distraction – a rogue balloon animal, perhaps a truly magnificent bubble sculpture. You sweep the gift table, efficient and discreet. We meet back at the van, and we’re golden.” “And what if Timmy or one of his little rich friends catches us?” “Improvise. You’re good at improv. Remember the art gallery job?” Bill shuddered. “Let’s not speak of the interpretive dance.” “It bought us ten minutes, Bill! 
Ten glorious minutes! This is similar – controlled chaos. Think of it as performance art with a financial incentive.” “I think of it as a terrible idea.” “Oh, come on, Bill. When was the last time you felt genuinely *alive*? When did you last experience the thrill of the… gift acquisition?” “I bought myself a new wrench last week and I was pretty thrilled.” Molly sighed. “Look, we need this, Bill. The landlord’s breathing down my neck, and your pigeon racing habit isn’t exactly bringing in the big bucks.” “They’re pedigree pigeons, Molly! And Pip is a contender for the regional championships!” “Pip can wait. Timmy’s presents can’t. Do we have a deal?” Bill stared at his coffee, stirring it with a despondent expression. “Do the clown shoes light up?” Molly grinned. “Naturally.”
created_at
"2026-02-20T19:54:20.016Z"
2/20/2026, 7:54:20 PM
model
"google/gemma-3-27b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
584
moderation_latency
(null)
generation_time
21934
tokens_prompt
93
tokens_completion
795
native_tokens_prompt
74
native_tokens_completion
674
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000143606
router
(null)
provider_responses
0
endpoint_id
"cf530f36-7cb8-4eb9-8953-880eef952af3"
id
"9e574433-4d6d-4778-9180-4856972679dc"
is_byok
false
latency
513
model_permaslug
"google/gemma-3-27b-it"
provider_name
"Novita"
status
200
api_type
"completions"
id
"gen-1771617260-MtkMABatXWSzCHiZz8BS"
upstream_id
"9e574433-4d6d-4778-9180-4856972679dc"
total_cost
0.000143606
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean false, words 446
Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 80.04%, Deviation: 30.04%
Meta Data: neededClean false, wordsTotal 446, wordsDialogue 357
Overall
Result: 0.0000%
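The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator can be reproduced from the wordsTotal and wordsDialogue values. The following is a minimal sketch, not the benchmark's own code: the function names are hypothetical, the 50% and 500-word targets are taken from the scenario text, and expressing the word count gap as a percentage of the target is an assumption. How the evaluators turn these figures into a 0.00% score is not stated on this page and is not modeled here.

```python
def dialogue_ratio_report(words_total, words_dialogue, target_ratio=0.50):
    """Return (ratio, deviation) as percentages rounded to two decimals.

    target_ratio defaults to 0.50, the "50% dialogue" goal from the scenario.
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target_ratio)
    return round(ratio * 100, 2), round(deviation * 100, 2)


def word_count_gap(words, target_words=500):
    """Return the gap between the word count and the target, as a percentage of the target.

    This normalization is an assumption made for illustration.
    """
    return round(abs(words - target_words) / target_words * 100, 2)


# Values reported in the evaluation metadata for this run.
ratio_pct, deviation_pct = dialogue_ratio_report(words_total=446, words_dialogue=357)
gap_pct = word_count_gap(words=446)

print(f"Ratio: {ratio_pct:.2f}%, Deviation: {deviation_pct:.2f}%")
print(f"Word count gap from 500-word target: {gap_pct:.2f}%")
```

With this run's values the first line matches the reported details (Ratio: 80.04%, Deviation: 30.04%), and the response comes in 54 words under the 500-word target.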