NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYAXPH4SS9KK3M942D1FQGJ
Run Details
Model:
Gemma 3 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
11.7s
Cost:
$0.000202
Provider:
Nebius
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The chipped Formica table wobbled with every frustrated thump of Bill’s fist. Half-eaten toast crumbs clung to his beard like lost hopes. “A children’s party, Molly? Seriously? This is what we’ve come to?” Molly, meticulously polishing a pair of comically oversized sunglasses, didn’t look up. “Don’t knock it ‘til you’ve cased it, Bill. Old Man Hemlock’s grandson, Timothy, is turning six. And Timothy, according to my sources – Brenda, the dog walker – is *spoiled*. We’re talking a miniature train set, a remote controlled dinosaur, possibly even a pony.” “A pony?” Bill’s jaw dropped. “You're planning a heist for a *pony*?” “Not specifically the pony, although if it fits in the van…” Molly finally looked up, tapping the sunglasses against her chin. “The principle, Bill. The sheer volume of untraceable, easily fenced goods. Think of it. Designer board games. Rare action figures. That little electric guitar little Timmy will promptly ignore. It's a gifting goldmine.” Bill scrubbed a hand through his thinning hair. “So, we’re robbing children. That feels… low, even for us.” “We’re not robbing *children*, Bill. We’re retrieving misplaced generosity. Old Man Hemlock made his fortune exploiting artisanal cheese markets. This is karma. Besides, those kids have too much already.” “That’s… a terrible justification, even for you.” Bill sighed, reaching for the lukewarm coffee. “What’s the plan? We can’t just waltz in there dressed as clowns.” “Actually…” Molly’s eyes gleamed. “I was thinking face painting. I’ve been practicing butterflies.” Bill sputtered, nearly choking on his coffee. “Face painting? You? The woman who once tried to disguise herself as a potted fern?” “Hey! It almost worked. And this is different. It provides access. We blend. We observe. We liberate.” “Liberate? You make it sound like we’re rescuing the gifts from a tyrannical regime.” “Semantics, Bill. Now, the party starts at two. Brenda says there’s a bouncy castle. A bouncy castle presents both opportunity and challenge.” “Challenge? How does a bouncy castle challenge a professional, albeit morally questionable, theft operation?” “Visibility. And potential for collateral damage. We don’t want to decapitate little Timmy with an inflatable dinosaur.” “Decapitate? Good lord. Okay, butterflies and bouncy castle avoidance. What about security?” “Minimal. One bored teenage nephew mostly preoccupied with his phone. Brenda says he’s easily distracted by talk of limited-edition Pokémon cards.” “Pokémon cards. Of course. This is the level we’ve sunk to. Bribing a teenager with Pokémon cards to steal presents from a six-year-old.” Bill stared into his coffee, a defeated slump to his shoulders. “I need a stronger coffee.”
created_at
"2026-02-20T20:12:39.857Z"
2/20/2026, 8:12:39 PM
model
"google/gemma-3-27b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
200
moderation_latency
(null)
generation_time
11593
tokens_prompt
93
tokens_completion
769
native_tokens_prompt
75
native_tokens_completion
648
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0002019
router
(null)
provider_responses
0
endpoint_id
"db3af039-8c6e-4b32-bbf5-c858d644d1b5"
id
"chatcmpl-1cf8ca5d0c3040478a22f761eeb2c119"
is_byok
false
latency
200
model_permaslug
"google/gemma-3-27b-it"
provider_name
"Nebius"
status
200
api_type
"completions"
id
"gen-1771618359-V5A0oPJAEyMpbYDT1TsM"
upstream_id
"chatcmpl-1cf8ca5d0c3040478a22f761eeb2c119"
total_cost
0.0002019
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Nebius"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
421
0.00%
Dialogue to Total Word Ratio
Ratio: 80.05%, Deviation: 30.05%
neededClean
false
wordsTotal
421
wordsDialogue
337
0.0000%