NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY7SR6M96CR4ZXBHGQR62M9
Run Details
Model:
Gemma 3 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
24.6s
Cost:
$0.000119
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Okay, Bill, walk me through it again. Slowly. Like you’re explaining it to a golden retriever.” Molly tapped a pen against a meticulously drawn floorplan of ‘Billy Bob’s Birthday Blast’ – a children’s entertainment centre. It featured bouncy castles, a ball pit, and a frankly alarming amount of glitter. Bill sighed, running a hand through his already dishevelled hair. “We go in disguised as…party entertainers. I’m ‘Captain Calico’, the pirate. You’re ‘Princess Penelope’, dispenser of temporary tattoos.” “Captain Calico?” Molly raised an eyebrow. “Seriously? Couldn’t you have gone with, I don’t know, ‘The Shadow’ or ‘Mysterious Mr. Midnight’?” “Look, the costume rental place was having a deal. Pirate was buy-one-get-one-half-off. Princess was extra. Plus, I look surprisingly good in an eye patch.” “You look like you lost a fight with an especially aggressive seagull.” “Details, Molly, details! The point is, we blend. We entertain. While I’m distracting little Timmy with a rousing sea shanty, you’re making a beeline for the gift table.” “And then what? Just…stuff everything into oversized princess pouches?” “No! We’ve got the modified gift bags. They look like regular birthday bags but have reinforced bottoms and…dimensional pockets. I got them from a magic supply store. The guy didn’t ask questions, which was good.” “A magic supply store? Really?” “Don’t judge my sources, Molly. It’s a surprisingly useful place. Anyway, we fill the bags, then we ‘magically’ make the presents disappear during the cake cutting. The chaos will mask our exit.” “‘Magically’? You’re going with ‘magically’?” “What? It’s child-appropriate! We can’t just yell, ‘Prepare to be robbed, tiny humans!’” “Point taken. But what about security?” “The greatest security of all: parental complacency. They’ll be too busy taking pictures and preventing sugar crashes to notice a subtle shift in gift inventory.” “You’re counting on the sugar rush being a distraction?” “It’s a key component, yes. And the bouncy castle. Maximum distraction. Think of it as a tactical buffer.” “Tactical buffer. You’ve really thought this through, haven't you?” “I have. I've spent a week observing this place. They play the ‘Baby Shark’ song approximately seventeen times an hour. It breaks the spirit.” “Sounds…horrific. But effective cover, I suppose. What's the contingency plan if little Timmy decides he wants a treasure map instead of a sea shanty?” Bill groaned. “Don’t even joke about that. I’m terrible at drawing treasure maps. Contingency plan is…improvise. More shanties. Threaten to make him walk the plank. I’m adaptable!” “Just…try not to get glitter on the eye patch, okay? It ruins the effect.” “I’ll do my best. Now, let’s review the escape route. It involves a strategic deployment of bubbles…”
created_at
"2026-02-20T19:18:04.769Z"
2/20/2026, 7:18:04 PM
model
"google/gemma-3-27b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
301
moderation_latency
(null)
generation_time
24350
tokens_prompt
93
tokens_completion
818
native_tokens_prompt
75
native_tokens_completion
705
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001188
router
(null)
provider_responses
0
endpoint_id
"1d5251f0-02fd-484b-bb6c-d346277c8a0d"
is_byok
false
latency
125
model_permaslug
"google/gemma-3-27b-it"
provider_name
"Phala"
status
429
1
endpoint_id
"8f22002c-c045-446f-a1b9-9896133536b8"
id
"chatcmpl-RcMCsbfM4A3Y6YbHzrxpVdWj"
is_byok
false
latency
123
model_permaslug
"google/gemma-3-27b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771615084-EIv7o057lMydLYKOetNB"
upstream_id
"chatcmpl-RcMCsbfM4A3Y6YbHzrxpVdWj"
total_cost
0.0001188
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
439
0.00%
Dialogue to Total Word Ratio
Ratio: 90.89%, Deviation: 20.89%
neededClean
false
wordsTotal
439
wordsDialogue
399
0.0000%