NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY8HQTMY84FFPQ6D51P9FPS
Run Details
Model:
Gemma 3 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
24.1s
Cost:
$0.000103
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Right, so let’s go over it again,” Molly said, meticulously sketching a diagram of a bouncy castle on a napkin. “Operation Present Plunder. Phase one: infiltration.” Bill, picking at a chipped Formica tabletop, grunted. “Infiltration. Sounds fancy for ‘walkin’ in’.” “It’s about establishing a *tone*, Bill. A professional tone. We’re not just ‘walkin’ in’, we’re assessing the security perimeter, identifying potential obstacles, and blending seamlessly with the…enthusiastic clientele.” “You mean the screaming kids hopped up on sugar?” “Precisely. Now, the intel suggests glitter. A lot of glitter.” “Glitter? That's the security?” “No, it’s a hazard! It’ll ruin the dark clothing. We’ll need to factor in removal time.” “Removal time? Molly, we're stealing presents, not entering a beauty pageant.” “Details, Bill, details! A professional doesn’t leave a trail. Now, the layout. Bouncy castle near the refreshments, balloon arch by the entrance. Present mountain… strategically placed under the gazebo.” “Strategic? It’s a pile of boxes, Molly.” “A strategically placed pile of boxes! Overflowing with potential riches! Okay, you distract the party host, Brenda. Feign interest in little Timmy's magic trick. Compliment the cake. The usual.” “Compliment the cake? I don't *do* compliments. Especially not about sponge.” “Just say it’s…structurally sound. And then, while Brenda is distracted by your stunning architectural critique, I go for the presents.” “And what if Timmy's magic trick is actually good?" Molly sighed dramatically. “Then you express polite bewilderment. The point is, you occupy Brenda. I acquire the bounty.” “Acquire? You mean stuff in bags?” “Yes! Acquire, obtain, procure… it's all semantics, Bill! Just keep her talking. And for heaven's sake, don't mention your taxidermy collection.” “What? Why not? Brenda likes unusual hobbies!” “Because last time you talked about your squirrel collection at a garden party, you got banned from the neighborhood bake sale! Relevancy, Bill, relevancy!” “Okay, okay. No squirrels. Compliment the structurally sound cake. Feign bewilderment at magic. And avoid glitter. Got it.” Molly tapped the napkin with her pen. “And remember the exit strategy. Through the back garden, over the fence, rendezvous at the van. No unnecessary acrobatics.” “Acrobatics are my specialty!” “Absolutely not! We’re after presents, not broken bones. This has to be clean, Bill. Clean and glitter-free."
created_at
"2026-02-20T19:31:10.817Z"
2/20/2026, 7:31:10 PM
model
"google/gemma-3-27b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
616
moderation_latency
(null)
generation_time
23863
tokens_prompt
93
tokens_completion
705
native_tokens_prompt
75
native_tokens_completion
607
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00010312
router
(null)
provider_responses
0
endpoint_id
"8f22002c-c045-446f-a1b9-9896133536b8"
id
"chatcmpl-RR00RzeDvQfKL3K2ntENi3oX"
is_byok
false
latency
56
model_permaslug
"google/gemma-3-27b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771615870-wcUXnPgVU4qN5LZHvsUm"
upstream_id
"chatcmpl-RR00RzeDvQfKL3K2ntENi3oX"
total_cost
0.00010312
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
365
0.00%
Dialogue to Total Word Ratio
Ratio: 91.51%, Deviation: 21.51%
neededClean
false
wordsTotal
365
wordsDialogue
334
0.0000%