NC Bench
Run: 2026-02-20-01KHYAFM1JQ2JEJW09B4QXAXPR
Run Details
Model:
Gemma 3 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
23.6s
Cost:
$0.000121
Provider:
DeepInfra
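The Duration and Cost figures above can be cross-checked against the raw response metadata recorded for this run. The sketch below is only an illustration: it assumes `generation_time` is reported in milliseconds and that the displayed cost is the `usage` value rounded to six decimal places. Neither assumption is stated on this page. The helper names `tokens_per_second` and `format_cost` are hypothetical.

```python
# Values copied from this run's response metadata.
generation_time_ms = 23531        # ASSUMPTION: unit is milliseconds
native_tokens_completion = 716
usage_usd = 0.00012056


def tokens_per_second(tokens: int, elapsed_ms: float) -> float:
    """Completion throughput in tokens per second."""
    return tokens / (elapsed_ms / 1000.0)


def format_cost(cost_usd: float, places: int = 6) -> str:
    """Render a cost the way the Run Details panel appears to (assumed rounding)."""
    return f"${cost_usd:.{places}f}"


throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
print(f"Throughput: {throughput:.1f} tokens/s")  # Throughput: 30.4 tokens/s
print(f"Cost: {format_cost(usage_usd)}")         # Cost: $0.000121
```

Under these assumptions the rounded cost matches the $0.000121 shown above, and the generation time of roughly 23.5 s is close to the reported 23.6 s duration.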
Evaluation summary
Score    Evaluator
0.00%    Matches word count
0.00%    Dialogue to Total Word Ratio
0.0000%  Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The chipped Formica of the diner booth felt cold under Molly’s elbows. She’d chosen this place – ‘Bernie’s Belly Busters’ – specifically for its anonymity and the smell of regret. Bill, across from her, was meticulously dismantling a sugar packet, constructing a miniature leaning tower. “Right, so let’s go over it again,” Molly said, stirring her lukewarm coffee. “Eight-year-old Timmy. Superhero themed party. Parents gone for ‘a quick run to the store for more ice cream’ – classic.” “Ice cream. The weakness of all parental security systems,” Bill murmured, adding another sugar crystal to his tower. It wobbled. “Structural integrity is key, Molly. Just like this operation.” “Just like hoarding all the ‘Spider-Man’ and ‘Batman’ paraphernalia is key, you mean.” Bill huffed. “It’s not hoarding. It’s…curating. A collection. And a lucrative one, I might add. Limited edition action figures are climbing in value, you know.” “Sure. And Timmy’s ‘My Little Pony’ collection is going to fund our retirement. Focus, Bill.” “Okay, okay. Timmy’s party. The plan? Infiltration disguised as…what were we thinking? Clowns?” Molly shuddered. “Absolutely not. I did a stint as a birthday party princess and I am still in therapy. No face paint. Too messy. Too many sticky fingers.” “Fine, fine. What about…balloon artists?” “Even worse. Imagine trying to discreetly stuff a ‘Hulk’ action figure into your balloon animal bag. It screams ‘criminal intent’.” “Okay, a valid point. We need subtle. How about…highly enthusiastic uncles?” Molly raised an eyebrow. "Enthusiastic uncles?” “Yeah! You know, loudly complimenting the cake, marveling at the decorations, generally being overbearing and distracting. People *expect* uncles to be embarrassing.” “Hmm,” Molly tapped her spoon against the mug. “It could work. You could be Uncle Barry, prone to terrible magic tricks. I’ll be Aunt Carol, forever asking everyone about their future aspirations.” “Aunt Carol? Seriously?” Bill protested. 
“That’s…depressing.” “It’s distracting! No one will suspect Aunt Carol, earnestly inquiring about a seven-year-old’s career path. It's genius.” “Right. And the presents? Timmy’s parents reportedly splurged. High-end stuff.” “That’s where your tower-building skills come into play. We’ve got the laundry bags concealed in the oversized fruit basket. Quiet, efficient, maximum capacity.” “I’m more of a structural engineer than a bag-stuff engineer,” Bill grumbled. “And what about security? Any dogs? Nosy neighbors?” “Mrs. Henderson next door is a busybody, but she’s allegedly hard of hearing. And no dogs, thankfully. Just a lot of sugar-fueled chaos. Perfect cover.” Molly paused, a glint in her eye. "Just imagine, Bill. A mountain of unopened joy, just waiting for us." Bill, finally abandoning his sugar tower, grinned. “A mountain of profit, you mean. Let's go make Timmy’s birthday a little… lighter.”
created_at
"2026-02-20T20:04:58.557Z"
2/20/2026, 8:04:58 PM
model
"google/gemma-3-27b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1058
moderation_latency
(null)
generation_time
23531
tokens_prompt
93
tokens_completion
830
native_tokens_prompt
75
native_tokens_completion
716
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00012056
router
(null)
provider_responses
0
endpoint_id
"8f22002c-c045-446f-a1b9-9896133536b8"
id
"chatcmpl-RqSDSb9i8kxg0T8sBe8J7uuT"
is_byok
false
latency
113
model_permaslug
"google/gemma-3-27b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771617898-8mCaJdrJZ1CVm2MqrEj1"
upstream_id
"chatcmpl-RqSDSb9i8kxg0T8sBe8J7uuT"
total_cost
0.00012056
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result   Evaluator                     Details                            Meta Data
0.00%    Matches word count            n/a                                neededClean: false; words: 441
0.00%    Dialogue to Total Word Ratio  Ratio: 81.63%, Deviation: 31.63%   neededClean: false; wordsTotal: 441; wordsDialogue: 360
0.0000%  Overall
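The reported ratio and deviation can be reproduced from the two word counts in the metadata. This is a minimal sketch, assuming the ratio is dialogue words divided by total words and the deviation is the absolute distance from the 50% target named in the scenario. The function name `dialogue_ratio` is hypothetical, and how the evaluator maps a deviation to a score is not shown on this page, so that step is left out.

```python
def dialogue_ratio(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio, deviation) for a dialogue-share check.

    ASSUMPTION: deviation is the absolute difference from the target share.
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target)
    return ratio, deviation


# Counts taken from the evaluation metadata for this run.
ratio, deviation = dialogue_ratio(words_dialogue=360, words_total=441)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 81.63%, Deviation: 31.63%

# Word-count check: the scenario asked for 500 words, the response has 441.
shortfall = 500 - 441
print(f"Shortfall: {shortfall} words ({shortfall / 500:.1%} of target)")
# Shortfall: 59 words (11.8% of target)
```

Both evaluators scored 0.00% here: the response runs 59 words short of the target, and its dialogue share is well above the requested 50%.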