NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-12-01KKFQQ173TC7DPR9TY4EQN0MN
Run Details
Model:
Stealth: Hunter Alpha
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
20.2s
Cost:
$0.000000
Provider:
Stealth
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
0.32%
Dialogue to Total Word Ratio
(details)
43.3495%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The floor of the abandoned warehouse was gritty under Bill’s knees as he unrolled a vast, hand-drawn map of the suburban house. “Alright, here’s the nursery. Peak activity from 2 to 4 PM. Cake-cutting at 2:30 is our golden window.” Molly, peering through a pair of comically oversized binoculars from the window, snorted. “Bill, I’m looking at a fortress of joy. There’s a bouncy castle. A clown. The clown is… juggling. This isn’t a heist, it’s a humanitarian mission to remove plastic junk from circulation.” “Focus, Molly. The target is the Present Table. By the fireplace. Estimated haul: thirty to forty gaudy, ribbon-wrapped units.” “What’s our ingress?” “Primary entry is the back door, by the snack table. Mom will be deep in conversation with another mom about organic juice boxes. It’s our best cover.” “I hate juice box talk. It’s so sincere. What about the kids?” “Contained. The bouncy castle is a magnificent prison of inflated vinyl. Their screams of joy will mask our… *acquisitions*.” Molly lowered the binoculars. “You’ve thought about this.” “I’ve been thinking about it since the invitation arrived. It was glittery, Molly. It got in my eyebrow.” “So, the snatch-and-grab. Do we have a bag?” Bill produced a large, cheerful-looking gift bag, bright pink with a cartoon dinosaur on it. “Camouflage.” Molly stared at it. “You’re a monster. It’s perfect. What’s our exfil?” “Back through the kitchen, into the garage, and out the side door. Our getaway vehicle is parked two houses down.” “The minivan with the ‘Baby On Board’ sticker?” “It’s invisible to suspicion. Now, timing is everything. At 2:30, the mom will ring a bell. That’s our cue. Kids swarm the cake table. We move on the presents. We have maybe four minutes before the sugar rush hits and they turn back.” “Countermeasures? The father?” “Currently in a lawn chair, hypnotized by the grill. He’s a non-factor. The clown is our biggest variable. He has a patter, moves between groups. We need to stay out of his line of sight. He might try to make us a balloon animal.” “The horror. So we go in, grab what we can, and vanish. What if a kid sees us?” Bill sighed. “Bribery.” He patted his pocket, which jingled. “I have two dozen mini candy bars. Standard distraction protocol.” “Alright, it’s thin, but it could work. One last thing.” “Name it.” “Why? We’re stealing from six-year-olds. We could just buy this stuff. It’s all plastic dreams and sugar.” Bill met her gaze, his expression deadly serious. “Molly, this isn’t about the presents. It’s about the principle. They took the last parking spot at the grocery store. The mom. With the minivan. She saw me waiting, and she took it. This is about sending a message.” Molly slowly nodded, a grin spreading across her face. “Then let’s go send it. Operation: Tiny Tim’s is a go.” They bumped fists, the pink dinosaur gift bag rustling between them.
created_at
"2026-03-12T00:39:00.08Z"
3/12/2026, 12:39:00 AM
model
"openrouter/hunter-alpha"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1488
moderation_latency
(null)
generation_time
20090
tokens_prompt
77
tokens_completion
1064
native_tokens_prompt
2111
native_tokens_completion
905
native_tokens_completion_images
(null)
native_tokens_reasoning
243
native_tokens_cached
1984
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
(null)
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773275940-xXHeSjNXbR8vvyTDY73y"
upstream_id
"gen-1773275940-xXHeSjNXbR8vvyTDY73y"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Stealth"
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
489
0.32%
Dialogue to Total Word Ratio
Ratio: 80.72%, Deviation: 10.72%
neededClean
false
wordsTotal
498
wordsDialogue
402
43.3495%