NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFDHBQ699X5DW670SSMC7FG
Run Details
Model:
Stealth: Hunter Alpha
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
16.9s
Cost:
$0.000000
Provider:
Stealth
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the rearview mirror, not to check the road behind them, but to inspect the crease in her black turtleneck. A perfect, razor-sharp line. Beside her, Bill was a study in contrasts, his ill-fitting balaclava pushed up on his forehead like a woolly yarmulke. “The Pinkerton’s Bakery van,” Molly said, her voice a low hum. “Parked at 14 Primrose Lane. I’ve been watching. They deliver the cake at two, stay for coffee, and don’t move the van until five. That’s our window.” Bill squinted at the scrawled map on her lap. “It’s a six-year-old’s party, Mol. Bouncy castle. Pin the tail on the donkey. You really think the Pinkerton’s guy leaves the keys in?” “People are predictable, Bill. Especially after two slices of their own Black Forest gateau.” She tapped a section of the map labeled ‘LIVING ROOM’. “The presents are all here, in a mountain by the fireplace. We go in the back patio door, which I’ve already… loosened. We fill three canvas sacks, we go out the same way, into the van. Sixty seconds.” “Sixty seconds?” Bill grunted. “What about the kids?” “The children,” Molly corrected, “will be in the garden, supervised, attempting to hit a papier-mâché donkey with a plastic bat. The adults will be clustered near the espresso machine, debating private schools. It’s auditory and visual chaos. We’ll be ghosts.” “Ghosts who steal dolls and toy trucks.” Bill pulled his mask down. “My niece wants one of those rainbow-maned unicorns. There’s probably one in that pile.” “Focus, Bill. This is a high-value target. A collector’s edition limited-run superhero headquarters. A miniature, fully battery-operated kitchen set with actual running water. The haul could be worth four figures.” “To who? We gonna fence a play kitchen to the mob?” Molly sighed, a sound like dry silk. “We liquidate online. To discerning parents with nostalgic credit cards. It’s the new frontier.” She pointed a gloved finger at the clock on the dashboard. “Two-oh-five. The van will be there in five minutes. Are you in, or are you going to spend the afternoon wondering what could have been?” Bill looked from her determined face to the map, where she’d labeled the cake ‘Objective Cream Cheese’. He cracked his knuckles. “Alright. But if there’s a rainbow unicorn, it’s mine. Consider it my hazard pay for dealing with… frosting-based security systems.” Molly almost smiled. “Noted. Sacks ready. Let’s go introduce ourselves to the party.”
created_at
"2026-03-11T21:41:08.468Z"
3/11/2026, 9:41:08 PM
model
"openrouter/hunter-alpha"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2231
moderation_latency
(null)
generation_time
16814
tokens_prompt
77
tokens_completion
908
native_tokens_prompt
2111
native_tokens_completion
776
native_tokens_completion_images
(null)
native_tokens_reasoning
234
native_tokens_cached
2048
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
(null)
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773265268-aJVMnamvyEbSJuuUCPyA"
upstream_id
"gen-1773265268-aJVMnamvyEbSJuuUCPyA"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Stealth"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
403
0.00%
Dialogue to Total Word Ratio
Ratio: 71.19%, Deviation: 41.19%
neededClean
false
wordsTotal
413
wordsDialogue
294
0.0000%