NC Bench
Run: 2026-03-11-01KKFFDK29HKABHM5QEQPFFY86
Run Details
Model: Stealth: Hunter Alpha
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 21.6s
Cost: $0.000000
Provider: Stealth
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
The blueprint wasn’t so much a blueprint as it was a crayon drawing on a placemat from ‘Pizza Pete’s Fun Time Palace’. Molly had traced it from memory during a ‘reconnaissance’ visit under the guise of an aunt buying a clown. Bill, who had the spatial awareness of a startled badger, squinted at the rainbow-colored legend. “So, the ‘Laser Grid of Eternal Doom’ is the streamers,” he said, tapping a wobbly red line. “And the ‘Pit of No Return’ is the ball pit.” “Correct. And the vault,” Molly said, pointing to a crudely drawn rectangle, “is the present pile in the living room. Our window of opportunity is between the musical statues and the piñata. Chaos will be maximal.” Bill nodded, then frowned. “But what about the ‘Guardian’?” He pointed to a large, smiling stick figure with a triangle for a body. “This ‘Captain Chuckles’.” “Clown. Standard issue. Distract with a honk, then deploy the ‘Squeak-Some’.” She held up a small, battery-operated dog toy. It let out a pathetic wheeze when squeezed. “His conditioning will compel him to locate the source. It’s Pavlovian. Or… Bozo-vian.” “Brilliant.” Bill began packing the oversized duffel bag with the tools of their trade: a pair of scissors, a roll of festive wrapping paper, and a three-pronged claw grabber from the dollar store. “Right. So I’m on crowd control. I’ll create a diversion by the fairy bread.” “You’ll be on *punch bowl sentinel*,” Molly corrected, her voice firm. “The last time you were on ‘crowd control,’ you got into a crying contest with a six-year-old over a balloon poodle. You lost.” “That was a tactical empathy play,” Bill muttered, stowing the squeak toy with reverence. “And his technique was superior. He had the lip-wobble down.” Molly ignored him, unfolding a second placemat. This one had a series of stick-figure portraits. “Targets. We’re hitting the premium loot first. This one,” she circled a figure with a lightning bolt headband, “is ‘The Gamer’. He’ll have the latest handheld. 
This one,” another circle, “is ‘The Baker’. Her parents always go for the giant, age-inappropriate Lego set. Skip anything in pastel wrapping paper—that’s socks or a savings bond. Non-negotiable.” “And the cake?” Bill asked, a hopeful note in his voice. “The cake is a decoy. A glorious, buttercream decoy. We take the presents and leave the cake. It’s our calling card. They’ll be so busy arguing over the last slice of ‘Operation Vanilla Thunder,’ they won’t even notice the haul is gone until we’re three suburbs away.” Bill grinned. “I like it. It’s poetic.” He hefted the duffel bag onto his shoulder. It made a sad, crinkling sound. “Ready to ruin some childhoods?” “We’re not ruining anything,” Molly said, slipping on a pair of novelty glasses with a fake nose and mustache. “We’re introducing them to the harsh, beautiful realities of the free market. Now, repeat the plan.” “Distract the clown, guard the punch, grab the gamer’s gear, ignore the cake, exit via the ‘Fire Exit of Freedom’ which is just the sliding door to the patio.” “And if we get caught?” Bill sighed. “Claim we’re from the Ministry of Fun, here to redistribute joy. Then run really fast.” “Precisely.” Molly adjusted her nose. It was slightly crooked. “Let’s go steal some joy.”
created_at: "2026-03-11T22:14:02.069Z" (3/11/2026, 10:14:02 PM)
model: "openrouter/hunter-alpha"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2005
moderation_latency: (null)
generation_time: 21528
tokens_prompt: 77
tokens_completion: 1161
native_tokens_prompt: 2111
native_tokens_completion: 990
native_tokens_completion_images: (null)
native_tokens_reasoning: 247
native_tokens_cached: 1984
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0
router: (null)
provider_responses: (null)
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773267242-RZt5BNbCeYZrxDLSsVGU"
upstream_id: "gen-1773267242-RZt5BNbCeYZrxDLSsVGU"
total_cost: 0
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Stealth"
Evaluation details
Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean: false, words: 545

Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 61.62%, Deviation: 31.62%
Meta Data: neededClean: false, wordsTotal: 555, wordsDialogue: 342

Overall: 0.0000%
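The reported ratio and deviation can be reproduced from the word counts in the evaluation details. The following is a minimal sketch, not NC Bench's actual evaluator: the function names are hypothetical, the 30% target is taken from the scenario description, and treating the deviation as an absolute difference in percentage points is an assumption. How the deviation maps to the 0.00% score is not shown on this page, so it is not modeled here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return dialogue words as a percentage of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float) -> float:
    """Absolute gap, in percentage points, between ratio and target (assumed)."""
    return abs(ratio_pct - target_pct)


# Values reported for this run: wordsDialogue = 342, wordsTotal = 555.
ratio = dialogue_ratio(342, 555)
# Target of 30% comes from the scenario "Write 500 words with 30% dialogue".
deviation = deviation_from_target(ratio, 30.0)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 61.62%, Deviation: 31.62%"
```

Under these assumptions the output matches the Details field: the response carries roughly twice the requested share of dialogue.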