NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFCXH6D4AVGP476H090T6GK
Run Details
Model:
Stealth: Hunter Alpha
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
18.5s
Cost:
$0.000000
Provider:
Stealth
Evaluation summary
Score
Evaluator
Details Link
20.19%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
10.0948%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The fluorescent light above the kitchen table buzzed like a trapped fly. Molly spread a hand-drawn map across the checkered oilcloth, her red fingernail tapping a crudely drawn bouncy castle. “Operation Sugar Rush kicks off at three p.m. tomorrow. The target: the Parker household. Seventeen children, aged four to seven. Maximum chaos.” Bill leaned back, balancing his chair on two legs. “A kiddie party, Molly? What’s the take? Goody bags with erasers shaped like animals?” “Erasers?” Molly scoffed, pulling a crumpled party supply list from her pocket. “Jenna Parker’s son is turning six. Estimated haul: two dozen wrapped presents, a six-tier rainbow cake, and—this is the key—a piñata stuffed with premium organic candies. We’re talking Belgian chocolate coins.” “You want to rob a piñata.” “I want to liberate its contents. There will be musical chairs, Bill. A moment of supreme distraction. While they’re scrambling for seats, we move in. You’re the lookout.” Bill let his chair drop forward with a thud. “What’s my cover? I can’t just stand there in my usual attire. It’s too menacing.” “Precisely.” Molly slid a paper bag across the table. Bill peered inside and grimaced. “A clown nose? You expect me to wear a clown nose?” “You’ll be ‘Bumble the Bee.’” She produced a headband with drooping antennae. “It’s non-threatening. Parents will just see another hired entertainer. I’ll be ‘The Magician,’ naturally. All black, with this cape.” She shook out a shimmering purple fabric. “What’s my backstory?” Bill asked, putting on the headband. The antennae wobbled pathetically. “You’re new in town. Doing kids’ parties for extra cash. You’re terrified of loud noises, so you’ll flinch whenever they pop a balloon. It’s endearing.” “Alright. So we get in. Then what?” “Once the chaos peaks—the piñata smash is our crescendo—you create a diversion at the snack table. I’ll start ‘clearing the presents for safekeeping.’ Standard magician’s misdirection. I’ll put them in this large, hollowed-out magic hat.” Bill stared at the top hat on the counter. “It’s glittery.” “Exactly. It’s part of the act. We load the cake into a cooler marked ‘Magic Supplies,’ and we’re out before they start the thank-you notes.” Bill stood and practiced a clumsy, buzzing dance in the middle of the kitchen. “This is humiliating. You know that, right?” Molly smiled, tying her cape. “Bill, we’re not just thieves. We’re memory-makers. They’ll talk about the day Bumble the Bee and the Great Mysterioso made all their presents vanish forever. It’s performance art.” “It’s a felony.” “With better costumes.” She tossed him the clown nose. “Now practice looking scared of balloons. And for heaven’s sake, don’t eat any of the cake until we’re in the getaway car. I saw you at the Henderson job with that strudel.” Bill caught the nose, his antennae drooping. “This is the last time I let you plan a heist based on a Pinterest board.”
created_at
"2026-03-11T21:30:18.713Z"
3/11/2026, 9:30:18 PM
model
"openrouter/hunter-alpha"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1598
moderation_latency
(null)
generation_time
18384
tokens_prompt
77
tokens_completion
1049
native_tokens_prompt
2111
native_tokens_completion
868
native_tokens_completion_images
(null)
native_tokens_reasoning
216
native_tokens_cached
1984
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
(null)
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773264618-ZBjuiZv98K43iN1gCnKl"
upstream_id
"gen-1773264618-ZBjuiZv98K43iN1gCnKl"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Stealth"
Evaluation details
Result
Evaluator
Details
Meta Data
20.19%
Matches word count
n/a
neededClean
false
words
480
0.00%
Dialogue to Total Word Ratio
Ratio: 71.60%, Deviation: 21.60%
neededClean
false
wordsTotal
486
wordsDialogue
348
10.0948%