NC Bench
Run: 2026-03-11-01KKFGS4HJN9S1RRGBK5DFWNDA
Run Details
Model: Stealth: Healer Alpha
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 9.5s
Cost: $0.000000
Provider: Stealth
Evaluation summary
Score     Evaluator
90.48%    Matches word count
0.79%     Dialogue to Total Word Ratio
45.6379%  Overall
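The overall figure looks consistent with an unweighted mean of the two evaluator scores. That is an inference from the displayed numbers, not something the page states. The small gap from 45.6379% would be explained if the page averages unrounded scores. A minimal check, where `overall_score` is a hypothetical helper name:

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores.

    Assumption: the benchmark aggregates this way. The page does not confirm it.
    """
    if not scores:
        raise ValueError("need at least one score")
    return sum(scores) / len(scores)


# Rounded percentages as displayed in the evaluation summary.
displayed_scores = [90.48, 0.79]
estimate = overall_score(displayed_scores)
print(f"estimated overall: {estimate:.3f}% (page shows 45.6379%)")
```

If the evaluators were weighted differently, the estimate would not land this close to the displayed overall value.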
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back booth of the diner smelled like old coffee and desperation. Molly slid a crumpled napkin across the table, its surface covered in a childlike drawing of a house with an exploding roof.
“Alright, listen up,” she said, tapping the napkin. “The target is 42 Maple Street. Saturday, 3 PM. The ‘Birthday Bash for Braden’.”
Bill squinted at the drawing. “That’s a lot of balloons for a heist, Mol.”
“Amateur hour balloons. Helium. Pop one, you look like a fool. Pop fifty, you’ve got a distraction.” She leaned in, her voice dropping to a conspiratorial whisper. “We go in through the garage. The invitation says ‘Bounce House in Backyard.’ That’s our perimeter.”
“A bounce house?” Bill snorted. “Our getaway vehicle’s a Dodge Neon. We’re gonna outrun kids in a bounce house?”
“We’re not outrunning kids, Bill. We’re *becoming* the kids.” Molly’s eyes gleamed. “I’ve acquired two inflatable dinosaur costumes.”
“You what?”
“The T-Rex. Classic. We waddle in through the garage door, which I’ve scoped—the lock is a child’s toy. We blend. We’re large, green, and silent. The parents will think we’re part of the entertainment.”
Bill considered this. “So the plan is to… inflate, waddle, and grab presents?”
“Precisely. The presents are in the sunroom. I’ve got a map. We enter, we grab the sack labeled ‘From Grandma’—that’s where the high-ticket items are—and we exit through the sliding glass door to the driveway.”
“There’ll be parents everywhere. What about Kevin, the overzealous father? He’s got a camera and a personality disorder.”
“We avoid Kevin. We become part of the wallpaper. We’ll make a low, rumbling dinosaur sound if anyone gets too close. It’ll be disarming. Quaint, even.”
“Quaint,” Bill repeated flatly. “Dinosaurs raiding a birthday party. Quaint.”
“You’re overthinking the reptilian angle. Think of the aesthetic. We’re dinosaurs. It’s inherently funny. Who’s going to call the cops on two dinosaurs gently pilfering a few Lego sets and a stuffed bear? They’ll think it’s a bit.”
Bill rubbed his temples. “What’s our exit strategy, Mol? Waddle back to the Neon, still in costume?”
“Exactly. We deflate in the car. The costumes go in the trunk. We’re just two concerned adults leaving a party early, vaguely lumpy and deflated. No one looks twice.”
“I look like a deflated T-Rex, I think people might look.”
“Then don’t deflate all the way. Leave yourself slightly firm. Approachable.”
Bill sighed, picking up his coffee. “This is the worst plan we’ve ever had.”
Molly grinned, snatching back the napkin. “The best ones usually are. Now, about the cake… I hear it’s a chocolate volcano. A tactical dessert diversion could be useful.”
“Mol, we’re stealing from children.”
“We’re stealing from *presents*. The children are merely in the way. It’s not personal.”
Bill shook his head, but a smile tugged at his lips. “Fine. But if my tail gets caught in the car door, I’m blaming you.”
“Deal,” said Molly, already mentally rehearsing the rumble.
created_at: "2026-03-11T22:37:48.991Z"
model: "openrouter/healer-alpha"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 929
moderation_latency: (null)
generation_time: 9432
tokens_prompt: 77
tokens_completion: 1312
native_tokens_prompt: 2147
native_tokens_completion: 1094
native_tokens_completion_images: (null)
native_tokens_reasoning: 455
native_tokens_cached: 2078
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0
router: (null)
provider_responses: (null)
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773268668-SkkG9hvThJO7PVAnOYYn"
upstream_id: "gen-1773268668-SkkG9hvThJO7PVAnOYYn"
total_cost: 0
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Stealth"
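A couple of derived figures can make the raw metadata easier to read. The sketch below assumes `generation_time` is in milliseconds (roughly in line with the 9.5s duration reported for the run) and that `native_tokens_cached` is a subset of `native_tokens_prompt`. Neither assumption is stated on the page, and both function names are hypothetical.

```python
def tokens_per_second(completion_tokens, generation_time_ms):
    """Completion throughput, assuming generation_time is in milliseconds."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


def cached_fraction(cached_tokens, prompt_tokens):
    """Share of prompt tokens served from cache (assumes cached is a subset of prompt)."""
    if prompt_tokens <= 0:
        raise ValueError("prompt_tokens must be positive")
    return cached_tokens / prompt_tokens


# Values copied from the run metadata.
throughput = tokens_per_second(completion_tokens=1094, generation_time_ms=9432)
cache_share = cached_fraction(cached_tokens=2078, prompt_tokens=2147)
print(f"throughput: {throughput:.1f} native tokens/s")
print(f"cached prompt share: {cache_share:.1%}")
```

The native token counts are used here because they come from the provider's own tokenizer. Substituting `tokens_completion` (1312) would give a different throughput.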
Evaluation details
Result    Evaluator                     Details                          Meta Data
90.48%    Matches word count            n/a                              neededClean: false, words: 490
0.79%     Dialogue to Total Word Ratio  Ratio: 79.84%, Deviation: 9.84%  neededClean: false, wordsTotal: 491, wordsDialogue: 392
45.6379%  Overall
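The reported ratio and deviation follow directly from the metadata: 392 dialogue words out of 491 total is 79.84%, which is 9.84 percentage points above the 70% target. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words, by treating text inside curly double quotes as dialogue. The benchmark's actual tokenization and its mapping from deviation to the 0.79% score are not shown on the page, so both functions are illustrative assumptions.

```python
import re


def ratio_and_deviation(words_dialogue, words_total, target_pct=70.0):
    """Return (dialogue ratio in percent, absolute deviation from target in points)."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio = 100.0 * words_dialogue / words_total
    return ratio, abs(ratio - target_pct)


def count_dialogue_words(text):
    """Count (dialogue words, total words) using whitespace splitting.

    Assumption: dialogue is whatever sits between curly double quotes.
    The real evaluator may use a different rule.
    """
    words_total = len(text.split())
    spoken = re.findall(r"“([^”]*)”", text)
    words_dialogue = sum(len(segment.split()) for segment in spoken)
    return words_dialogue, words_total


# Values copied from the evaluation metadata.
ratio, deviation = ratio_and_deviation(392, 491)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

Note that the two evaluators report slightly different totals (490 and 491 words), which suggests they do not share a single word-counting rule.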