NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-12-01KKHYRKWKY8CV39NX05ZJ67ZC
Run Details
Model:
Stealth: Healer Alpha
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
43.6s
Cost:
$0.000000
Provider:
Stealth
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled of stale coffee and regret. Molly pinched the bridge of her nose, staring at the blueprint spread across the dashboard. It was a child’s drawing of a house, complete with a lopsided sun and stick figures labeled “FAMLY.” “This is intelligence, Bill. Not a crayon sketch.” Bill, wedged against the passenger door with a half-eaten hot dog, shrugged. “My sources are reliable. The target is 4714 Maple Drive. One-story ranch. Unreinforced masonry. The wind could knock it over.” “It’s a birthday party for a six-year-old named Jayden,” Molly said, her voice dangerously quiet. “The ‘present heist’ is your stated objective. Not architectural demolition.” “Details.” Bill waved a dismissive hand, mustard flying. “We go in at 1400 hours. Peak present accumulation. I provide the distraction. You do the acquisition. Simple.” Molly’s eye twitched. “Your proposed distraction is a ‘ruckus.’ You wrote that in the plan. ‘Create ruckus.’ That is not a tactic, Bill. That is a symptom.” “It’s a proven method! Chaos is the thief’s best friend.” “Chaos is a ladder,” Molly muttered, rubbing her temples. “Not a plan. Look. The presents are in the living room. The adults will be in the kitchen and patio. The children will be… everywhere. A roaming, unpredictable hive.” “Hence the ruckus! I’ll be the clown.” Molly’s head snapped up. “The what?” “The clown. I’ve got the suit in the back. Red nose, big shoes, the works. I’ll make balloon animals. They’ll be so mesmerized by my artistic genius they won’t notice you stuffing a PlayStation into a burlap sack.” “That is the single worst idea you’ve ever had. And you once suggested we rob a aquarium using only snorkels.” “It had potential! Okay, fine. New plan. The bouncy castle. It’s right by the side gate. I’ll get in there. Start a… a bounce riot. Total pneumatic chaos. The kids will be so engrossed in the joy of flight they won’t see you slip in through the patio door.” Molly considered this. The bouncy castle was a tactical nightmare, but it was a contained vector. “You would be… bouncing. As a distraction.” “Correct. It’s physical. It’s immediate. It’s fun!” “It’s idiotic. But it might work. You will wear nondescript athletic wear. No clown suit. And for God’s sake, leave the hot dog.” Bill grinned, a flash of unrepentant glee. “Knew you’d see the vision. So, we’re a go? Operation: Bounce & Grab?” Molly sighed, the sound of a thousand heist manuals weeping. “We are a go. But if you pop that castle, I’m leaving you in it. And I’m taking the good presents.” “The good presents are for the marks, Molly.” “Not anymore they’re not.” The plan, such as it was, was set. Bill nodded, already mentally rehearsing his bounce trajectories. Molly stared out at the innocent suburban street, a portrait of grim determination. Somewhere, a six-year-old was blissfully unaware that his LEGO set was about to become the prize in the world’s least dignified felony. At precisely 2:07 PM, Bill, clad in black, did indeed provide a ruckus. It involved a spectacularly ill-advised cartwheel directly into the side of the inflatable fortress, causing it to buckle and sag dramatically. Children shrieked—not in fear, but in delight, assuming it was part of the show. While all eyes were on the flailing, limbs-in-the-air spectacle, Molly, dressed as a teenager retrieving a forgotten phone, slipped through the patio door. The living room was a mountain of brightly wrapped boxes. She worked quickly, stuffing smaller items into a duffel bag. The sound of Bill’s continued, grunting efforts from outside was a strange, rhythmic accompaniment. She was almost clear when a small voice piped up. “Are you Jayden’s mommy?” A child with frosting on her cheek stood in the hallway, holding a sticky juice box. Molly froze. “No. I’m… the present inspector.” “Oh.” The girl nodded, accepting this perfectly. “Is the clown your friend? He’s not very good at bouncing.” “He’s practicing,” Molly said, hefting the now-bulging bag. “Tell Jayden his presents passed inspection.” “Okay.” The girl took a sip. “The balloon dog he made bit me.” “A known hazard. Excuse me.” Molly made it to the side gate just as Bill, panting and victorious, staggered out of the deflating castle. They piled into the van and peeled away, the sounds of the party fading behind them. In the rearview mirror, the lopsided sun on the crayon blueprint seemed to be laughing at them.
created_at
"2026-03-12T21:20:41.26Z"
3/12/2026, 9:20:41 PM
model
"openrouter/healer-alpha"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
22138
moderation_latency
(null)
generation_time
43015
tokens_prompt
77
tokens_completion
1471
native_tokens_prompt
2147
native_tokens_completion
1242
native_tokens_completion_images
(null)
native_tokens_reasoning
235
native_tokens_cached
2078
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
(null)
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773350441-s2EXDye7X76T1NaX7mZq"
upstream_id
"gen-1773350441-s2EXDye7X76T1NaX7mZq"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Stealth"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
739
0.00%
Dialogue to Total Word Ratio
Ratio: 50.40%, Deviation: 20.40%
neededClean
false
wordsTotal
750
wordsDialogue
378
0.0000%