NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFFFGFP00PACF69YM2CR1ZF
Run Details
Model:
Stealth: Healer Alpha
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
11.7s
Cost:
$0.000000
Provider:
Stealth
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The fluorescent lights of Giggles & Games hummed over the wreckage of a birthday party. Molly spread the floor plan—stolen from the janitor’s closet—on a sticky table covered in confetti. It was a child’s drawing of a castle with a big red ‘X’ on the treasure chest. Perfect. “The main loot is in the Party Room 4. Parents drop the gifts at the door, they go straight into the giant pirate chest. Security is a teenager named Kyle on his phone,” she said, jabbing a finger at the crude sketch. Bill, nervously adjusting his miniature party hat—a mandatory cover—peered over her shoulder. “Molly, I’ve had a thought. It’s a bad one. The good kind, but the bad kind.” “Spit it out, Bill.” “Mascots. They’re everywhere. That gorilla has seen things, Molly. Its plastic eyes… they follow you.” “We’re not here for existential crises,” Molly said, ignoring the gorilla currently deflating in the corner. “Phase one: infiltration. We blend. I’m ‘Party Princess Penelope,’ you’re… ‘Adventure Time Alvin.’ We’re hired entertainment for a rival party. We load the gifts into the branded van under the guise of a prize swap.” “A prize swap,” Bill repeated, sweating. “Right. What if the real party clowns catch on?” “Then we deploy the emergency glitter bomb and run like hell.” Molly unrolled a large sheet of glittering pink paper. “These are the schematics for the glitter bomb. I’ve labeled the fuse and the glitter payload.” Bill squinted. “This is a kindergarten drawing of a unicorn with a dynamite belt.” “Artistic license! The principle is sound. Now, the critical part.” She leaned in, her voice dropping. “The access point. The kitchen has a dumbwaiter that goes to the gift room basement. It’s small. Just the right size for… a package.” “You want me to ride a tiny elevator with three hundred pounds of loot?” “*We* want you to. I’ll be topside, managing the ‘show’ and creating a diversion if Kyle stops texting his friends to look up.” She tapped the plan. “The dumbwaiter rope is old, frayed. It’ll make noise. You need to be silent as a mouse.” “Mice are loud when you step on them,” Bill noted. “Don’t step on the rope, Bill.” She sighed. “Look, the real danger is the parents. They’re vicious. They’ll fight to the death over a duplicate ‘My Singing Fluffykins’ doll. We slip in, we slip out. Simple.” A small figure in a dinosaur costume waddled past their table, trailing a balloon. “My mom says you have to share the frosting,” it mumbled. Molly smiled, a sharp, practiced grin. “Of course, little Rex. Princess Penelope has a special frosting… secret.” The dinosaur shuffled off. Bill exhaled. “You see? We’re in.” Molly rolled up the plan. “Suit up. The heist is at 3:45 PM, right after the cake, when the sugar crash hits and parental vigilance drops to zero.” “And if it doesn’t?” Bill asked, pulling on his Alvin hat. Molly hefted a canister of compressed air and glitter. “Then we create our own crash. Operation Balloon Pop is a go.” Bill looked at the deflating gorilla, then at the purple and pink chaos of the playcenter. “You know,” he said softly, “this is the worst idea we’ve ever had.” Molly beamed, adjusting her tiara. “Exactly. That’s how you know it’ll work. Now go find the dumbwaiter rope. And Bill?” “Yeah?” “Try not to get stuck.”
created_at
"2026-03-11T22:15:04.962Z"
3/11/2026, 10:15:04 PM
model
"openrouter/healer-alpha"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
482
moderation_latency
(null)
generation_time
11615
tokens_prompt
77
tokens_completion
1171
native_tokens_prompt
2147
native_tokens_completion
1010
native_tokens_completion_images
(null)
native_tokens_reasoning
242
native_tokens_cached
2092
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
(null)
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773267304-QPXTdi8gQnqQbEk7GUzP"
upstream_id
"gen-1773267304-QPXTdi8gQnqQbEk7GUzP"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Stealth"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
568
0.00%
Dialogue to Total Word Ratio
Ratio: 68.49%, Deviation: 18.49%
neededClean
false
wordsTotal
568
wordsDialogue
389
0.0000%