NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFC21Z45E211QZAXZ6VCDH8
Run Details
Model:
Stealth: Hunter Alpha
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
18.0s
Cost:
$0.000000
Provider:
Stealth
Evaluation summary
Score
Evaluator
Details Link
9.61%
Matches word count
(details)
0.06%
Dialogue to Total Word Ratio
(details)
4.8358%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The plan was spread across the kitchen table, not on blueprints, but on a brightly colored invitation featuring a cartoon dinosaur. A half-eaten toast sat forgotten beside it. “Alright, Bill,” Molly said, tapping the RSVP section. “This is it. The Pickering residence. Saturday, 2 PM. The target is the gift table. We go in, we go out. Simple.” “Simple,” Bill agreed, nodding so vigorously his oversized glasses slid down his nose. He pushed them up. “What’s the haul look like?” “Unknown, but guaranteed. It’s a fifth birthday party for Timothy Pickering. Expect high-yield, low-security items. Stuffed animals, LEGO sets, maybe a scooter if we’re lucky. The parents will be too busy with cake and chaos to notice.” “Chaos is good,” Bill said, rubbing his hands together. “I blend with chaos. What’s my role?” “You’re the distraction. I need you to engage the children. Draw their focus. I was thinking a highly visible, non-threatening character. Maybe you could wear the giant bunny costume from your uncle’s failed Easter promotion.” Bill’s face fell. “It smells faintly of despair and old chocolate. Also, it’s got one working eye. The other’s just a button dangling by a thread. It’s unsettling.” “Perfect,” Molly said, without a hint of irony. “You’ll be a monstrous, half-blind rabbit. They’ll be terrified and fascinated. That buys me sixty seconds at the gift table. I’ll be the caterer’s assistant. Neutral. Invisible.” “What’s the exit strategy?” “Once the goods are secured in the catering van, you cause a final, larger distraction. I recommend triggering the piñata early. A rain of candy will create a stampede for the door. We slip out in the confusion.” “Brilliant,” Bill breathed. Then he frowned. “One question. The invitation says ‘Timmy’s Dino-Mite Adventure.’ Do I… do I need to roar? To stay in character as a terrifying, button-eyed rabbit in a dinosaur-themed party?” Molly considered this. “A low, confused growl might be more authentic. Sell the existential crisis of the costume. But focus. The timeline is critical. We arrive at 1:50. You deploy at 1:55. I move at 1:56. We’re gone by 2:05. Not a second later. Understood?” “Understood,” Bill said, then added, “Do you think they’ll have those little juice boxes? The ones with the character straws?” “This is a heist, Bill, not a catering job. Focus on the presents.” “Right. The presents. But if a juice box presents itself during the distraction phase…” “Fine,” Molly sighed. “One juice box. Now, repeat the plan back to me.” Bill stood, puffing out his chest. “Infiltrate as a psychologically damaged rabbit. Cause a diversion. Allow you to pilfer the goods. Create a candy-based evacuation. Exit. And… one juice box. Optional.” “And the most important rule?” “Don’t get attached to the loot. Especially the stuffed animals. They have eyes, Molly. They watch you.” “Precisely. Now go iron your rabbit fur. We’ve got a party to crash.”
created_at
"2026-03-11T21:15:18.383Z"
3/11/2026, 9:15:18 PM
model
"openrouter/hunter-alpha"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
746
moderation_latency
(null)
generation_time
17905
tokens_prompt
77
tokens_completion
1074
native_tokens_prompt
2111
native_tokens_completion
889
native_tokens_completion_images
(null)
native_tokens_reasoning
252
native_tokens_cached
2048
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
(null)
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773263718-6aEmOVYN0oYpWnYr4Zo7"
upstream_id
"gen-1773263718-6aEmOVYN0oYpWnYr4Zo7"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Stealth"
Evaluation details
Result
Evaluator
Details
Meta Data
9.61%
Matches word count
n/a
neededClean
false
words
478
0.06%
Dialogue to Total Word Ratio
Ratio: 82.14%, Deviation: 12.14%
neededClean
false
wordsTotal
487
wordsDialogue
400
4.8358%