NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFGYWS85NEHN3MAYCXHB7NJ
Run Details
Model:
Stealth: Hunter Alpha
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
11.1s
Cost:
$0.000000
Provider:
Stealth
Evaluation summary
Score
Evaluator
Details Link
2.01%
Matches word count
(details)
95.37%
Dialogue to Total Word Ratio
(details)
48.6898%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The bouncy castle sagged in the late afternoon sun, its deflated peaks like a toppled castle in a kingdom of sugar. Molly adjusted her binoculars. “Seventeen kids. One clown. That’s the real threat. Clowns are unpredictable.” “Focus on the loot table,” Bill muttered, scribbling on a napkin. “Presents are the priority. We go in through the inflatable dragon’s mouth.” “The dragon’s mouth is a cliché. The real ingress point is the ‘Happy Birthday’ banner. Snip two ropes, it collapses, creates chaos. Classic diversion.” “Your diversions always involve structural damage. Remember the ‘grandma fountain’ incident?” “That was an artistic statement. Now, the birthday boy is obsessed with dinosaurs. The big green gift with the roaring sound box? That’s our primary. The rest is collateral.” “Collateral presents? We’re stealing from kids, Molly.” “We’re *liberating* excess materialism. Think of it as public service. At 3:07 PM, the clown will attempt to make a poodle balloon. That’s his moment of deepest concentration. We move then.” “What’s the exit strategy?” “The same way we came in. Through the dragon’s mouth. I lied about it being a cliché. It’s iconic for a reason.” “You’re the worst partner I’ve ever had.” “And yet, here we are. Ninety seconds to showtime. Try to look like a parent having a crisis. Which, historically, isn’t a stretch for you.” Bill sighed. “Just get the dinosaur.”
created_at
"2026-03-11T22:40:57.653Z"
3/11/2026, 10:40:57 PM
model
"openrouter/hunter-alpha"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1175
moderation_latency
(null)
generation_time
11058
tokens_prompt
77
tokens_completion
635
native_tokens_prompt
2111
native_tokens_completion
508
native_tokens_completion_images
(null)
native_tokens_reasoning
230
native_tokens_cached
1984
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
(null)
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773268857-wX123IKJTg0Fef8UURZJ"
upstream_id
"gen-1773268857-wX123IKJTg0Fef8UURZJ"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Stealth"
Evaluation details
Result
Evaluator
Details
Meta Data
2.01%
Matches word count
n/a
neededClean
false
words
225
95.37%
Dialogue to Total Word Ratio
Ratio: 85.33%, Deviation: 4.67%
neededClean
false
wordsTotal
225
wordsDialogue
192
48.6898%