NC Bench
Run: 2026-02-25-01KJBDNJXMSQ1XR22RGZVJ1XN7
Run Details
Model: Qwen 3.5 122B
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 58.2s
Cost: $0.032152
Provider: Alibaba
Evaluation summary
Matches word count: 100.00%
Dialogue to Total Word Ratio: 100.00%
Overall: 100.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her turtleneck, feeling ridiculous. The bouncy castle loomed like a pastel monstrosity. Bill checked his watch, sweat beading despite the chill. They weren't targeting diamonds, but a mountain of wrapped boxes filled with plastic junk. It was questionable, yet reselling unopened Lego sets was lucrative. "Are you sure about the piñata?" Bill whispered. Molly shook her head, checking the grappling hook. "We bypass the sugar rush. Stick to the gifts." The music thumped bass-heavy, drowning out the screams of delighted toddlers. Bill pulled a mask over his face, though it was just a sock with eye holes. He looked like a burglar who gave up halfway through. Molly sighed, knowing this was the low point of their criminal careers. Stealing birthday loot felt wrong, but rent was due. She tightened the strap on her duffel bag, ready to descend into the fray. The party began in earnest, balloons floating like trapped souls. They moved toward the back door, shadows lengthening against the suburban brick. Molly gripped the handle, her knuckles white. Bill nudged her, pointing at a sleeping guard dog named Buster. "The dog is asleep," Molly whispered. Bill nodded. "Good." They exchanged a glance before pushing forward.
created_at: "2026-02-25T22:10:47.36Z" (2/25/2026, 10:10:47 PM)
model: "qwen/qwen3.5-122b-a10b-20260224"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 457
moderation_latency: (null)
generation_time: 58040
tokens_prompt: 77
tokens_completion: 7587
native_tokens_prompt: 76
native_tokens_completion: 10038
native_tokens_completion_images: (null)
native_tokens_reasoning: 9760
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.032152
router: (null)
provider_responses: 0
endpoint_id: "33d466d1-d2f7-45ed-8ff3-fda7d2e32351"
id: "chatcmpl-2bb9ff51-7fac-9495-aec6-50fe885312b3"
is_byok: false
latency: 457
model_permaslug: "qwen/qwen3.5-122b-a10b-20260224"
provider_name: "Alibaba"
status: 200
api_type: "completions"
id: "gen-1772057447-XcML5yttbpl1xr9B2SdJ"
upstream_id: "chatcmpl-2bb9ff51-7fac-9495-aec6-50fe885312b3"
total_cost: 0.032152
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
Evaluation details
Matches word count
Result: 100.00%
Details: n/a
Meta Data: neededClean: false, words: 200

Dialogue to Total Word Ratio
Result: 100.00%
Details: Ratio: 9.95%, Deviation: 0.05%
Meta Data: neededClean: false, wordsTotal: 201, wordsDialogue: 20

Overall: 100.0000%
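
The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator follow from the counts in its meta data (20 dialogue words out of 201 total). The block below is a minimal sketch of that arithmetic, not NC Bench's actual evaluator. The function names `dialogue_ratio` and `deviation`, the whitespace tokenization, the rule that dialogue is whatever sits between straight double quotes, and the 10% target (taken from the scenario text) are all assumptions made for illustration.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a passage.

    Assumption: dialogue is any text enclosed in straight double quotes,
    and words are whitespace-separated tokens. The real evaluator may
    tokenize differently.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation(ratio: float, target: float = 0.10) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Reproduce the reported figures from the counts in the meta data.
reported_ratio = 20 / 201
print(f"Ratio: {reported_ratio:.2%}, Deviation: {deviation(reported_ratio):.2%}")
# prints "Ratio: 9.95%, Deviation: 0.05%"
```

The two evaluators report different totals for the same response (200 words versus 201), which suggests they do not count words the same way. The sketch does not try to replicate either count, and how the benchmark maps a deviation onto a percentage score is not shown in this run.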