NC Bench
Run: 2026-02-25-01KJBJXSD8FGQD1C77DX28DNR3
Run Details
Model: Qwen 3.5 122B
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 2m 56s
Cost: $0.029742
Provider: Alibaba
Evaluation summary
Score      Evaluator
0.00%      Matches word count
0.00%      Dialogue to Total Word Ratio
0.0000%    Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled of stale pizza. Molly tapped the blueprint against her knee. Outside, Happy Birthday drifted through the tinted windows. "You realize this is morally bankrupt," Bill said, adjusting his neon green tie. "We're stealing from children." "We're stealing from the parents," Molly corrected. "The kids think the presents vanished. It's magic." "It's larceny." "It's redistribution." She pointed to the photo. "Security is keyed to motion. Kids run like maniacs. We time it with the dance floor." Bill frowned. "Cake cutting? Parents are alert." "They fight over the last slice. They won't notice ninjas." Molly grinned. "We need the distraction." "I'm not wearing a ninja suit. I'm wearing a clown tie." "It blends in." "With what? The balloons?" "Yes." Bill sighed. "If we get caught, I'm telling them you made me do it." "You're the getaway driver. You're compromised." Molly checked her watch. "Five minutes. Grab the duffel. We go in through the doggy door." "The doggy door?" "It's the only way. Back door is locked." "I can't fit." "You're not a dog. You're a professional." Bill stood up. "This is the stupidest heist." "It's the most profitable." Molly grabbed her mask. "Ready?" "No." "Fear keeps you sharp." They opened the van. Sugar rushed in. Molly stepped out, checking the perimeter. Bill followed, clutching the bag. "If the dog bites me," Bill whispered, "I'm suing." "Stick to the plan. Cake time is now." They moved toward the house. Shadows lengthened against the lawn. Molly signaled the stop. She listened to the laughter. "Go," she mouthed. Bill crawled toward the flap. He hesitated. "Go!" she hissed. He shoved his head through. It got stuck. "I'm stuck!" he whispered loudly. "Wiggle it!" "I'm not a contortionist!" "You're a raccoon!" He wiggled free, landing in the grass. He stood, brushing dirt off his knees. "Okay. We're in." "Don't celebrate." They crept toward the living room. Presents stacked like a colorful fortress. "Look at that," Bill breathed. 
"So much plastic." "Focus. Bag it." They filled the duffel. Music swelled. A child screamed. "Did you hear that?" Bill asked. "Leave it in the van." They zipped the bag. Molly grabbed the handle. "Let's move." They turned to leave, but lights flickered on. A small boy stood in the doorway, holding a plastic sword. Bill stammered a lie about being a relative. The boy shook his head, clearly unconvinced by the clown tie. Bill looked at Molly. Molly looked at Bill. "Run," she said. They sprinted out the back door, leaving the party behind.
created_at: "2026-02-25T23:44:15.831Z" (2/25/2026, 11:44:15 PM)
model: "qwen/qwen3.5-122b-a10b-20260224"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 419
moderation_latency: (null)
generation_time: 79478
tokens_prompt: 77
tokens_completion: 9107
native_tokens_prompt: 76
native_tokens_completion: 9285
native_tokens_completion_images: (null)
native_tokens_reasoning: 8615
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0297424
router: (null)
provider_responses:
  0:
    endpoint_id: "33d466d1-d2f7-45ed-8ff3-fda7d2e32351"
    id: "chatcmpl-8ca66fed-85b6-91c4-bb04-2d47cfa692fd"
    is_byok: false
    latency: 419
    model_permaslug: "qwen/qwen3.5-122b-a10b-20260224"
    provider_name: "Alibaba"
    status: 200
api_type: "completions"
id: "gen-1772063055-pb1GRNxby83uwjVG3y1J"
upstream_id: "chatcmpl-8ca66fed-85b6-91c4-bb04-2d47cfa692fd"
total_cost: 0.0297424
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
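The token fields above show that most of the completion went to reasoning rather than to the visible scene. A minimal sketch of that arithmetic follows. The dictionary and helper names are hypothetical, and it assumes that native_tokens_reasoning is counted as part of native_tokens_completion, which the page does not state explicitly.

```python
# Values copied from the run metadata; the dict layout itself is illustrative.
meta = {
    "native_tokens_completion": 9285,
    "native_tokens_reasoning": 8615,
}


def visible_tokens(m: dict) -> int:
    """Completion tokens left over after subtracting reasoning tokens.

    Assumption: reasoning tokens are a subset of completion tokens.
    """
    return m["native_tokens_completion"] - m["native_tokens_reasoning"]


def reasoning_share(m: dict) -> float:
    """Fraction of completion tokens spent on reasoning."""
    return m["native_tokens_reasoning"] / m["native_tokens_completion"]


print(visible_tokens(meta))            # 670
print(f"{reasoning_share(meta):.2%}")  # 92.78%
```

Under that assumption, roughly 670 tokens correspond to the 419-word response, and the remainder was spent on reasoning.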
Evaluation details
Result     Evaluator                       Details                              Meta Data
0.00%      Matches word count              n/a                                  neededClean: false, words: 419
0.00%      Dialogue to Total Word Ratio    Ratio: 47.97%, Deviation: 17.97%     neededClean: false, wordsTotal: 419, wordsDialogue: 201
0.0000%    Overall
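The reported ratio and deviation follow directly from the word counts: 201 dialogue words out of 419 total, compared against the 30% target in the scenario. The sketch below reproduces those two numbers and shows one plausible way to count dialogue words. The function names are hypothetical, and the rule that dialogue is any text between straight double quotes is an assumption, not the benchmark's documented method. How the evaluator turns a deviation into a score is not shown on this page, so it is not modelled here.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a passage.

    Assumption: dialogue is whatever sits between a pair of straight
    double quotes, and words are whitespace-separated tokens.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio: float, target: float = 0.30) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Reproduce the figures reported in the evaluation details.
reported_ratio = 201 / 419
reported_deviation = deviation_from_target(reported_ratio)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {reported_deviation:.2%}")
# Ratio: 47.97%, Deviation: 17.97%

# Small usage example on a fragment of the response.
sample = 'Molly grinned. "We need the distraction." Bill sighed.'
print(dialogue_ratio(sample))  # (4, 8, 0.5)
```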