NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBGX2EBC108SV3SBXD7FH85
Run Details
Model:
Qwen 3.5 122B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 38s
Cost:
$0.045829
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.3562%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the tactical map spread across the dashboard of the rusted van. It was a crayon drawing of a living room, annotated in aggressive Sharpie strokes. She tapped a specific corner marked with a star. The target was the pile under the tree, specifically the ones wrapped in silver foil. High resale value on the black market. Bill stared at the drawing, his face pale in the dim light. It was a six-year-old’s birthday party. They were stealing toys. He questioned the morality of it, but Molly was already checking her watch. The party started at two. They went in at one-fifty. That was peak gift opening, when the adrenaline was highest. Parents would be distracted by cake and balloons. They wouldn't notice the missing boxes until the singing started. "The target is the pile under the tree," she said. "Silver foil. High value." Bill sighed, rubbing his temples. "It’s a kid's party, Molly. We’re stealing toys." "We’re redistributing wealth," she corrected, grabbing the duffel bag from the back seat. "From the rich kid to us." "That kid is named Timmy. He has a pony." "Exactly. He doesn't need a PS5. We do." Bill looked at the gear. Zip ties, lockpicks, and a single rubber chicken. "What about the parents?" "Distraction protocol Alpha," Molly said, tapping the map. "You wear the clown suit." "I am not wearing a clown suit." "It’s part of the infiltration. You’ll juggle. They’ll look at the balls. We swipe the boxes." "I can’t juggle." "Then you drop the balls. Chaos is our friend." Bill rubbed his temples again. "This feels wrong. Stealing from children?" "It’s a heist, Bill. Not a charity gala. The only thing we’re stealing is their joy." "That’s a terrible line." "It’s working on me. Get the bag." He handed her the duffel bag stuffed with zip ties and glitter bombs. "Glitter bombs?" "Insurance. If we get caught, they won't find us. They'll find a mess." "You're crazy." "I'm prepared. The party starts at two. We go in at one-fifty." "One-fifty? That’s peak gift opening." "That’s when the adrenaline is highest. They won't notice the missing boxes until the cake." Bill looked at the crayon map again. "What if they have a dog?" "Then you bribe the dog with bacon." "Do I have bacon?" "In the glove box." He opened the glove box. It was empty except for a single rubber chicken. "This is your backup plan?" "It's a decoy. Squeak it loud enough, the dog thinks it's a rival." Molly started the engine. "Ready?" "No." "Good. Fear keeps you sharp." She pulled out onto the street. "Remember, no touching the cake. Cake is sticky." "Noted." "And don't let the kid hug you. It ruins the stealth." "I'll try." "Try harder. This is our big score." "It's a kid's party, Molly." "It's a score. Don't get sentimental." They turned the corner. The house was decorated with balloons. "Look at that," Bill whispered. "So much plastic." "So much profit," Molly replied. "Let's make history." "Let's not." "Too late."
created_at
"2026-02-25T23:07:18.352Z"
2/25/2026, 11:07:18 PM
model
"qwen/qwen3.5-122b-a10b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
553
moderation_latency
(null)
generation_time
97709
tokens_prompt
77
tokens_completion
15340
native_tokens_prompt
76
native_tokens_completion
14312
native_tokens_completion_images
(null)
native_tokens_reasoning
13551
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0458288
router
(null)
provider_responses
0
endpoint_id
"33d466d1-d2f7-45ed-8ff3-fda7d2e32351"
id
"chatcmpl-90e5fae9-cb4b-9aca-a1e3-9733684d6157"
is_byok
false
latency
553
model_permaslug
"qwen/qwen3.5-122b-a10b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772060838-CX1z4OIAh6UH1IymiqDX"
upstream_id
"chatcmpl-90e5fae9-cb4b-9aca-a1e3-9733684d6157"
total_cost
0.0458288
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
506
0.00%
Dialogue to Total Word Ratio
Ratio: 54.60%, Deviation: 24.60%
neededClean
false
wordsTotal
511
wordsDialogue
279
49.3562%