NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBEX4QERA44BPV596CCGG8J
Run Details
Model:
Qwen 3.5 122B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
51.5s
Cost:
$0.022766
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.01%
Dialogue to Total Word Ratio
(details)
0.0035%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly stared at the blueprints spread across the hood of the rusted van. Bill leaned against the bumper, chewing on a toothpick. The sun beat down on the pavement, baking the asphalt until it shimmered in the heat. Dust motes danced in the air around them. Molly smoothed the wrinkled paper with a sharp fingernail. She traced the route with her finger. "This is insane," Bill said, wiping sweat from his forehead. "It is not insane," Molly replied, tapping the paper with a manicured nail. "It is a birthday party at the community center. Easy in, easy out." "Easy out?" Bill raised an eyebrow, squinting against the glare. "There are forty toddlers and three overprotective mothers." "We have the distraction," Molly said, folding the map carefully. "The piñata?" "The piñata," she confirmed. "We rig it to drop the candy early. Chaos ensues. We grab the pile." "The pile?" Bill laughed, a dry, nervous sound. "You mean the unwrapped gifts?" "Exactly. We load them into the duffel bags." "What about the parents?" "They will be too busy stopping the sugar rush to notice us." Bill shook his head slowly. "Molly, this feels wrong. Stealing from children?" "It is not stealing," she argued, crossing her arms. "It is redistribution." "Redistribution?" "From the rich kids to us." "We are not rich," Bill pointed out. "We could be," Molly countered. "Think of the resale value on those LEGO sets." "Do you hear yourself?" "I hear profit." Bill sighed, looking at the sky. "What is the exit strategy?" "Back door," Molly said. "Through the kitchen." "The kitchen?" "Yes. The cake is there." "Are we eating the cake?" "Maybe." "This is getting weird." "Focus, Bill. We go in at four PM." "When the music is loud." "Precisely." "And the costumes?" "We wear the clown outfits." "Clowns?" "They blend in." "Nobody likes clowns." "Exactly. They ignore them." Bill rubbed his temples. "I hate this plan." "But you love the money." "I love the money." "Then we have a deal?" "We have a deal." Molly rolled up the blueprints. "Meet me at the van in ten." "Ten," Bill agreed. "Do not be late." "I am never late." "You were late last time." "That was traffic." "It was a parking lot." "Details." "Get in the van." "Fine." Molly opened the driver's side door. Bill climbed into the passenger seat. The engine roared to life. "One more thing," Bill said. "What?" "If we get caught?" "We run." "To where?" "The clown car." "That is a joke." "It is a van." "Right." "Drive." "Okay."
created_at
"2026-02-25T22:32:23.541Z"
2/25/2026, 10:32:23 PM
model
"qwen/qwen3.5-122b-a10b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
473
moderation_latency
(null)
generation_time
51430
tokens_prompt
77
tokens_completion
7321
native_tokens_prompt
76
native_tokens_completion
7105
native_tokens_completion_images
(null)
native_tokens_reasoning
6439
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0227664
router
(null)
provider_responses
0
endpoint_id
"33d466d1-d2f7-45ed-8ff3-fda7d2e32351"
id
"chatcmpl-e6d41c3e-816e-9db2-8149-542eef673d87"
is_byok
false
latency
473
model_permaslug
"qwen/qwen3.5-122b-a10b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772058743-prcFWpOzQeEyPRI3HkUP"
upstream_id
"chatcmpl-e6d41c3e-816e-9db2-8149-542eef673d87"
total_cost
0.0227664
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
423
0.01%
Dialogue to Total Word Ratio
Ratio: 63.83%, Deviation: 13.83%
neededClean
false
wordsTotal
423
wordsDialogue
270
0.0035%