NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBFN0Y16M2DKDYG8DB5W4H1
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
51.3s
Cost:
$0.016591
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
90.09%
Dialogue to Total Word Ratio
(details)
66.7318%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The yellow van smelled of stale coffee and desperation, mixing with the faint scent of moldy upholstery. Molly tapped her tablet screen, the blue light cutting through the dim interior like a neon sign in a graveyard. Outside, muffled laughter drifted through the chain-link fence, the sound of children screaming over plastic trumpets echoing in the suburban silence. She adjusted her grip on the wheel, knuckles white against the leather. Bill sat in the passenger seat, vibrating with the energy of a caffeinated squirrel. He clutched a backpack filled with zip ties and a map. He looked out the window, squinting at the balloons tied to the fence. The air outside was warm and smelled of chlorine from the neighbor's pool. The engine ticked as it cooled, a small rhythmic sound against the quiet. "Okay, Bill," she whispered. "Focus. This isn't a jewelry store. It's a birthday party." Bill turned, eyes wide behind dark glasses. "Same difference. Value is value." "It is a value of crayons and rings, but sure." "Don't undermine the brand, Molly. It's about the experience." "The experience is going to jail. Tell me I'm not wrong." "You're never wrong. You're just cautious." "Do you have the distraction?" "The clown is on speed dial." "What if he shows up early?" "Then we blend in. I'm already dressed as a balloon animal." Molly stared at him. "You're wearing a balloon suit?" "It's for the infiltration. They won't look twice." "I don't know if I trust a man in latex." "Now, are we ready?" "I need a checklist." "You always need a checklist." "Security protocols. Exit strategy. Escape vehicle." "The van is fast." "The van is loud." "Sound is inevitable." "What about the parents?" "Too busy pretending to eat cupcakes." "That's not comforting." "Trust them. They trust the pizza guy." "We're the gift guy. We bring joy." "We bring handcuffs." "We bring shiny objects." "Okay, moving out." "What if they have alarms?" "It's a plastic castle. They won't notice." "They might have a security camera." "We'll wear a hoodie. Like a ninja." "You look ridiculous." "I look mysterious." "You look like a lost sock." "The party starts in five minutes." "I need the keys." "I have the keys." "Give them to me." "You know the route." "I don't know the route." "Follow the smell of cake." "There is no cake smell." "That's the dog." "Stop talking." "Fine. But if we get caught, I'm telling them it was you." "I knew you were untrustworthy." A bark echoed from the yard, sharp and sudden. Molly sighed and opened the door, letting the warm night air rush in. Bill hopped out, his latex suit squeaking with every movement. They began to sneak across the lawn, shadows stretching long under the flickering string lights. The dog barked again, closer this time. They froze mid-step. Molly reached for her phone. Bill reached for his knife. They stood perfectly still as the dog circled the corner. Molly held her breath until it hurt. Bill tapped her shoulder urgently. She nodded, ready for the next move. They walked together into the dark yard.
created_at
"2026-02-25T22:45:26.091Z"
2/25/2026, 10:45:26 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
441
moderation_latency
(null)
generation_time
51150
tokens_prompt
77
tokens_completion
8082
native_tokens_prompt
76
native_tokens_completion
8286
native_tokens_completion_images
(null)
native_tokens_reasoning
7550
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.016591
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-4bf5f0c3-c57f-9c74-ad52-7b3594f02889"
is_byok
false
latency
441
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772059526-RgUgRkiR8aDhb3F3zBX5"
upstream_id
"chatcmpl-4bf5f0c3-c57f-9c74-ad52-7b3594f02889"
total_cost
0.016591
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
517
90.09%
Dialogue to Total Word Ratio
Ratio: 51.45%, Deviation: 1.45%
neededClean
false
wordsTotal
519
wordsDialogue
267
66.7318%