NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBKTBCKFEB18228EK6SJ3C6
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
50.6s
Cost:
$0.015053
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
78.82%
Dialogue to Total Word Ratio
(details)
39.4095%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly killed the engine of the black sedan. She turned to the driver's side and looked at Bill. "You hear that?" she asked. "That is the sound of money." "That sounds like a toddler choking on a gummy bear," Bill said, shifting in the passenger seat and pulling his knees up. "I don't like it. I don't like this at all. We should have gone for the jewelry store where the security is high and the toys are low." "They don't have jewelry in the party favor bags," Molly countered, pulling a pair of rubber gloves from the glove compartment. "They have plastic dinosaurs. And I want the dinosaurs. Specifically the ones with the glitter. They shine in the dark." "Why do you want the dinosaurs?" Bill asked, staring out the window at the flashing lights. "They are rare in the collectors market," Molly lied smoothly and adjusted the collar of her coat. "Do you know the going rate It is insane. People pay thousands for vintage toys." "They cost twenty dollars from Walmart." "Exactly. That means they are accessible. We need volume, Bill. We need the whole box. It's all about quantity." Molly looked at the house. The lights were blindingly bright and the grass was perfectly green. "Look at that lawn. It's covered in bounce houses. That is where we go in." "In the bounce house?" Bill asked, horrified and backing away from the window. "Through the vent." "We are not going in through the vent. We are walking in the front door like normal guests." "Normal guests do not carry crowbars." Bill patted his waistband where the metal was cold and hard. "We are undercover," Molly said. "I am a caterer. You are the waiter. We are there to serve cakes and steal presents before anyone notices." "You are wearing a tuxedo." "It's a tuxedo suit for a waiter. It's classic and stylish and fits my budget." "I look like a robot who lost a fight." "You look professional. Now check the timing. The cake cutting is in ten minutes." "Ten minutes?" Bill checked his watch and sighed loudly. "That's it We have ten minutes to steal an entire inventory of toys It's a fast heist. That's the point. Speed kills boredom and it helps the escape." "I can't run fast," Bill said, shaking his head and holding his knees. "My knees click every time I walk." "Just run when I say run. Do not argue with me about your knees. Just run." "What if the parents call the cops" "They won't. They will think you are a security guard hired by the host." "A security guard in a tuxedo" "It adds gravitas to the situation. People fear what they do not understand." Molly opened her door and stepped out into the cool night. "Come on." "Wait, do we have a plan B" Bill asked, his hand on the handle and freezing in fear. "Yes," Molly said. "We run." "That sounds like a terrible plan." "It is our only plan. And it is the best plan we have right now." Molly stepped onto the sidewalk. "Okay, let's do this. On three." "You are counting down" Bill asked. "It keeps us in sync and it builds tension for the moment." "Okay. One. Two. Three." Bill opened the door.
created_at
"2026-02-25T23:58:14.939Z"
2/25/2026, 11:58:14 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
405
moderation_latency
(null)
generation_time
50455
tokens_prompt
77
tokens_completion
7633
native_tokens_prompt
76
native_tokens_completion
7517
native_tokens_completion_images
(null)
native_tokens_reasoning
6780
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.015053
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-b2da5ed8-cce5-9928-96a8-5b0cc59515ce"
is_byok
false
latency
405
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772063894-GM58s9BGBeEZ03mglFMP"
upstream_id
"chatcmpl-b2da5ed8-cce5-9928-96a8-5b0cc59515ce"
total_cost
0.015053
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
550
78.82%
Dialogue to Total Word Ratio
Ratio: 72.18%, Deviation: 2.18%
neededClean
false
wordsTotal
550
wordsDialogue
397
39.4095%