NC Bench
Run: 2026-02-25-01KJBGS6FNV9NS80ZQ28DF5622
Run Details
Model: Qwen 3.5 35B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 55.2s
Cost: $0.020045
Provider: Alibaba
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly crouched behind the overgrown hydrangeas, adjusting the neon pigtails that covered her ears. Bill fidgeted with the collar of his inflatable dinosaur costume, his knees knocking together loudly under the thick latex material. The party inside the house was in full swing, the bass-heavy thump of nursery rhymes vibrating through the damp garden floorboards. They needed the presents before the cake was cut, before the parents lost their minds over the noise. "The windshields are on the list, right?" Bill whispered. His voice cracked with nervous energy. "No, the windshields are for the getaway," Molly said sharply. She adjusted her gloves. "I meant the presents," Bill said. "The big ones." "Right. The presents. Do not forget the big ones." Molly pulled out a pair of nylon tights and shoved them over Bill's head like a hat. "I look ridiculous," Bill groaned. He sounded like a frog trapped in a tunnel. "You look like a dino," Molly corrected. She was staring through the kitchen window. "I look like a crime scene waiting to happen." "That is the point. Nobody suspects a costume." Inside the house, a giant candle was being lit on top of a chocolate cake. "The candle is lit," Bill said. "That means the main event is starting soon." "We move when the singing stops," Molly instructed. Her eyes were locked on the door. "When everyone turns to the cake, the perimeter becomes loose." "How do we know who is the perimeter?" "Look for the adults. They are the security detail watching the kids." "The security detail is eating chips and ignoring us." "Yes. But they watch the kids. We watch the door." Bill peered through the glass window pane. "There is a piñata." "Perfect," Molly said. "We can use it as cover." "Use it to hit them with?" "Use it to hide behind when the guards come out." "They will think I am the piñata." "Exactly. Confusion is our friend. We exploit it." Molly checked her watch. "Five minutes until the first birthday boy gets jealous of the truck." 
"Jealous of what?" "The other guy with the bigger truck." "We are going in for the truck?" "No, Bill. For the legos. The legos are cash." "Legos are small." "Legos are valuable currency." They stood up and moved toward the back door. The hedge rustled. "I think a dog saw us," Bill whispered. He froze in place. "Is it a guard dog?" "It is a pug." "Pugs are agile. They are fast." "And loud." "Perfect," Molly said. "Create a distraction." "I will not throw a dog." "Just bark. I will throw a toy cat." "We are using a cat?" "Not a cat. A toy cat." "That is not a cat." "That is a decoy." They crouched again. The music changed to a faster tempo. "It is time," Molly said. "I am ready." "Do not trip on the lawn." "I will not trip." "If you trip, you trip." "I am very careful." "Bill, you are not." They moved toward the back door. "The lock is electronic," Molly noted. She took out a hairpin. "Can you pick it?" "I have a hairpin." "That is all you have?" "It is a magic hairpin." "It is plastic." "It is magic plastic." The door clicked. "We are in." "Now what?" "Now we grab the stack." "The stack is the tree." "The tree is the loot." "The tree is fake." "The presents are real." They slid under the table. "There is a child here," Bill whispered. "The one in the blue hat." "He sees us." "He sees us looking." "He does not speak." "He is a good child." "We need him to be quiet." "Shh." They began to pack the bags with the toys. "This is heavy," Bill said. "It is gold." "It is a stuffed bear." "It is gold." "We leave the bear." "No. Bears are rare." "Rares are heavy." "We carry the weight." "I drop the bag." "You did not drop it." "I almost dropped it." "We have enough," Molly said. She pulled the hood up. "That is the van." "Let us go." "Wait." "Why are we waiting?" "For the music." "It is loud." "It is our cover." "Okay." "We move." They ran for the door. "Did you get the bag?" "I have the bag." "Do not look back." "I am looking back." "Look forward." "I look forward." "Good." 
They reached the van. "Open the door." "It is locked." "Use the hairpin." "It is the same hairpin." "It is magic." "It is plastic." "Open it." "Okay." "In we go." "We are in." "Safe." "For now." "We need to sell." "We will sell." "Tomorrow?" "Maybe." "Okay." "Good night." "Good night."
created_at
"2026-02-25T23:05:11.419Z"
2/25/2026, 11:05:11 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
382
moderation_latency
(null)
generation_time
55115
tokens_prompt
77
tokens_completion
9649
native_tokens_prompt
76
native_tokens_completion
10013
native_tokens_completion_images
(null)
native_tokens_reasoning
8835
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.020045
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-0411d1d6-5b5c-9123-9ff9-0b1f8c1b3f5f"
is_byok
false
latency
382
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772060711-NEDNFWYQyNLKx9q08cFB"
upstream_id
"chatcmpl-0411d1d6-5b5c-9123-9ff9-0b1f8c1b3f5f"
total_cost
0.020045
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
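The token counts in the metadata above allow a few derived figures, such as how much of the completion was spent on reasoning. The following is a minimal sketch, not part of the benchmark itself. The function name `summarize_usage` is hypothetical, and it assumes that `generation_time` is in milliseconds and that `native_tokens_reasoning` is included in `native_tokens_completion`.

```python
def summarize_usage(completion_tokens: int, reasoning_tokens: int,
                    generation_time_ms: int) -> dict:
    """Derive simple usage figures from the run metadata.

    Assumptions (not confirmed by the source): generation_time is in
    milliseconds, and reasoning tokens are a subset of completion tokens.
    """
    visible_tokens = completion_tokens - reasoning_tokens
    reasoning_share = reasoning_tokens / completion_tokens
    tokens_per_second = completion_tokens / (generation_time_ms / 1000)
    return {
        "visible_tokens": visible_tokens,
        "reasoning_share": reasoning_share,
        "tokens_per_second": tokens_per_second,
    }


# Values copied from the metadata fields native_tokens_completion,
# native_tokens_reasoning, and generation_time.
summary = summarize_usage(10013, 8835, 55115)
print(summary["visible_tokens"])              # 1178
print(round(summary["reasoning_share"], 3))   # 0.882
```

Under these assumptions, roughly 88% of the completion tokens were reasoning tokens, leaving 1178 tokens for the visible scene.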
Evaluation details
Matches word count: 0.00%
Details: n/a
Meta Data: neededClean: false, words: 777
Dialogue to Total Word Ratio: 0.00%
Details: Ratio: 67.99%, Deviation: 17.99%
Meta Data: neededClean: false, wordsTotal: 778, wordsDialogue: 529
Overall: 0.0000%
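The reported ratio and deviation follow directly from the word counts: 529 dialogue words out of 778 total is 67.99%, which is 17.99 percentage points above the 50% target. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. It is an illustration only: the function names are hypothetical, the benchmark's actual tokenization and scoring rules are not shown in the source, and counting text inside straight double quotes as dialogue is an assumption.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated words (an assumed definition of 'word')."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed dialogue rule)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = 0.5) -> tuple:
    """Return (ratio %, absolute deviation from target in percentage points)."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return ratio * 100, abs(ratio - target) * 100


# Reproduce the figures reported in the evaluation details.
ratio, deviation = ratio_and_deviation(529, 778)
print(round(ratio, 2), round(deviation, 2))  # 67.99 17.99

# Apply the counting helpers to a short sample.
sample = 'Molly checked her watch. "We move when the singing stops," she said. "Okay."'
print(count_dialogue_words(sample), count_words(sample))  # 7 13
```

How the deviation maps to the 0.00% score is not stated on the page, so the sketch stops at the ratio and deviation.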