NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK13CHK9611B7M0E9NAAFRS
Run Details
Model:
Qwen 3.5 Plus (2026-02-15)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
30.8s
Cost:
$0.002045
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Why are we doing this?" Bill asked, adjusting his black ski mask which kept sliding over his eyes. "It's a birthday party, Molly. For a six-year-old." "Because it's easy money, Bill," Molly whispered, crouching behind a oversized cardboard cutout of a superhero. "Look at that pile. Wrapping paper everywhere. No alarms, just a clown and a woman blowing up balloons until she turns blue." "A clown," Bill said, his voice trembling slightly. "Clowns are unpredictable. They have huge shoes and squeaky noses. What if he honks at us?" "If he honks, you honk back and keep moving," Molly snapped. "We go in, we grab the Loot Piñata, the giant teddy bear, and whatever is in the box marked 'Do Not Shake'. We are out before the ice cream melts." "The ice cream melts in ten minutes," Bill pointed out. "That is a very tight window. Also, did you see the security? There's a golden retriever wearing a party hat." "A dog?" Molly scoffed. "I can handle a dog. I'll bring a treat. Everyone loves a treat." "What if the treat is gluten-free? Dogs hate gluten-free treats, Molly. It's in the papers." "Stop reading the papers and start watching the perimeter," she hissed. "Okay, here's the plan. You create a distraction." "Me?" Bill squeaked. "Why me? You're the mastermind. Masterminds create distractions." "I need to be near the presents to assess the weight distribution," Molly argued. "You just need to go over there and ask the magician if his rabbit is single." "That is the worst distraction I have ever heard," Bill said. "The magician looks like he hasn't slept in three days. He's going to pull a sword out of his ear and stab me." "Fine," Molly groaned. "Plan B. We wait for the cake cutting. Everyone sings, everyone closes their eyes to make a wish. That's our window." "People don't close their eyes for the whole song," Bill countered. "They peek. Kids always peek to see if anyone else is cheating on the wish." "Then we wear blindfolds too," Molly suggested. "We blend in. We become part of the party." "We are going to walk into a room full of screaming toddlers wearing blindfolds while carrying armfuls of gifts," Bill said slowly. "We will look like very tall, very confused piñatas." "It's genius," Molly insisted. "Nobody suspects the blindfolded giants. Now, do you have the sack?" "I have the sack," Bill said, patting the burlap bag at his feet. "But it smells like onions. Why does our getaway vehicle smell like onions?" "It was the cheapest one at the thrift store," Molly replied. "Focus, Bill. The song is starting. 'Happy Birthday' is our cue." "They're singing off-key," Bill noted nervously. "This is bad luck. Very bad luck for a heist." "Just walk," Molly ordered, standing up and tying a striped napkin over her eyes. "Remember, smile like you belong there. If anyone asks, we're the entertainment." "The entertainment?" Bill whispered, tying his own napkin on crookedly. "What act are we performing? The Great Blunder?" "Shut up and grab the teddy bear," Molly said, stumbling forward toward the table. "And if the dog barks, throw him an onion." "We are going to jail," Bill muttered, shuffling behind her. "We are going to jail for stealing a plastic lightsaber and a gift card to a toy store." "Quiet," Molly commanded. "I hear the clown laughing. That means the coast is clear." "That means the clown knows we're here, Molly!" "Move it, or I leave you with the magician!"
created_at
"2026-02-16T10:49:53.116517+00:00"
2/16/2026, 10:49:53 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
828
moderation_latency
(null)
generation_time
30761
tokens_prompt
77
tokens_completion
898
native_tokens_prompt
78
native_tokens_completion
839
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0020448
router
(null)
provider_responses
0
id
"chatcmpl-ad4150ec-6099-99ad-a44f-671b9b5a6230"
status
200
is_byok
false
latency
828
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771238961-oFcNaAULOocVOFQH7dhL"
upstream_id
"chatcmpl-ad4150ec-6099-99ad-a44f-671b9b5a6230"
total_cost
0.0020448
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
592
0.0440%
Dialogue to Total Word Ratio
Ratio: 82.43%, Deviation: 12.43%
neededClean
false
wordsTotal
592
wordsDialogue
488
0.0220%