NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ70JFFQGJGJXB2X1Y0KD3PT
Run Details
Model:
Qwen 3.5 Plus (2026-04-20)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 48s
Cost:
$0.017157
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
9.61%
Matches word count
(details)
99.14%
Dialogue to Total Word Ratio
(details)
54.3719%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the schematic on the scarred laminate surface of their one-room kitchenette. She weighed the blueprint corners with a rusted wrench and juice boxes. Red marker circled the gazebo, the deposit point for the loot of Timmy's fifth birthday. Bill sat across from her, a man defeated by geometry, wrestling his left boot into a pair of roller skates that were three sizes too small and covered in questionable adhesive fur. Molly adjusted her egg-shell spectacles and traced the route with a trembling finger. The plan relied on precise timing. At 2:14 PM, the hired clown would deploy the balloon animals, creating a visual blockade near the snack table. This distraction window allowed Bill to infiltrate the gift zone via the ventilation shaft of the inflatable castle. Molly would watch from the treehouse, monitoring anxious mothers via a jammed baby unit. "You're sweating," Molly observed, her voice tight. The air in the room was thick with the scent of desperation and cheap cologne. Bill's face was a mask of panic, his eyes darting toward the window as if expecting a SWAT team of kindergarteners to breach the glass. He fumbled with the zipper of his primary gear: a burlap sack stitched with the words 'Property of Timmy' in glitter glue. "I can't do the skates," Bill managed, his voice cracking as he attempted to stand and immediately wobbled, grabbing the fridge for support. The appliance rattled violently, sending a cascade of magnets and urgent reminders about dentist appointments clattering to the linoleum. "The floor at the party is polished concrete near the buffet. I'll be a missile. A furry missile of failure. Plus, the wheels squeak. I can hear them from here. The squeak of doom." Molly slammed her palm onto the table, making the juice boxes jump. "We are not retreating because of traction issues. We spent four hundred bucks on those skates, money we stole from the jar labeled 'New Socks'. You will wear them. You will glide past the bounce house. You will fill the sack. And you will not stop for glitter. The glitter is a trap." Bill stared at the skates, then at the sack, then at Molly's unblinking glare. The weight of the heist pressed down on him, heavier than the guilt of stealing a tricycle he'd committed at age twelve. He sighed, a long, rattling exhalation that seemed to deflate his entire posture, causing the fur on his boots to ripple like grass in a gentle, pathetic breeze. "Fine," Bill whispered. "But if I slip and take out the birthday boy, I'm blaming the glue. And if the glue holds and I succeed, I want the Nintendo Switch. That's the deal." Molly nodded, satisfied, and picked up a marker to add a note about emergency extraction via the slide. "Noted. Now tie your laces. We have a party to rob."
created_at
"2026-04-27T08:23:25.438Z"
4/27/2026, 8:23:25 AM
model
"qwen/qwen3.5-plus-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
714
moderation_latency
(null)
generation_time
107685
tokens_prompt
77
tokens_completion
7497
native_tokens_prompt
76
native_tokens_completion
7136
native_tokens_completion_images
(null)
native_tokens_reasoning
6510
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0171568
router
(null)
provider_responses
0
endpoint_id
"8b4a0fbb-6705-4f8d-a7f1-ad18ecc0edbc"
id
"chatcmpl-32d75f36-0303-9b21-ae83-4a98f20cae23"
is_byok
false
latency
714
model_permaslug
"qwen/qwen3.5-plus-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777278205-y6KqseuSYpQsn1zKHXhE"
session_id
(null)
api_type
"completions"
id
"gen-1777278205-unP8xiOG1Fa27IQ3NTOc"
upstream_id
"chatcmpl-32d75f36-0303-9b21-ae83-4a98f20cae23"
total_cost
0.0171568
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
9.61%
Matches word count
n/a
neededClean
false
words
478
99.14%
Dialogue to Total Word Ratio
Ratio: 29.58%, Deviation: 0.42%
neededClean
false
wordsTotal
480
wordsDialogue
142
54.3719%