NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBDY7HZM1EQ5VS7Q2CTD02N
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 33s
Cost:
$0.022705
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
98.21%
Dialogue to Total Word Ratio
(details)
49.1026%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van idled silently behind the hedge, muffling the engine's hum like a guilty conscience. Molly adjusted her disguise, a wig that looked like a mop of dreadlocks. Bill checked the blueprints, which were actually just a napkin with crayon drawings. The target house was glowing with fairy lights, a beacon of innocence they intended to extinguish. Inside, balloons bobbed against the ceiling, unaware of their impending doom. Molly chewed her lip, the stress of the operation tightening her jaw. This wasn't a bank job; it was a sugar rush heist. The stakes were low, but the potential for sticky fingers was high. "Are you sure about the balloon popper?" Bill whispered, tapping the device on his belt. "It's a distraction, Bill. Not a weapon," Molly hissed back, keeping her eyes on the front door. "What if they ask for ID?" "We're not here for the kids. We're here for the loot." "Lego sets count as loot?" "Anything wrapped in shiny paper counts." "What about the video games?" "Those are high value. Priority one." "What about the bicycles?" "Too big. Leave them for the neighbors." Bill nodded, accepting the logic of shiny paper. He fumbled with a lockpick set that looked more like a set of dental tools. The security system was a simple keypad, set to the birthday boy's age. Molly signaled him to wait. A clown car pulled up, honking a tune that sounded like a dying goose. She winced. The clown was a liability. Clowns were unpredictable variables. She needed a extraction plan. The bags were ready in the trunk, lined with velvet to prevent crushing the electronics. They couldn't afford broken screens. "How long do we have?" Bill asked, glancing at his watch. "Cake time is in ten minutes," Molly replied. "That's when the parents are distracted?" "That's when the parents are taking photos." "Perfect. I'll go for the tech." "I'll handle the toys." "What about the cake?" "Leave it. We don't do consumables." "Even the cupcakes?" "Especially not the cupcakes." "What about the balloons?" "We can sell them as party favors later." Bill stood up, brushing crumbs off his knees. He wasn't sure where the crumbs came from, but he didn't ask. He moved toward the back window, which was unlocked. A mistake by parents. Molly followed, her movements silent. The air smelled of vanilla and impending chaos. They slipped inside, the floorboards creaking under their weight. A child laughed, oblivious to the criminals shadowing the favors. Molly grabbed a bag. Bill grabbed a bag. They were ghosts in a room full of noise. The heist began.
created_at
"2026-02-25T22:15:30.627Z"
2/25/2026, 10:15:30 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
656
moderation_latency
(null)
generation_time
92633
tokens_prompt
77
tokens_completion
8870
native_tokens_prompt
76
native_tokens_completion
9451
native_tokens_completion_images
(null)
native_tokens_reasoning
8855
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0227052
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-036379da-7165-9a5d-81fc-4e9fb510be5a"
is_byok
false
latency
654
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772057730-JeK8PGTJKlc6gEUiZ87G"
upstream_id
"chatcmpl-036379da-7165-9a5d-81fc-4e9fb510be5a"
total_cost
0.0227052
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
432
98.21%
Dialogue to Total Word Ratio
Ratio: 29.40%, Deviation: 0.60%
neededClean
false
wordsTotal
432
wordsDialogue
127
49.1026%