NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBGRJ9XNAMZ6GPG6FMCH5ZW
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 28s
Cost:
$0.035286
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.21%
Matches word count
(details)
92.15%
Dialogue to Total Word Ratio
(details)
46.1812%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van hummed quietly in the driveway across from the suburban mansion. Inside, the glow of a laptop screen illuminated Molly’s focused expression. She adjusted her goggles, which were currently perched on her forehead like a strange hat. Bill sat in the passenger seat, very nervously picking at a loose thread on his tactical vest. The target was a birthday party for a seven-year-old named Kevin. The loot was not gold, not jewels, but a mountain of wrapped boxes stacked against the patio wall. Molly tapped the screen. The security system was a blinking red light on the dashboard. It was a toy. Literally. A plastic remote that controlled the gate. "We are stealing birthday presents," Bill whispered. "This is wrong. Kids get sad." Molly rolled her eyes, grabbing a wrench from the console. "It is about logistics. These are high-end drones and limited edition action figures. The resale value on a pristine Playmobil castle is astronomical in the underground market." Bill looked at the stack of boxes through the windshield. "What about the cake?" Molly explained they would leave the cake but take the plastic, as cake melts while plastic lasts forever. She opened the glove compartment and pulled out a bag of glitter. "For the distraction. We throw this at the security guard. He will be blinded by sparkle." "Sparkle?" Bill asked, horrified. "We are criminals, not clowns." "We are professionals," Molly corrected. "Glitter is psychological warfare. He will be picking it out of his eyebrows for weeks." Bill sighed, adjusting his gloves. He agreed to proceed but insisted he would not return if Kevin cried. "He will not cry," Molly assured him. "His parents will blame the dog." She checked her watch. "The party starts in ten minutes. We need to be inside before the piñata breaks." Bill nodded slowly. "Good," Molly said, opening the door. "Then let us go steal some childhood dreams." She stepped out into the humid air. The lawn was covered in plastic ponies and inflatable bouncy castles. It looked like a fortress of fun. Bill followed, clutching a duffel bag that smelled faintly of old gym socks. They moved silently toward the chain-link fence, boots muffled by the grass. The gate beeped. Molly used the remote. The gate swung open with a cheerful chime. "Cheerful," Bill muttered. "Too cheerful." "It is a party," Molly said. "Everything must be cheerful until the money changes hands." They slipped under the hedge. The laughter of children echoed from the backyard. Molly pointed to the table. Hundreds of gifts waited there. Bill tightened his grip on the bag. He felt a pang of guilt, but then he saw the price tag on a remote-controlled helicopter. The guilt vanished instantly, replaced by the cold thrill of the score. "Ready now?" Molly asked. "Yes, ready," Bill said. They lunged forward.
created_at
"2026-02-25T23:04:50.754Z"
2/25/2026, 11:04:50 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
427
moderation_latency
(null)
generation_time
148234
tokens_prompt
77
tokens_completion
13835
native_tokens_prompt
76
native_tokens_completion
14693
native_tokens_completion_images
(null)
native_tokens_reasoning
14062
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.035286
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-c8966ccc-2881-9bfe-ba86-1c574f9a004e"
is_byok
false
latency
427
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772060690-VhPzn9kS3KE8khX15dWp"
upstream_id
"chatcmpl-c8966ccc-2881-9bfe-ba86-1c574f9a004e"
total_cost
0.035286
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.21%
Matches word count
n/a
neededClean
false
words
472
92.15%
Dialogue to Total Word Ratio
Ratio: 28.72%, Deviation: 1.28%
neededClean
false
wordsTotal
477
wordsDialogue
137
46.1812%