NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBKW6CGPDSZXPJBCTASSDMF
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
3m 22s
Cost:
$0.039930
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
91.30%
Dialogue to Total Word Ratio
(details)
95.6496%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the monocle over her left eye, squinting at the blueprints spread across the dashboard of their getaway van. The glow of the streetlights outside reflected off the glossy paper, illuminating the layout of the suburban mansion hosting the birthday bash. Bill sat opposite her, chewing on a toothpick and tapping his knee rhythmically against the floorboard. He looked nervous, which was unusual for a man who had once hotwired a tank in Prague without breaking a sweat. The smell of stale coffee and old vinyl filled the small, cramped space inside the old van. "This is insane, Molly. It is a seven-year-old's party with clowns." Bill said, gesturing wildly at the diagram. "Clowns are distractions. They draw the eye away from the gift table." Molly replied, tapping a specific point on the paper with her pen. Bill sighed, leaning back into the vinyl seat. He rubbed his temples, feeling the heavy weight of the moral dilemma pressing down on his tired, aching shoulders. "Do you think the parents will notice? If we take the Lego set, isn't that a crime?" Bill asked. "It’s not about the Lego. It’s about the principle. We steal from the overly indulgent. The insurance is worth more." Molly said, her voice dropping to a conspiratorial whisper. Bill picked up a stale granola bar from the dashboard and broke it in half, crumbs falling onto the floor mat. "So, what’s the entry point? The front door is guarded by a mom with a taser." Bill asked. "The chimney. It’s wide enough. We drop down, grab the bags, and exit." Molly stated firmly. "The pool is full of water balloons. If we splash, we leave a trail." Bill noted, frowning. "Noise is manageable. We wear these. I bribed the DJ to play heavy metal." Molly interrupted, pulling a pair of noise-canceling earmuffs from her pocket. Bill looked at the earmuffs with suspicion. "Heavy metal? At a kids' party?" "It creates chaos. Chaos is our friend. In the confusion, we vanish." Molly explained, sliding the blueprints back into her leather satchel. Bill nodded slowly, finally accepting the logic. "Okay. But if I get stuck in the chimney, you’re pulling me out." "Deal. Now, let’s go ruin some childhoods completely." Molly said, starting the engine. The van pulled away from the curb, tires crunching on the gravel driveway. Inside, the air smelled of stale coffee and impending felony charges. Molly checked her watch. The clock was ticking. They had exactly twenty minutes before the piñata broke. Bill gripped the door handle, his knuckles white. He wasn't sure he was ready for this. But Molly never failed. She never failed. And tonight, the children of Oak Creek would learn the hard way that some gifts were better left unopened. The engine hummed a low note as they merged onto the highway, heading straight for the suburban nightmare. The city lights blurred in the rearview mirror. They sped away, leaving nothing behind but burnt rubber and a very unhappy birthday.
created_at
"2026-02-25T23:59:15.349Z"
2/25/2026, 11:59:15 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
530
moderation_latency
(null)
generation_time
201941
tokens_prompt
77
tokens_completion
16328
native_tokens_prompt
76
native_tokens_completion
16628
native_tokens_completion_images
(null)
native_tokens_reasoning
15959
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.03993
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-c458f7f3-338f-9288-8f94-881034e785ec"
is_byok
false
latency
530
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772063955-4r8vROgW2tSBSg1dOoiH"
upstream_id
"chatcmpl-c458f7f3-338f-9288-8f94-881034e785ec"
total_cost
0.03993
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
501
91.30%
Dialogue to Total Word Ratio
Ratio: 31.35%, Deviation: 1.35%
neededClean
false
wordsTotal
504
wordsDialogue
158
95.6496%