NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-26-01KJBM4CZ8P3X4095QPHFMVVHD
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 39s
Cost:
$0.021685
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.16%
Dialogue to Total Word Ratio
(details)
0.0803%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the binoculars, squinting at the house across the street. The inflatable bouncy castle glowed pink under the floodlights. Bill sat on the hood of the van, picking at a thread on his jacket. The night air smelled of cut grass and vanilla frosting. Streetlights flickered overhead, casting long shadows. "Do you see that?" Molly whispered. "That is a Lego Star Wars set. Limited edition." "I see a six-year-old named Timmy eating cake," Bill said, wiping frosting off his chin. "Why are we stealing from a kid's birthday?" "Because the insurance payout on unregistered toys is higher than the loot itself," Molly explained, tapping her tablet. "Plus, the parents are distracted by the clown." "The clown is holding a knife," Bill noted, pointing. "That's a balloon animal knife. It's safe," Molly dismissed. "We need to hit the gift table before the piñata breaks." "Piñatas are a security risk," Bill argued. "Kids swing sticks. That's violence." "It's a distraction," Molly corrected. "When the sugar rush hits, we slip in the back door. You take the drones, I take the dolls." "I don't want dolls," Bill said, shuddering. "I want the remote control cars." "You get the cars, I get the action figures," Molly negotiated. "We split the cash from the gift cards." "What about the parents?" Bill asked, looking nervous. "The dad has a beard like a pirate." "He's on the phone," Molly checked her watch. "He's arguing with his wife about the cake budget. Perfect." "And the mom?" "Taking photos for Instagram," Molly grinned. "She won't notice until the box is empty." "This feels wrong," Bill muttered, standing up. "Stealing from children." "It's not stealing," Molly insisted, zipping up her bag. "It's redistribution of wealth. Kids don't need all this plastic anyway." "Timmy needs that train set," Bill countered. "He's been saving for months." "Timmy will get a new one," Molly said coldly. "We need the money for the getaway van." "The van is already paid for," Bill reminded her. "The van needs new tires," Molly lied. "High-performance tires." "You're lying," Bill accused. "I'm optimizing," Molly corrected. "Ready the grappling hook." "For a suburban lawn?" Bill laughed. "I have a ladder in the trunk." "Use the ladder," Molly sighed. "Just don't drop the presents." "What if the clown sees us?" "Then we give him a balloon," Molly said. "He'll be too busy popping it." "You're crazy," Bill said, opening the van door. "Absolutely crazy." "That's why I'm the boss," Molly replied, climbing in after him. "Drive." The engine hummed as the distant party music swelled. Molly checked the rearview mirror one last time. Bill adjusted his seatbelt with a heavy sigh. The van rolled forward silently. Shadows stretched long across the pavement as they moved. The tires crunched softly on gravel. Molly gripped the steering wheel tight.
created_at
"2026-02-26T00:03:44.238Z"
2/26/2026, 12:03:44 AM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
445
moderation_latency
(null)
generation_time
98520
tokens_prompt
77
tokens_completion
8932
native_tokens_prompt
76
native_tokens_completion
9026
native_tokens_completion_images
(null)
native_tokens_reasoning
8337
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0216852
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-74995884-0a5d-9184-a741-cc1b0e1ecc31"
is_byok
false
latency
445
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772064224-K5Dj3SyzZpJwoMtrZPZl"
upstream_id
"chatcmpl-74995884-0a5d-9184-a741-cc1b0e1ecc31"
total_cost
0.0216852
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
468
0.16%
Dialogue to Total Word Ratio
Ratio: 61.36%, Deviation: 11.36%
neededClean
false
wordsTotal
471
wordsDialogue
289
0.0803%