NC Bench
Run: 2026-02-25-01KJBEX3QH30G9YE7101PAMKCE
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 45s
Cost:
$0.026408
Provider:
Alibaba
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the heavy binoculars, squinting at the suburban house glowing with cheap fairy lights. The lawn was a minefield of plastic obstacles and inflatable bouncy castles. "Look at that security system, Bill. It's a plastic dinosaur." Bill popped the hood of the getaway van, revealing a toolbox filled with gum and duct tape. "It's a T-Rex, Mo. It's a deterrent for toddlers, not us." Molly lowered the glasses, wiping sweat from her brow. "We're breaking into a seven-year-old's birthday party. For toys." "High-value assets. Action figures, limited edition dolls, unopened Lego sets." Molly shook her head, pacing the gravel driveway. "The parents are going to notice the missing gifts. The kid is going to cry." "The kid is seven. He'll forget by Tuesday. We're talking about a smash and grab of pure joy." "Ethical? We're stealing a remote control car." "It's a drift car. It's gold." Molly checked her watch, the ticking echoing loudly. "The party starts in ten minutes. The parents will be inside taking photos." "Perfect. We hit the front door while the mom is trying to pose the cake." "We can't use the front door. That's where the dad is guarding the punch bowl." "The punch bowl? It's Kool-Aid, Mo." "It's fruit punch with a lot of sugar. If he spills it, the police come." "Fine. Back door. Through the garden. We avoid the sprinklers." "The sprinklers are timed. Every fifteen minutes." "We have a laser cutter. We cut the sprinkler head." "We don't have a laser cutter. We have a Swiss Army knife and a bag of gummy bears." "Gummy bears are a distraction tool." "They are candy, Bill." "They are currency. We bribe the kid." "The kid is the victim." "The kid is the accomplice. We give him a gummy bear, he tells us where the presents are." Molly stared at him, horrified. "You want to recruit a seven-year-old into a felony." "It's a community outreach program. Now, are you coming or not? The clown is arriving." "The clown?" "The clown is the lookout. 
If he sees us, he honks." Molly grabbed her bag, checking the straps. "If we get caught, I'm blaming you." "Standard procedure. Let's go get some toys."
created_at
"2026-02-25T22:32:22.521Z"
2/25/2026, 10:32:22 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
424
moderation_latency
(null)
generation_time
104924
tokens_prompt
77
tokens_completion
9526
native_tokens_prompt
76
native_tokens_completion
10994
native_tokens_completion_images
(null)
native_tokens_reasoning
10451
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0264084
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-20611f3d-0a71-935d-bb3f-fc00e0b451d8"
is_byok
false
latency
424
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772058742-Oxejegys7A08KPmGJ4DY"
upstream_id
"chatcmpl-20611f3d-0a71-935d-bb3f-fc00e0b451d8"
total_cost
0.0264084
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
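The raw fields above can be cross-checked against the Run Details header. The sketch below is not part of NC Bench; it assumes that `generation_time` is in milliseconds and that `native_tokens_completion` includes the reasoning tokens, neither of which the page states. The helper name `summarize` is hypothetical.

```python
# Values copied from the run metadata above.
meta = {
    "generation_time": 104924,          # assumed to be milliseconds
    "native_tokens_completion": 10994,  # assumed to include reasoning tokens
    "native_tokens_reasoning": 10451,
}


def summarize(meta):
    """Derive a few human-readable figures from the raw metadata."""
    gen_seconds = meta["generation_time"] / 1000
    # Round to whole seconds before splitting into minutes and seconds.
    minutes, seconds = divmod(round(gen_seconds), 60)
    completion = meta["native_tokens_completion"]
    reasoning = meta["native_tokens_reasoning"]
    return {
        "duration": f"{minutes}m {seconds}s",
        # Tokens left over for the visible answer, under the assumption above.
        "visible_tokens": completion - reasoning,
        "reasoning_share": reasoning / completion,
        "tokens_per_second": completion / gen_seconds,
    }


if __name__ == "__main__":
    for key, value in summarize(meta).items():
        print(key, value)
```

Under those assumptions, the rounded generation time matches the 1m 45s shown in Run Details, and roughly 95% of the native completion tokens were spent on reasoning rather than on the visible scene.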
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 367
0.00% | Dialogue to Total Word Ratio | Ratio: 78.28%, Deviation: 28.28% | neededClean: false, wordsTotal: 373, wordsDialogue: 292
0.0000% | Overall
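The reported ratio and deviation can be reproduced from the word counts in the evaluation details. The following is a minimal sketch, not the benchmark's actual evaluator: it assumes the ratio is dialogue words divided by total words, that the deviation is the absolute distance from the 50% target, and that dialogue is whatever sits between straight double quotes. All function names are hypothetical, and how a deviation maps to a score is not shown on the page, so that step is omitted.

```python
import re


def ratio_from_counts(words_dialogue, words_total, target_ratio=0.5):
    """Return (ratio, deviation) for already-counted words."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target_ratio)


def dialogue_stats(text, target_ratio=0.5):
    """Count words by whitespace and treat text inside "..." as dialogue."""
    words_total = len(text.split())
    quoted = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(q.split()) for q in quoted)
    ratio = words_dialogue / words_total if words_total else 0.0
    return words_total, words_dialogue, ratio, abs(ratio - target_ratio)


def word_count_deviation(words, target=500):
    """Relative distance between the counted words and the requested length."""
    return abs(words - target) / target


if __name__ == "__main__":
    ratio, deviation = ratio_from_counts(292, 373)
    print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
    print(f"Word count deviation: {word_count_deviation(367):.1%}")
```

With the page's counts, `ratio_from_counts(292, 373)` gives a ratio of about 78.28% and a deviation of about 28.28 percentage points, matching the reported details. The 367 counted words fall 26.6% short of the 500-word target. The two evaluators report different totals (367 and 373), so their word splitting presumably differs from the plain whitespace split used here.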