NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ6YZXMKDRNQ77BAVV1G13NS
Run Details
Model:
Qwen 3.5 Plus (2026-04-20)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
3m 5s
Cost:
$0.027196
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
48.8138%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill, stop licking the blueprints." Molly snatched the paper. "It's chocolate, Molly. I licked the blueprint of the Louvre once and found a safe deposit key." Bill licked his thumb. "It's not chocolate. It's mud mixed with crayon wax. You're licking dirt, Bill." Molly traced a red marker line across the paper. "We have twenty minutes until the clown arrives." "The clown. That's our biggest variable." Bill shifted his weight. "Sack feels heavy. Did you put the extra bricks in there?" Bill looked at her, his swim goggles crooked. "You can't crack a safe guarded by a whoopee cushion dispenser." "We're not stealing the clown, Bill. We're after the loot." Molly's eyes narrowed. "The mother mentioned premium LEGO sets and limited-edition action figures. This is the motherlode." "It's a seven-year-old's party." Bill scoffed as he adjusted his night-vision goggles which were clearly just painted swim goggles. "The security is a dad named Steve with a whistle and juice boxes." "Steve has the whistle, meaning auditory control." Molly tapped her temple. "Once he realizes the wrapping paper is missing, the scream triggers a meltdown cascade. We need to be gone before the sugar crash hits." "What about the birthday girl? Timmy?" "Tyler. And she's the wildcard." Molly pulled a crumpled photograph from her jacket. "I've reviewed the footage. Tyler sleeps through thunderstorms, but she has a motion sensor linked to a shrieking tea kettle." "A tea kettle? Who installs a shrieking tea kettle?" "A paranoid mother. We go in through the doggy door." Bill stared at the low wooden flap near the fence line. "The doggy door is guarded by a beagle named Mr. Nibbles with no teeth but a heart full of rage." "That's why we're bringing the cheese." Molly produced a wedge of sharp cheddar wrapped in a handkerchief. "High-grade cheddar. I'll distract Mr. Nibbles. You grab the presents." "Bag them?" Bill looked offended, smoothing his black turtleneck. "Molly, there are bows. Ribbons. Leaving a ribbon behind is sloppy." "We are thieves. Grab the bricks." Molly shoved the cheese into his hand. "Check the sock drawer." "Socks? I'm retiring to check for socks?" "Kids leave gifts under the couch. Check your gear. Do you have the laser cutter?" Bill patted his belt, producing a butter knife taped to a roll of silver duct tape. "Close enough." Molly sighed, rubbing her temples. "Noise dampeners?" Bill held up two frayed wool socks. "Genius." Molly pointed toward the glowing backyard. "Here's the play. I'll delay Steve. You circle back. Wait for the balloon segment. Squeaking means move." "What if the clown sees me?" "Make a dog. Blend in. Don't steal the nose." "What if a kid asks?" "Tell them Santa traffic. Anything for four seconds." Molly checked her watch. Bill shifted his gaze to the shadows. "This job feels wrong. Stealing toddlers lacks honor." "Honor doesn't pay." She handed him a mask shaped like a pizza slice. "Ready?" "I was born ready." Bill pulled the pizza mask over his head, his voice muffled. "Hit the presents. Hit the snacks."
created_at
"2026-04-27T07:56:03.014Z"
4/27/2026, 7:56:03 AM
model
"qwen/qwen3.5-plus-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
738
moderation_latency
(null)
generation_time
170475
tokens_prompt
77
tokens_completion
10869
native_tokens_prompt
76
native_tokens_completion
11319
native_tokens_completion_images
(null)
native_tokens_reasoning
10579
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.027196
router
(null)
provider_responses
0
endpoint_id
"8b4a0fbb-6705-4f8d-a7f1-ad18ecc0edbc"
id
"chatcmpl-434da754-bf51-90db-9eb8-48344cc853e3"
is_byok
false
latency
738
model_permaslug
"qwen/qwen3.5-plus-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777276563-bTGQpDRx6a2xfZuAHXLs"
session_id
(null)
api_type
"completions"
id
"gen-1777276563-oO8cZcnl3xA32z8H8kKM"
upstream_id
"chatcmpl-434da754-bf51-90db-9eb8-48344cc853e3"
total_cost
0.027196
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
507
0.00%
Dialogue to Total Word Ratio
Ratio: 43.75%, Deviation: 26.25%
neededClean
false
wordsTotal
512
wordsDialogue
224
48.8138%