NC Bench
Run: 2026-04-27-01KQ700E27C01WD0N08BBHFS0V
Run Details
Model:
Qwen 3.5 Plus (2026-04-20)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 45s
Cost:
$0.016773
Provider:
Alibaba
Evaluation summary
Score      Evaluator
0.00%      Matches word count
98.05%     Dialogue to Total Word Ratio
49.0243%   Overall
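The overall figure is consistent with an unweighted mean of the two evaluator scores. The sketch below checks that arithmetic; the averaging rule is an assumption inferred from the numbers on this page, not something the page states, and the small gap from 49.0243% presumably comes from the 98.05% score being displayed in rounded form.

```python
# Evaluator scores as displayed in the summary (percent).
scores = {
    "Matches word count": 0.00,
    "Dialogue to Total Word Ratio": 98.05,
}

# Assumption: the overall score is the plain mean of the evaluator scores.
overall = sum(scores.values()) / len(scores)

reported_overall = 49.0243
difference = abs(overall - reported_overall)

print(f"Computed overall: {overall:.4f}%")
print(f"Difference from reported value: {difference:.4f} percentage points")
```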
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the crumpled napkin across the rusty hood of the getaway van, her finger tracing a jagged line that represented the perimeter of the Miller family's backyard. The ink was smeared, likely from the chili dog she'd eaten twenty minutes ago, which added a concerning pink splotch right over the inflatable slide. Bill crouched beside her, adjusting his goggles which were made from three pairs of swimming gear taped together. He held a walkie-talkie that was actually just a hair dryer with an antenna glued to the handle. The van smelled of ozone and desperation. "Intel's solid," Bill whispered, tapping the hair dryer. "Fifteen kids. One clown. Loot rivals Fort Knox." Molly nodded, her leather tac-vest creaking. It was stuffed with snack bars she'd stolen from a vending machine, which served as both payload and emotional support. She pointed a gloved hand at the pink smear. "That's the juice dispenser. Bypass it or trigger the sticky trap. Parents operate dirty." Bill's eyes went wide behind the swimming goggles. He shifted his weight, the plunger-grapple shifting in his pocket. "Glue?" Molly grabbed his shoulder, her grip iron-tight. "Nectar. If boots stick, we lose speed. Clown notices, gig is up. I won't lose the loot to juice." Bill pulled a grappling hook from his pocket. It was a plunger attached to a jump rope. He mimed a throwing motion over his head. "Countermeasures ready. I practiced on Uncle Dave's shed. Plunge the fence, drop bean smoke. It confuses their olfactory senses." Molly checked the magazine of her staple gun, ensuring the staples were bent at a forty-five degree angle for maximum holding power. She shook her head slowly. The air in the alley grew heavy with the anticipation of sugar and theft. "Bean acidity corrodes seals. Use flour. Distraction at castle. I swoop, secure, exfil through pet door. The dog will howl. Watch for the parents." Bill gulped, his Adam's apple bobbing violently against the collar of his tactical turtleneck. 
"Pet door? Three protein shakes. I don't fit." Molly slapped the napkin shut and shoved it into her boot. She leaned in, her voice dropping to a dangerous octave. "Get thin or take the clown. And the tutu. It's polka-dotted." Bill adjusted the hair dryer one last time, saluting with two fingers. "Noted. Flour bomb. Doggy exfil. Hit the party. Bring the snack stash." Molly grinned, revealing a gold tooth she'd bought from a pawn shop for a quarter. She kicked the van door open. Rain began to patter against the metal roof, drumming a rhythm of impending chaos. "Stay low. Nephew decoy. Out. And Bill?" "Yeah?" "Don't drop the loot."
created_at
"2026-04-27T08:13:34.158Z"
4/27/2026, 8:13:34 AM
model
"qwen/qwen3.5-plus-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
679
moderation_latency
(null)
generation_time
105305
tokens_prompt
77
tokens_completion
6607
native_tokens_prompt
76
native_tokens_completion
6976
native_tokens_completion_images
(null)
native_tokens_reasoning
6342
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0167728
router
(null)
provider_responses
0
endpoint_id
"8b4a0fbb-6705-4f8d-a7f1-ad18ecc0edbc"
id
"chatcmpl-78aff74c-5e16-93e9-9b0d-9b49a09e650e"
is_byok
false
latency
679
model_permaslug
"qwen/qwen3.5-plus-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777277614-6uCpMj1u9xSk8FGoIBlD"
session_id
(null)
api_type
"completions"
id
"gen-1777277614-to5dmnKx6qCUeRb77ATh"
upstream_id
"chatcmpl-78aff74c-5e16-93e9-9b0d-9b49a09e650e"
total_cost
0.0167728
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
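The token counts above show that most of the completion was spent on reasoning rather than on the visible scene. A minimal sketch of that breakdown follows; it assumes that native_tokens_reasoning is counted inside native_tokens_completion and that generation_time is in milliseconds (which matches the 1m 45s duration reported for the run). Neither assumption is confirmed by the page.

```python
# Values copied from the run metadata.
native_tokens_completion = 6976
native_tokens_reasoning = 6342
generation_time_ms = 105305  # assumed to be milliseconds

# Assumption: reasoning tokens are a subset of completion tokens.
visible_tokens = native_tokens_completion - native_tokens_reasoning
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Rough throughput over the whole generation, reasoning included.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"Non-reasoning completion tokens: {visible_tokens}")
print(f"Reasoning share of completion: {reasoning_share:.1%}")
print(f"Approximate throughput: {tokens_per_second:.1f} tokens/s")
```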
Evaluation details
Result     Evaluator                      Details                           Meta Data
0.00%      Matches word count             n/a                               neededClean: false; words: 440
98.05%     Dialogue to Total Word Ratio   Ratio: 29.37%, Deviation: 0.63%   neededClean: false; wordsTotal: 446; wordsDialogue: 131
49.0243%   Overall
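The Ratio and Deviation figures for the dialogue evaluator can be reproduced from the wordsDialogue and wordsTotal counts. The sketch below does so, taking the 30% target from the scenario; the function name is hypothetical, and the mapping from deviation to the 98.05% score is not shown on this page, so it is not reproduced here.

```python
def dialogue_ratio_percent(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the response, as a percentage."""
    return 100.0 * words_dialogue / words_total


TARGET_PERCENT = 30.0  # from the scenario: 500 words with 30% dialogue

ratio = dialogue_ratio_percent(words_dialogue=131, words_total=446)
deviation = abs(TARGET_PERCENT - ratio)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 29.37%, Deviation: 0.63%"
```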