NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ6Z956CV7XECAZYDBEYBN20
Run Details
Model:
Qwen 3.5 Plus (2026-04-20)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 20s
Cost:
$0.012748
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.02%
Dialogue to Total Word Ratio
(details)
0.0124%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled profoundly of crushed crackers, industrial adhesive, and the distinct, cloying sweetness of compromised integrity. Molly unrolled the tactical blueprint across a crate of contraband piƱata bats, her finger tracing the perimeter of the Henderson suburban garden with the cold precision of a surgeon mapping a tumor. The document, hand-drawn on the reverse side of a unicorn coloring page, detailed the patrol routes of Mr. Wiggles, a golden retriever whose loyalty could be purchased with any dairy product, and highlighted the critical elevation changes of the bouncy castle inflator. Every variable had been calculated, every escape route stress-tested, except for the inherent unpredictability of Bill, who sat opposite her trying to balance a party popper on his nose while simultaneously knitting a balaclava from silver streamer tape. Bill was a disaster in tactical gear. He wore a strained turtleneck paired with trousers patterned by cartoon dinosaurs that whispered secrets of his juvenile psyche with every shift, and his goggles were currently serving as a storage vessel for three loose jelly babies. He adjusted his suspenders, which were struggling to hold up the weight of a confetti cannon and a pouch of aggressive cheddar, and watched Molly with the wary respect of a man who knew his partner could and would leave him tied to a balloon arch if he sneezed during the extraction phase. The only sound in the cramped interior was the crinkle of streamers and the low hum of a generator powering a modified sippy cup walkie-talkie. "Musical statues cover the approach," Molly hissed, tapping a spot marked with a glitter pen that refused to dry. "Trampoline entry, unicycle ramp, soft play pit. We bypass the parents near the burger station and hit the gift table hidden behind the giant cardboard cake." Bill dropped the popper. It honked mournfully against his shin, vibrating through his bones. "The unicycle? My center of gravity is currently tied to a brick, Molly. If I miss the landing, I'm not stealing LEGOs; I'm becoming a cautionary tale told to toddlers at nap time. Plus, the clown has a knife. He slices cucumber sandwiches with lethal efficiency." "The clown is compromised. He wants the gift cards. Your job is the grab. Sealed games, cash in the card box, the limited-edition tea set. We're redistributing joy from those who have too much to someone with a gambling debt and a ferret prosthesis." Molly grabbed her sippy cup comms and took a sip of battery-acid coffee. "Loadout check. Smoke bombs, jump-rope grapple, duct-taped sack. Stop eating the cheese bribes." Bill fumbled with his belt. "Cheese is essential. Mr. Wiggles guards for cheddar. I packed the glitter bombs too. Just in case." "No glitter. Remember Toledo? We still find sparkles in the transmission. You drop into the potato salad. Splatter creates panic. I grab the loot. We vanish through the laundry chute. Simple." "Potato salad?" Bill paled beneath the dinosaurs. "The cold one with celery? My stomach is a fragile ecosystem. Can we swap for the water balloons?" "Water balloons pop. Salad splatters. Panic creates opportunity. Spit out the jelly babies. We roll in three. Honk early and you're Mr. Wiggles' chew toy." Bill gave a trembling thumbs-up, streamers drifting over his eyes. "No honking. Potato hero. Let's get those presents."
created_at
"2026-04-27T08:00:51.411Z"
4/27/2026, 8:00:51 AM
model
"qwen/qwen3.5-plus-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
706
moderation_latency
(null)
generation_time
80163
tokens_prompt
77
tokens_completion
5527
native_tokens_prompt
76
native_tokens_completion
5299
native_tokens_completion_images
(null)
native_tokens_reasoning
4556
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.012748
router
(null)
provider_responses
0
endpoint_id
"8b4a0fbb-6705-4f8d-a7f1-ad18ecc0edbc"
id
"chatcmpl-68ac0e6e-41ff-99ba-a2a1-aa611a4df072"
is_byok
false
latency
696
model_permaslug
"qwen/qwen3.5-plus-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777276851-uV2ZwE8ztVgQpFAjIMNj"
session_id
(null)
api_type
"completions"
id
"gen-1777276851-RvMY0GO2RwgveIf868CM"
upstream_id
"chatcmpl-68ac0e6e-41ff-99ba-a2a1-aa611a4df072"
total_cost
0.012748
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
547
0.02%
Dialogue to Total Word Ratio
Ratio: 42.88%, Deviation: 12.88%
neededClean
false
wordsTotal
555
wordsDialogue
238
0.0124%