NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBKJ4Q8B5CW38GDXY15CGBJ
Run Details
Model:
Qwen 3.5 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
45.6s
Cost:
$0.003387
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
84.30%
Dialogue to Total Word Ratio
(details)
42.1513%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain slicked the pavement outside the Henderson residence, reflecting the neon glow of a Happy Birthday banner taped crookedly to the siding. It hung limply from the gutter, fluttering in a damp wind that smelled of wet asphalt and impending doom. Molly crouched behind a rhododendron bush, adjusting the lenses on her thermal goggles while Bill fidgeted with a trowel in his muddy hands. The house was loud. Music blared from inside, specifically a techno remix of Baby Shark that vibrated through the soil. It was the perfect distraction for the heist. Molly checked her watch, a specialized device that tracked the dog s sleeping patterns via heart rate monitor. She knew the routine. They needed to get in before the frosting ran out or the parents noticed something missing from the security grid. Bill stepped on a twig, the sound cracking like a gunshot in the humid air. He froze, looking as if he expected the Golden Retriever to emerge instantly and bite him at the ankle. Molly shot him a glare that promised extreme violence if he drew attention to their location. Bill lowered his head and whispered through clenched teeth that the dog was just barking at a squirrel. Molly turned her gaze back to the back window. The latch looked simple enough to bypass with a tension wrench she kept taped inside her boot. The strategy was clean. In and out before the candles were lit. There was simply too much risk in the cake cutting phase where children got wilder after the lights went out and sugar hits their brains. "The truck is idling," she said softly, pointing toward the street. "At the corner." "I can hear the engine," Bill replied, voice trembling slightly. "It is loud." "Keep your voice down," she warned. "We are ghosts tonight, not children." Bill nodded, clutching his duffel bag. The black velvet lining was meant to protect the stolen goods from scratches, not from the elements. Molly checked her grapple gun. The weight felt familiar. It was the only thing keeping her steady. They were going to get the big gift under the tree. The one wrapped in gold foil. It was the hardest prize. "Are you absolutely sure about the latch mechanism?" Bill asked. "I am positive," she answered. "It will open." "Uncle Bob might be patrolling outside," Bill warned. "Let him eat frosting if he gets too close," she muttered. "I am not afraid of a man in a suit anyway." "A suit is suspicious in this neighborhood," Bill whispered. "He is suspicious," she said. "We move only when the song changes." "When does the song change exactly?" "I do not know honestly," she admitted. "Okay, I will be ready," Bill said. "Signal me." "Wait until I signal," she replied. "We do not wait any longer." "Why do we not wait?" "Because the entire plan relies on momentum." "Momentum always sounds dangerous to people." "Danger pays," Molly said. "Just trust me for once." Bill hesitated, then smiled nervously. "Sure thing. Operation Toy Store." "Exactly," she confirmed. "Do not look back for any reason." "I got it," he said. "I will follow your lead exactly." "Are you ready?" "I am ready," Bill lied. She pushed him through first. The window clicked open. The air inside smelled of vanilla wax and chaos. Molly took a deep breath and followed her partner into the party. They had ten minutes before the cake was cut. It was enough time.
created_at
"2026-02-25T23:53:45.966Z"
2/25/2026, 11:53:45 PM
model
"qwen/qwen3.5-flash-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
503
moderation_latency
(null)
generation_time
45460
tokens_prompt
77
tokens_completion
9001
native_tokens_prompt
76
native_tokens_completion
8449
native_tokens_completion_images
(null)
native_tokens_reasoning
7688
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0033872
router
(null)
provider_responses
0
endpoint_id
"5f6b71b5-8672-4b27-8b2d-f674429f9e63"
id
"chatcmpl-1f9252f1-3417-9303-958d-409ed3141007"
is_byok
false
latency
503
model_permaslug
"qwen/qwen3.5-flash-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772063625-wq0CdrCunLgYNGKFncV0"
upstream_id
"chatcmpl-1f9252f1-3417-9303-958d-409ed3141007"
total_cost
0.0033872
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
579
84.30%
Dialogue to Total Word Ratio
Ratio: 28.15%, Deviation: 1.85%
neededClean
false
wordsTotal
579
wordsDialogue
163
42.1513%