NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK1P7KHYD1YV4Y1HMEC39ED
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
7m 45s
Cost:
$0.038087
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Cold rain hammered the van roof. Molly checked her watch nervously. "Tell me again why we are targeting a kindergarten birthday bash instead of a high security bank," Molly said quietly. "High value targets with minimal security protocols involved throughout the building," Bill replied calmly, adjusting his polka-dot tie. "Limited edition action figures. Rare plushies. Unmarked cash envelopes inside cards," Bill added. "Cash envelopes are usually empty because parents spend everything on the expensive venue," Molly countered. "Not these parents because look at those luxury SUVs parked nearby outside," Bill insisted strongly. "We are robbing toddlers which is a new low even for us specifically," Molly said loudly. "It is a redistribution of wealth from spoiled kids to struggling professionals like us," Bill argued. "We are not professionals because we are wanted in three counties for stealing garden gnomes," Molly snapped back. "That was a misunderstanding since the gnomes were completely unprotected by owners," Bill said quietly. "There is a magician inside and magicians know all the tricks possible," Molly warned him. "I know more tricks than him because I can make a present disappear quickly," Bill claimed proudly. "Into your pocket specifically," Molly asked dryly. "Exactly since that is the definition of sleight of hand technically," Bill said simply. "Just do not get caught by the clown because clowns are unpredictable mentally," Molly said sharply. "I am wearing a clown nose so I am one of them officially," Bill reminded her. "You look like a thief wearing a clown nose and it does not blend," Molly observed. "It creates confusion which is known as cognitive dissonance scientifically," Bill said confidently. "It creates laughter from the police specifically when they arrive," Molly said coldly. "Are you ready to move now," Bill asked, opening the door. "Wait because what is the signal exactly," Molly said, grabbing his arm. "When the cake comes out everyone sings and distraction is maximum then," Bill answered quickly. "And what is the escape route planned," Molly questioned, eyeing the slide. "Through the ball pit because bullets will not track in plastic balls ever," Bill said firmly. "We are not bringing guns into a childrens party ever," Molly said sternly, exasperated. "Metaphorically speaking I meant candy guns specifically," Bill clarified quickly. "Just get the bags and leave the cake alone," Molly ordered, stepping into rain. "Why leave the cake because I love sheet cake deeply," Bill complained, following her. "Sugar crash is bad for driving safely," Molly said flatly. "Fair point," Bill conceded, slipping on a wet leaf. "Quiet because the doves are watching us closely," Molly hissed. "Do birds have cameras installed," Bill whispered, looking up. "Never mind just pick the lock now," Molly said, creeping toward the door. "On it and give me thirty seconds exactly," Bill said, kneeling down. "You have ten minutes so make it fast please," Molly said, watching a parent. "Pressure makes diamonds," Bill muttered, working the pins. "Pressure makes arrests so hurry up," Molly replied, checking her watch. "Click," Bill said softly, standing up. "Open," Bill said firmly. "Let us rob some kids now," Molly said sternly, pulling her mask up. "For the team," Bill whispered, pushing the heavy door.
created_at
"2026-02-16T11:07:24.036789+00:00"
2/16/2026, 11:07:24 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
859
moderation_latency
(null)
generation_time
290964
tokens_prompt
77
tokens_completion
9975
native_tokens_prompt
76
native_tokens_completion
10567
native_tokens_completion_images
(null)
native_tokens_reasoning
9843
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0380868
router
(null)
provider_responses
0
id
"chatcmpl-4243677a-7ed2-9133-8b80-7ae1ef01fa7b"
status
200
is_byok
false
latency
859
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771239752-7SuT2lJWaBWdpf2DHAJU"
upstream_id
"chatcmpl-4243677a-7ed2-9133-8b80-7ae1ef01fa7b"
total_cost
0.0380868
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0098%
Matches word count
n/a
neededClean
false
words
531
94.4014%
Dialogue to Total Word Ratio
Ratio: 68.93%, Deviation: 1.07%
neededClean
false
wordsTotal
531
wordsDialogue
366
47.2056%