NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS87CH7X8HW4GA4VCZDQ0EWV
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 33s
Cost:
$0.078325
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
71.16%
Dialogue to Total Word Ratio
(details)
35.5791%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The bright morning sun illuminated the messy kitchen. Molly unrolled the crayon drawing across the wooden table. "This is the target. Timmy seventh birthday. Presents will be piled near the bouncy castle." she said. Molly pointed a manicured fingernail at a crudely sketched slide. The blueprint was drawn in washable markers, but the stakes were incredibly high. Bill squinted at the paper. "Why is there a giant purple dinosaur guarding the perimeter?" He leaned closer, adjusting his thick glasses while studying the brightly colored layout of the suburban backyard. "That is Barney. He is an entertainer. He smokes at exactly two fifteen. That is our window." Molly replied. She took a slow sip of her black coffee, maintaining intense eye contact with her partner. "Two fifteen? What if he vapes?" "Then we have three extra minutes. Focus Bill. The real threat is a clown." Bill shuddered. He rubbed his arms, visibly haunted by a traumatic childhood memory involving a birthday magician and a very aggressive dove. "I hate clowns. Can we just shoot the clown?" "No shooting the clown. We are professionals. We use the distraction." "What distraction?" "The paper donkey. When it breaks, the children will swarm the candy. Total chaos. We slip through the patio doors, grab the boxes, and extract via the ball pit." Molly said, tapping a drawn animal. Her finger landed squarely on a bright yellow circle representing the ultimate distraction zone. "Extract via the ball pit? Molly, I am wearing a tuxedo." "You are wearing a tuxedo to a backyard party in July?" "I thought it was a gala." "It is a barbecue Bill. Take off the cummerbund and put on the foam hands. You are the mascot." "I am not wearing the foam hands. They smell like old cheese." "They smell like victory. Now, review the escape route." Bill sighed loudly. He slumped back in his wooden chair, finally accepting his ridiculous fate for the afternoon. "We exit through the side gate, avoiding the water slide, and get into the getaway vehicle." "Which is?" "The ice cream truck." "Exactly. You drive, I play the jingle. If a parent tries to stop us, I throw a frozen treat." "What if the presents are just clothes?" "Timmy is seven. There will be plastic bricks. Plastic bricks are untraceable. We fence them to teenagers down the street." "And if there is a puppy?" "We leave the puppy Bill. We are thieves, not monsters." "Right. Good boundary." "Memorize the juice boxes. If a kid asks for apple, give them apple. Do not mix up the fruit punch. That causes tantrums." "I know the juice protocol Molly." "Good. Put on the foam hands. We have a bouncy castle to infiltrate." Molly slowly grabbed her heavy black tactical sunglasses.
created_at
"2026-05-22T16:13:08.748Z"
5/22/2026, 4:13:08 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
924
moderation_latency
(null)
generation_time
152940
tokens_prompt
77
tokens_completion
9069
native_tokens_prompt
76
native_tokens_completion
10418
native_tokens_completion_images
(null)
native_tokens_reasoning
9800
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.078325
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-5de366c3-2301-950c-a917-4b7ce12cc59f"
is_byok
false
latency
924
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779466388-BMV8yn8gVnF8SM1khQ8t"
session_id
(null)
api_type
"completions"
id
"gen-1779466388-K6S8aQPhHgkskVa1YU1j"
upstream_id
"chatcmpl-5de366c3-2301-950c-a917-4b7ce12cc59f"
total_cost
0.078325
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
460
71.16%
Dialogue to Total Word Ratio
Ratio: 67.39%, Deviation: 2.61%
neededClean
false
wordsTotal
460
wordsDialogue
310
35.5791%