NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK24EY1VKMFGA8T5NWP9ZKG
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
5m 37s
Cost:
$0.045895
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Are you sure about this plan?" Bill asked, peeking nervously over the green hedge. "Absolutely," Molly said, adjusting her black gloves. "Look at those wrapping papers. Gold mine." "It's a birthday party, Mol. Specifically for smart toddlers." "Toddlers have expensive taste. Lego is legitimate currency now." "I feel really weird stealing from little Timmy today." "Timmy won't remember anything. He'll be asleep by seven." "What about the clown?" "I'll distract him with balloons." "You hate balloons specifically." "Desperate times, Bill. Just grab the sack." "It says 'Happy Birthday' on the side." "Irony is our best friend. Move your feet!" "Wait, is that a live pony?" "Bonus loot. Don't question the massive haul ever." "I'm questioning the ethics involved." "Ethics don't pay rent. Grab the Xbox console." "That console is for a six-year-old." "Exactly. Prime resale value on eBay." "Okay, but what if the clown sees us..." "He's making balloon animals. He's fully focused currently." "That small dog looks aggressive." "It's wearing a party hat. It's very friendly." "I'm telling you now, this whole thing feels wrong." "Think of the commission, Bill. Think of the big Lego." "Fine. But I'm keeping the Play-Doh." "Deal. Now lift the window quietly please."
created_at
"2026-02-16T11:13:02.977356+00:00"
2/16/2026, 11:13:02 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
828
moderation_latency
(null)
generation_time
336526
tokens_prompt
77
tokens_completion
8305
native_tokens_prompt
76
native_tokens_completion
12736
native_tokens_completion_images
(null)
native_tokens_reasoning
12412
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0458952
router
(null)
provider_responses
0
id
"chatcmpl-9a5ae027-4f35-9e20-a51d-ab732a68cc0f"
status
200
is_byok
false
latency
828
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771240045-v0IxtTg2pkpykcHtPZEk"
upstream_id
"chatcmpl-9a5ae027-4f35-9e20-a51d-ab732a68cc0f"
total_cost
0.0458952
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9840%
Matches word count
n/a
neededClean
false
words
202
99.1164%
Dialogue to Total Word Ratio
Ratio: 93.07%, Deviation: 3.07%
neededClean
false
wordsTotal
202
wordsDialogue
188
99.5502%