NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK2NK2GAEGTZREDJ5E6R235
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
4m 34s
Cost:
$0.041240
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly whispered, Bill, look at that gold wrap near the clown. Bill sighed, It's Lego, Molly. Heavy, sharp Lego. We sell it bulk, she said. Who cares about bricks? There are twenty clowns, Bill noted. I hate clowns. They're distracted by cake. Focus on the pile. What's the exit strategy? Run fast toward the minivans, Molly replied. What about the screaming parents? They're drinking punch. Spiked punch. Bill frowned. Stealing from toddlers? Really? It's victimless. They forget by Tuesday. I feel dirty right now. You feel poor. Look at that remote control car. It's for a four-year-old. Exactly. Premium tech inside small boxes. Okay, but if a balloon pops, I'm leaving. Deal. Now move. Wait, is that a piƱata? Distraction. Smash it, grab the candy cash. This is low, Mol. It's high reward. Go. Bill stood up. Fine. But I want the Lego. Get the lot, she said. What if they cry? Wear earplugs. Bill nodded. Earplugs. Right. Move out. Did you bring the sack? It's in my coat. Good. Don't crush the cards. Why crush the cards? Resale value, Bill. Respect the hustle. Where is the getaway van? Idling around the corner. Fine. Let's rob these kids. Finally. You're learning.
created_at
"2026-02-16T11:21:21.001301+00:00"
2/16/2026, 11:21:21 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
860
moderation_latency
(null)
generation_time
273721
tokens_prompt
77
tokens_completion
7368
native_tokens_prompt
76
native_tokens_completion
11443
native_tokens_completion_images
(null)
native_tokens_reasoning
11098
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0412404
router
(null)
provider_responses
0
id
"chatcmpl-c78a8647-bff7-906b-9348-9b19e9c138e0"
status
200
is_byok
false
latency
860
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771240606-QDD9e7oiRvmvsgOCU8Te"
upstream_id
"chatcmpl-c78a8647-bff7-906b-9348-9b19e9c138e0"
total_cost
0.0412404
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9190%
Matches word count
n/a
neededClean
false
words
203
0.0000%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 90.00%
neededClean
false
wordsTotal
203
wordsDialogue
0
49.9595%