NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK3BEPFWZ833RQ20EDZWXSH
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
4m 37s
Cost:
$0.039462
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly peered through binoculars at the brightly colored house. Bill chewed nervously on a stale crayon. Molly adjusted her glasses. Bill checked his bag. The moon was high. Shadows stretched long. "Are you sure about this plan?" Bill asked. "Think of the loot, Bill. Wrapped boxes everywhere." "It feels wrong stealing from toddlers," Bill whispered. "They don't need LEGOs," Molly said. "What about the parents?" Bill asked. "Asleep by nine," Molly replied. "I used to love parties." Bill looked down. "Focus on the prize," Molly commanded. "Okay. Let's do this." Bill straightened his mask. "Move out." Molly lowered the binoculars. They crept toward the fence. "Wait," Bill said. "What now?" Molly froze. "I left the getaway car running," Bill admitted. "You idiot." Molly groaned. Bill ran back. Molly waited. This was going poorly. She checked her watch. Time was running out. Bill returned. "Fixed it," he panted. "Good," Molly said. "Did you lock the doors?" Molly asked. "Yes, everything is secure," Bill replied. "Then we are rich," Molly smiled. "I hope no clowns show up," Bill worried. "Clowns are afraid of us," Molly assured. "That is true," Bill agreed. "Is the coast clear?" "Clear as crystal." "Let's go inside." "Right behind you." They entered the yard.
created_at
"2026-02-16T11:33:20.504624+00:00"
2/16/2026, 11:33:20 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
807
moderation_latency
(null)
generation_time
277096
tokens_prompt
77
tokens_completion
9318
native_tokens_prompt
76
native_tokens_completion
10949
native_tokens_completion_images
(null)
native_tokens_reasoning
10603
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.039462
router
(null)
provider_responses
0
id
"chatcmpl-d2d3955a-2636-9c26-a68d-676038802bbc"
status
200
is_byok
false
latency
807
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771241323-OoE5lwRFGXQ6trDjT3eL"
upstream_id
"chatcmpl-d2d3955a-2636-9c26-a68d-676038802bbc"
total_cost
0.039462
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
98.7124%
Matches word count
n/a
neededClean
false
words
206
99.9550%
Dialogue to Total Word Ratio
Ratio: 48.54%, Deviation: 1.46%
neededClean
false
wordsTotal
206
wordsDialogue
100
99.3337%