NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK186Z4X2XNHYJAPFAAQ91J
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4m 8s
Cost:
$0.038357
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill checked his watch. "Are we absolutely clear on the primary objective for tonight's operation?" "Steal the loot. Avoid the parents. Leave absolutely no trace behind." Molly adjusted her gloves. "Specifics, Molly. What is the specific protocol for the clowns here?" "Clowns are civilian collateral. You must ignore them completely." "I hate clowns. They know far too much about society." Bill tightened his mask. "Focus. The target is the big pile near the plastic throne." "You mean the yellow chair?" "The throne. That is exactly where the big boxes sit." Molly checked her lockpicks. "What happens if a toddler sees me inside the house?" "Smile. Wave. Give them a quick high-five." "I do not do high-fives on the job." Bill threw balloons at her. "Then offer a balloon. Do you have any balloons?" "I have lockpicks and a grapnel hook." "Useless. Get balloons from the dashboard." Molly sighed. "We are professionals, Bill. Not party entertainers." "Tonight we are both. Put on the red nose." "The foam nose?" "It is perfect camouflage." Bill put his nose on. "This is professional degradation." "It is strategy. Now, discuss the exit route." "We go out through the inflatable slide?" "Exactly. It is fast and fun." Molly stood up. "I will not fit in the tube." "Take off the utility belt." "That holds my entire gear kit." Bill pointed at the door. "You will not need gear inside the slide." "What if I get stuck halfway?" "Then I will push you from behind." "Charming." Bill opened the glove box. "Look, do you want the vintage Nintendo or not?" "Is it actually vintage?" "Sealed in box. Extremely rare." Molly pocketed the map. "Okay. The slide works for me." "Good. The timer starts right now." Bill checked his watch. "Wait. One more thing." "Make it quick." Molly rubbed her stomach. "What about the cake?" "What about it?" "Can we eat some during the op?" "No eating on the job." "Just a small slice." Bill shook his head. "Focus, Molly." "I am very hungry." "Eat a protein bar." "They taste like chalk." "Better than jail food." Molly opened the van door. "Fine. Let's move out." Bill opened the door. "Sounds like chaos." "Sounds like money." Molly put on the nose. "I still think the nose is optional." "Put it on." "You are wearing one too." "Leadership by example." "You look stupid." Bill stepped onto the grass. "Let's go make some bank." Molly clipped her earpiece. "Channel one?" "Channel one." "Radio check." "Loud and clear." Bill walked toward the house. "Remember, no kicking the kids." "I would never." "Even if they bite?" "Especially then." Molly followed him. "Okay. Breaching in three." "Two." "One." They stepped into the light.
created_at
"2026-02-16T10:56:08.692406+00:00"
2/16/2026, 10:56:08 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
866
moderation_latency
(null)
generation_time
248187
tokens_prompt
77
tokens_completion
10074
native_tokens_prompt
76
native_tokens_completion
10642
native_tokens_completion_images
(null)
native_tokens_reasoning
9911
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0383568
router
(null)
provider_responses
0
id
"chatcmpl-54d713b6-5343-972f-9529-39ad473bc8ff"
status
200
is_byok
false
latency
866
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771239119-kwNmhn4yiyzdV1stNixN"
upstream_id
"chatcmpl-54d713b6-5343-972f-9529-39ad473bc8ff"
total_cost
0.0383568
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
451
1.2297%
Dialogue to Total Word Ratio
Ratio: 79.38%, Deviation: 9.38%
neededClean
false
wordsTotal
451
wordsDialogue
358
0.6148%