NC Bench
Run: 2026-02-16-01KHK2TECM8VZFR9PYM32N92HS
Run Details
Model: Qwen 3.5 397B A17B
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 5m 22s
Cost: $0.044974
Provider: Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her night-vision goggles, staring at the colorful bouncy castle. Bill chewed his fingernails nervously beside her in the van. "Remember, no witnesses," Molly whispered sharply while adjusting her glove. "What about the clown?" Bill asked, his voice trembling slightly. "Clowns do not count. They are merely illusions." "He looked solid to me." Molly sighed loudly, checking the detailed blueprint of the venue again. "Grab LEGO, cash, and teddy bear. Leave crayons." "Why no crayons at all?" "Resale value is terrible. They break easily inside sacks." Bill shifted uncomfortably in his sticky vinyl seat. "Is it wrong morally? They are tiny humans." "They are insured. Focus on the loot pile near the cake." "What if a toddler wakes up?" "Distract them with a shiny object. Or throw Bill into the ball pit." "That is not funny really." "It is purely tactical." Molly opened the heavy van door slowly. The sound of distant happy birthday echoes drifted out into the dark night air. "Move out now. And Bill?" "Yeah?" "If squeaky shoes, run fast." Bill nodded silently, gripping his empty burlap sack tightly. They crept toward the inflatable fortress, ready to commit the perfect crime against kindergarten security systems tonight.
created_at: "2026-02-16T11:24:47.71125+00:00" (2/16/2026, 11:24:47 AM)
model: "qwen/qwen3.5-397b-a17b-20260216"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 871
moderation_latency: (null)
generation_time: 321454
tokens_prompt: 77
tokens_completion: 9230
native_tokens_prompt: 76
native_tokens_completion: 12480
native_tokens_completion_images: (null)
native_tokens_reasoning: 12185
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0449736
router: (null)
provider_responses
0
id: "chatcmpl-61c0f248-602f-9354-885e-38a66d2ed4d1"
status: 200
is_byok: false
latency: 871
endpoint_id: "180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name: "Alibaba"
model_permaslug: "qwen/qwen3.5-397b-a17b-20260216"
api_type: "completions"
id: "gen-1771240765-rjtBzDQ1hKgHIB94nPNo"
upstream_id: "chatcmpl-61c0f248-602f-9354-885e-38a66d2ed4d1"
total_cost: 0.0449736
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
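The timing and token fields above can be cross-checked against the run summary with a little arithmetic. The sketch below rests on assumptions the page does not confirm: that latency and generation_time are in milliseconds, that the reported Duration is their sum, and that native_tokens_completion includes the reasoning tokens. The helper name format_duration is hypothetical.

```python
def format_duration(ms: int) -> str:
    """Render milliseconds as 'Xm Ys', rounded to the nearest second."""
    total_seconds = round(ms / 1000)
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


# Values copied from the run metadata; units are assumed to be milliseconds.
latency_ms = 871
generation_time_ms = 321_454
native_tokens_completion = 12_480
native_tokens_reasoning = 12_185

# Assumption: wall-clock duration is latency plus generation time.
total_ms = latency_ms + generation_time_ms

# Assumption: the completion token count includes reasoning tokens.
reasoning_share = native_tokens_reasoning / native_tokens_completion
visible_tokens = native_tokens_completion - native_tokens_reasoning
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(format_duration(total_ms))        # 5m 22s
print(f"{reasoning_share:.1%}")         # share of completion tokens spent on reasoning
print(visible_tokens)                   # 295
print(f"{tokens_per_second:.1f}")       # native completion tokens per second
```

Under these assumptions the sum reproduces the reported 5m 22s, and almost all of the completion budget went to reasoning rather than to the 201-word answer.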
Evaluation details
Result | Evaluator | Details | Meta Data
99.9990% | Matches word count | n/a | neededClean: false; words: 201
100.0000% | Dialogue to Total Word Ratio | Ratio: 49.75%, Deviation: 0.25% | neededClean: false; wordsTotal: 201; wordsDialogue: 100
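The reported ratio and deviation follow directly from the two word counts. The following is a minimal sketch, not the benchmark's actual evaluator: it assumes words are whitespace-separated tokens, dialogue is whatever sits between straight double quotes, and the target ratio is 0.5 as stated in the scenario. The function names are hypothetical, and how the evaluator maps the deviation to a percentage score is not shown on the page, so it is not modelled here.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed dialogue marker)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in spans)


def dialogue_ratio(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Word counts taken from the evaluator's Meta Data column.
ratio, deviation = dialogue_ratio(100, 201)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 49.75%, Deviation: 0.25%
```

A real evaluator may tokenize differently (for example, handling curly quotes or hyphenated words), so counts produced by these helpers on the raw response need not match the 201 and 100 reported above.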
99.9995%