NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBFGEGSBKRP00XBMHVJ0XNE
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
1m 47s
Cost:
$0.041593
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
99.74%
Dialogue to Total Word Ratio
(details)
99.8689%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly checked her reflection in the very dark window. Bill nervously straightened his wig. "The clown will distract them," Molly said, pointing at the backyard. "He looked angry," Bill whispered. "He had a knife." "A prop knife, Bill. The clown is part of the diversion." Molly pulled out a bag of cookies. "We need the dogs sedated." "Is that legal?" Bill asked, sweating through his tuxedo vest. "It's the principle, Bill. Now move." They crouched low, creeping toward the porch light. Balloons bobbed violently in night breeze, like tiny colorful ghosts guarding the loot. Bill checked his watch. Ten minutes. If the cake doesn't work, the dog wakes up, the alarm triggers, and we go to jail for stealing a teddy bear. Molly smirked, pulling out a pair of lockpicks made from straws. "We don't go to jail. We become the gift givers of the underworld." "Oh," Bill said. "What about laser grid?" "There is no laser grid, Bill. Just party." "A party?" "A birthday. It's the cover." "I hate birthdays." "Just do it." "Are you sure?" "I am positive." "Okay." "Wait here." "Okay." "Let's go." They entered quietly now. "It was easy." Door opened. "The lights stayed off."
created_at
"2026-02-25T22:42:56.162Z"
2/25/2026, 10:42:56 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
397
moderation_latency
(null)
generation_time
107267
tokens_prompt
77
tokens_completion
16339
native_tokens_prompt
76
native_tokens_completion
20787
native_tokens_completion_images
(null)
native_tokens_reasoning
20468
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.041593
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-ac2ec73a-e3b0-9cee-963e-709965292765"
is_byok
false
latency
397
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772059376-27LdCmKO2AqMEN0UgBtW"
upstream_id
"chatcmpl-ac2ec73a-e3b0-9cee-963e-709965292765"
total_cost
0.041593
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
199
99.74%
Dialogue to Total Word Ratio
Ratio: 47.74%, Deviation: 2.26%
neededClean
false
wordsTotal
199
wordsDialogue
95
99.8689%