NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBJT9D8QGBS93M811Q44WD4
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
1m 16s
Cost:
$0.028371
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
6.09%
Matches word count
(details)
93.62%
Dialogue to Total Word Ratio
(details)
49.8551%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly checked the tension on the rope one more time, ensuring it would support the weight of three hundred dollars worth of battery operated bears while avoiding the cat watching from the windowsill. Bill stood by the window, watching the party planner blow up another balloon with very loud aggression. They had spent weeks rehearsing the stealth approach, though stealth was largely theoretical given their tendency to trip over shoelaces and their own incompetence. The house was a fortress of plastic wrapping and aggressive wrapping at the house, guarding the hoarded treasures of childhood. The plan required absolute precision during the opening of gifts, a time when the owners were most vulnerable to distraction and they had no plan B. A dropped cupcake would serve as a perfect diversion while the thieves made off with the valuable inventory. There was no moral ambiguity about it, only the sheer logistics of transporting a thousand pounds of toys without waking the sleeping dogs or alerting the neighbors who were actually very friendly and owned angry guard dog. Molly adjusted her gloves carefully, signaling they were close to the deadline for the window of opportunity before the cake was cut or candles lit. "Are you ready to go in?" Bill whispered nervously into the night air. "Just the presents, nothing else." Molly replied sharply. They dropped.
created_at
"2026-02-25T23:40:44.334Z"
2/25/2026, 11:40:44 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
403
moderation_latency
(null)
generation_time
75441
tokens_prompt
77
tokens_completion
12019
native_tokens_prompt
76
native_tokens_completion
14176
native_tokens_completion_images
(null)
native_tokens_reasoning
13914
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.028371
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-f45a9567-1837-97a0-8eb0-dadc494d03ce"
is_byok
false
latency
403
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772062844-w3lJljSoX4sqChrrr3L8"
upstream_id
"chatcmpl-f45a9567-1837-97a0-8eb0-dadc494d03ce"
total_cost
0.028371
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
6.09%
Matches word count
n/a
neededClean
false
words
223
93.62%
Dialogue to Total Word Ratio
Ratio: 4.93%, Deviation: 5.07%
neededClean
false
wordsTotal
223
wordsDialogue
11
49.8551%