NC Bench
Run: 2026-02-25-01KJBE7PQ6WVVBY4QM4P7NYTZM
Run Details
Model: Qwen 3.5 35B
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 3m 13s
Cost: $0.032839
Provider: Alibaba
Evaluation summary
Score       Evaluator
0.00%       Matches word count
36.79%      Dialogue to Total Word Ratio
18.3940%    Overall
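The overall figure looks like the unweighted mean of the two evaluator scores. That is an inference from the numbers shown, not something the page states. A minimal Python check, using the rounded scores as displayed:

```python
# Scores as displayed in the evaluation summary (percent).
# Assumption: "Overall" is the plain arithmetic mean of the evaluator scores.
scores = {
    "Matches word count": 0.00,
    "Dialogue to Total Word Ratio": 36.79,
}

overall = sum(scores.values()) / len(scores)
print(f"{overall:.4f}%")  # prints 18.3950%
```

The mean of the displayed values is 18.3950%, while the page reports 18.3940%. The small gap would be consistent with the second score being rounded for display from roughly 36.788%.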
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Rain slicked the pavement outside the mansion where the boy turned seven. Molly adjusted her clown nose, checking the reflection in a puddle. Beside him, Bill polished a tranquilizer dart. Inside, laughter bubbled through the window like soda pop. They needed a distraction. The plan involved a smoke bomb and nap time. Molly tapped her watch. The clock ticked toward eight. Bill nodded, eyes gleaming in the dark. They moved closer to the hedge, hearts thumping against ribs. This was professional work. Not a game. Not a hobby. A heist. Bill pointed to the gift table under the tree. The loot was stacked like bricks. Too easy. Too bright. They needed to be ghosts. Molly pulled a grappling hook from her coat, wrapping the chain around her wrist. Bill checked the timer. Three minutes until the cake cutting. The perfect moment to vanish. They weren't criminals. Just opportunists with style. Molly leaned in close. She whispered, Are you ready? Bill wiped sweat from his brow. He nodded, eyes locking on the window. He said, Go now. Molly threw the smoke bomb inside. The window glowed orange. They waited for the coughing to start. Then they struck. It had to be silent. No alarms. Just the sound of wrapping paper tearing. Bill whispered, Don't drop it. Molly smiled. Her teeth gleamed like pearls. She grabbed the first box. It was heavy. A game console. Perfect. They needed to vanish fast. The guests would leave soon. Bill checked the timer again. He said, Go now. Molly nodded. She moved toward the fence. She jumped over the wood. Bill followed her lead. They landed softly in the grass. The night air was cold. Their hearts raced. They had the bags. Bill said, Nice. Molly laughed silently. They drove off into the dark.
created_at: "2026-02-25T22:20:41.066Z" (2/25/2026, 10:20:41 PM)
model: "qwen/qwen3.5-35b-a3b-20260224"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1365
moderation_latency: (null)
generation_time: 193148
tokens_prompt: 77
tokens_completion: 13174
native_tokens_prompt: 76
native_tokens_completion: 16410
native_tokens_completion_images: (null)
native_tokens_reasoning: 16020
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.032839
router: (null)
provider_responses: 0
endpoint_id: "48c1210f-b7d2-4688-a340-f7424fc99d2b"
id: "chatcmpl-2c0c6405-29f7-9691-835a-865c2b38899f"
is_byok: false
latency: 1362
model_permaslug: "qwen/qwen3.5-35b-a3b-20260224"
provider_name: "Alibaba"
status: 200
api_type: "completions"
id: "gen-1772058041-IV7pJbA5oGaRVWJAHGaP"
upstream_id: "chatcmpl-2c0c6405-29f7-9691-835a-865c2b38899f"
total_cost: 0.032839
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
Evaluation details
Evaluator: Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean = false, words = 299

Evaluator: Dialogue to Total Word Ratio
Result: 36.79%
Details: Ratio: 0.00%, Deviation: 10.00%
Meta Data: neededClean = false, wordsTotal = 299, wordsDialogue = 0

Overall: 18.3940%
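The evaluator reports zero dialogue words even though the response contains spoken lines such as "She whispered, Are you ready?". One plausible explanation, offered here as an assumption rather than a description of NC Bench's actual evaluator, is that only text inside quotation marks is counted as dialogue, and the response uses none. The Python sketch below illustrates that idea. The names `dialogue_stats` and `guessed_score` are hypothetical, and the exponential scoring rule is a guess chosen only because exp(-1) is about 36.79%, which matches the reported score for a 0% ratio against a 10% target.

```python
import math
import re


def dialogue_stats(text: str) -> tuple[int, int, float]:
    """Return (total_words, dialogue_words, ratio).

    Assumption: only words inside straight or curly double quotes
    count as dialogue; words are whitespace-separated tokens.
    """
    total = len(text.split())
    spans = re.findall(r'"([^"]*)"|“([^”]*)”', text)
    dialogue = sum(len((straight or curly).split()) for straight, curly in spans)
    ratio = dialogue / total if total else 0.0
    return total, dialogue, ratio


def guessed_score(ratio: float, target: float) -> float:
    """Hypothetical score: exponential decay in relative deviation from target."""
    return math.exp(-abs(ratio - target) / target)


# Two lines from the response, as written and with quotation marks added.
unquoted = "She whispered, Are you ready? He said, Go now."
quoted = 'She whispered, "Are you ready?" He said, "Go now."'

print(dialogue_stats(unquoted))  # (9, 0, 0.0)
print(dialogue_stats(quoted)[:2])  # (9, 5)
print(f"{guessed_score(0.0, 0.10):.2%}")  # 36.79%
```

Under this reading, the response's dialogue is invisible to the counter because it is never quoted, which would yield the reported ratio of 0.00% and deviation of 10.00%.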