NC Bench
Run: 2026-02-25-01KJBG7XV33WQDJEBJ1WH0YZKH
Run Details
Model: Qwen 3.5 35B
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 2m 33s
Cost: $0.043247
Provider: Alibaba
Evaluation summary
Score | Evaluator
100.00% | Matches word count
99.95% | Dialogue to Total Word Ratio
99.9742% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly adjusted her disguise, peering through the living room window where colorful balloons bobbed above a mountain of wrapped paper. Bill checked his watch, grimacing at the second hand ticking toward the quick cake cutting. They wore matching neon tracksuits that smelled of chlorine and pizza. The blueprints taped to Bill’s forehead depicted a ventilation shaft leading to the mantelpiece. Inside, the hostess hummed a nursery rhyme while cutting a three-tiered sponge cake decorated with dinosaurs. Molly wiped sweat from her forehead, ruining the powder makeup she’d applied over her nose to simulate a runny snot. Plan clear. Infiltrate during the piñata strike. Swap the gift bags with identical empty ones. Adrenaline pumped through Bill’s veins like carbonated soda. He felt the heavy sack. It was heavy with possibility and greed. Molly signaled with three fingers. The air conditioning unit rattled above them. A stray cat yowled from the hedge, startled by the flash of flashlights. Bill stepped back. Molly nodded, checking the lockpicking set in her pocket. They slipped through the back door. Bill whispered, 'Are we really ready to do this today?' 'Yes,' she said, 'Just go get the bags quickly now.' 'Please.' 'Okay. Yes.' 'Go! Now.'
Meta
created_at: "2026-02-25T22:55:45.512Z" (2/25/2026, 10:55:45 PM)
model: "qwen/qwen3.5-35b-a3b-20260224"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 421
moderation_latency: (null)
generation_time: 152798
tokens_prompt: 77
tokens_completion: 16597
native_tokens_prompt: 76
native_tokens_completion: 21614
native_tokens_completion_images: (null)
native_tokens_reasoning: 21340
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.043247
router: (null)
provider_responses:
  0:
    endpoint_id: "48c1210f-b7d2-4688-a340-f7424fc99d2b"
    id: "chatcmpl-fad0d63b-e5b1-9c25-914a-1c26770be5d4"
    is_byok: false
    latency: 421
    model_permaslug: "qwen/qwen3.5-35b-a3b-20260224"
    provider_name: "Alibaba"
    status: 200
api_type: "completions"
id: "gen-1772060145-ic8AVezBOlnYjNatYRsK"
upstream_id: "chatcmpl-fad0d63b-e5b1-9c25-914a-1c26770be5d4"
total_cost: 0.043247
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
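The token and timing fields above can be combined into a few derived figures. The following is a minimal sketch, not part of NC Bench itself. It assumes that generation_time and latency are in milliseconds (which is consistent with the reported duration of 2m 33s) and that native_tokens_reasoning is counted inside native_tokens_completion. The function and variable names are hypothetical.

```python
# Values copied from the response metadata above.
native_tokens_completion = 21614
native_tokens_reasoning = 21340
generation_time_ms = 152798  # assumption: milliseconds
latency_ms = 421             # assumption: milliseconds


def reasoning_share(reasoning_tokens: int, completion_tokens: int) -> float:
    """Fraction of completion tokens spent on reasoning."""
    return reasoning_tokens / completion_tokens


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Average generation throughput over the elapsed time."""
    return tokens / (elapsed_ms / 1000)


# Assumption: reasoning tokens are a subset of the completion total,
# so the remainder approximates the tokens of visible output.
visible_tokens = native_tokens_completion - native_tokens_reasoning

share = reasoning_share(native_tokens_reasoning, native_tokens_completion)
throughput = tokens_per_second(native_tokens_completion, generation_time_ms)

# Wall-clock estimate: time to first token plus generation time.
total_seconds = (latency_ms + generation_time_ms) / 1000
minutes, seconds = divmod(round(total_seconds), 60)

print(f"Visible tokens: {visible_tokens}")
print(f"Reasoning share: {share:.2%}")
print(f"Throughput: {throughput:.1f} tokens/s")
print(f"Duration: {minutes}m {seconds}s")
```

Under these assumptions, nearly all of the completion budget went to reasoning rather than to the 200-word scene, which would explain why a short response took over two minutes to generate.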
Evaluation details
Result | Evaluator | Details | Meta Data
100.00% | Matches word count | n/a | neededClean: false, words: 199
99.95% | Dialogue to Total Word Ratio | Ratio: 11.50%, Deviation: 1.50% | neededClean: false, wordsTotal: 200, wordsDialogue: 23
Overall: 99.9742%
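The ratio and deviation reported by the dialogue evaluator follow directly from its two word counts. The sketch below reproduces that arithmetic under the assumption that the target is the 10% stated in the scenario. The function names are hypothetical, and the way the evaluator turns the deviation into the 99.95% score is not shown on this page, so the sketch stops at the deviation.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Share of words that belong to dialogue, as a fraction of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


def deviation_from_target(ratio: float, target: float) -> float:
    """Absolute distance between the observed ratio and the requested one."""
    return abs(ratio - target)


# Counts reported in the evaluator metadata (wordsDialogue, wordsTotal).
ratio = dialogue_ratio(words_dialogue=23, words_total=200)

# Assumption: the target is the "10% dialogue" from the scenario.
deviation = deviation_from_target(ratio, target=0.10)

summary = f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}"
print(summary)  # Ratio: 11.50%, Deviation: 1.50%
```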