NC Bench
Run: 2026-02-16-01KHK2VV6QG2C1NZ0CQBWB5X5V
Run Details
Model: Qwen 3.5 Plus (2026-02-15)
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 13.0s
Cost: $0.000665
Provider: Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her purple sequined mask, which looked less like professional gear and more like a discarded disco ball fragment. She crouched behind a giant inflatable T-Rex, peering through the plastic scales at the fortress of wrapping paper stacked near the bounce house. Bill shivered violently in his oversized clown suit, the red nose bobbing with every nervous twitch as he clutched a burlap sack decorated with sad faces. The air smelled overwhelmingly of sugar frosting and despair. Molly signaled silently, pointing toward the golden throne where the birthday boy sat, surrounded by a moat of果汁 boxes. Bill nodded, his face pale beneath the white greasepaint, and mimed opening a lock with trembling fingers. They moved forward, their sneakers squeaking loudly on the polished gym floor, sounding exactly like rubber ducks being strangled. A sudden gust from the ventilation system sent a cascade of glitter raining down upon them, making them sparkle menacingly in the dim light. Molly froze, eyes wide, as a single balloon popped somewhere in the distance with the sound of a gunshot. Bill dropped to his knees, pretending to tie a shoe that was already tied, while sweating profusely through the thick polyester fabric. They held their breath, waiting for the inevitable arrival of angry parents or perhaps a very confused mascot.
created_at: "2026-02-16T11:20:25.564974+00:00" (2/16/2026, 11:20:25 AM)
model: "qwen/qwen3.5-plus-20260216"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1094
moderation_latency: (null)
generation_time: 12930
tokens_prompt: 77
tokens_completion: 338
native_tokens_prompt: 78
native_tokens_completion: 264
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0006648
router: (null)
provider_responses
0
id: "chatcmpl-39cb99e8-f6af-9ed7-8c65-bb13874fac40"
status: 200
is_byok: false
latency: 1094
endpoint_id: "71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name: "Alibaba"
model_permaslug: "qwen/qwen3.5-plus-20260216"
api_type: "completions"
id: "gen-1771240811-JfTXNwOX3x5FFCIJ0ovn"
upstream_id: "chatcmpl-39cb99e8-f6af-9ed7-8c65-bb13874fac40"
total_cost: 0.0006648
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
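The timing, token, and cost fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that `latency` and `generation_time` are in milliseconds (consistent with the 13.0s duration shown in the run details) and that `total_cost` is in US dollars. The per-million-token rates are hypothetical: they are not stated anywhere on this page and were chosen only because they happen to reproduce the reported total from the native token counts.

```python
# Values copied from the run metadata above.
meta = {
    "latency": 1094,                  # assumed milliseconds
    "generation_time": 12930,         # assumed milliseconds
    "native_tokens_prompt": 78,
    "native_tokens_completion": 264,
    "total_cost": 0.0006648,          # assumed US dollars
}

# Hypothetical rates in USD per million tokens (not from the source page).
ASSUMED_INPUT_RATE = 0.40
ASSUMED_OUTPUT_RATE = 2.40


def completion_tokens_per_second(m):
    """Native completion tokens divided by generation time in seconds."""
    return m["native_tokens_completion"] / (m["generation_time"] / 1000.0)


def estimated_cost(m, input_rate=ASSUMED_INPUT_RATE, output_rate=ASSUMED_OUTPUT_RATE):
    """Cost implied by the assumed rates and the native token counts."""
    prompt_cost = m["native_tokens_prompt"] * input_rate
    completion_cost = m["native_tokens_completion"] * output_rate
    return (prompt_cost + completion_cost) / 1_000_000


print(round(completion_tokens_per_second(meta), 1))  # 20.4
print(abs(estimated_cost(meta) - meta["total_cost"]) < 1e-9)
```

If the assumed rates are wrong, only the cost check changes; the throughput figure depends solely on the reported token count and generation time.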
Evaluation details
Result | Evaluator | Details | Meta Data
51.9255% | Matches word count | n/a | neededClean: false; words: 216
36.7879% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 10.00% | neededClean: false; wordsTotal: 216; wordsDialogue: 0
44.3567%
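The figures in the evaluation details can be reproduced approximately as follows. This is a minimal sketch, not the evaluator's actual code: the helper names are hypothetical, words are counted by whitespace splitting, and dialogue is taken to be any text inside double quotation marks. The function that maps a deviation to a percentage score is not shown on this page, so the sketch does not attempt it. It checks only the reported ratio and deviation, and that the final figure matches the mean of the two evaluator results.

```python
import re


def count_words(text):
    """Count whitespace-separated words (an assumed tokenization)."""
    return len(text.split())


def dialogue_words(text):
    """Count words inside straight or curly double quotes."""
    spans = re.findall(r'"([^"]*)"|“([^”]*)”', text)
    return sum(len((straight or curly).split()) for straight, curly in spans)


def dialogue_deviation(total, dialogue, target=0.10):
    """Return (ratio, absolute deviation from the target ratio)."""
    ratio = dialogue / total if total else 0.0
    return ratio, abs(ratio - target)


# Counts taken from the Meta Data column: wordsTotal 216, wordsDialogue 0.
ratio, deviation = dialogue_deviation(216, 0)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")  # Ratio: 0.00%, Deviation: 10.00%

# The final figure is consistent with the mean of the two evaluator results.
scores = [51.9255, 36.7879]
print(round(sum(scores) / len(scores), 4))  # 44.3567
```

The response above contains no quoted speech at all, which is why the dialogue count is 0 and the deviation equals the full 10% target.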