NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK1ZHXSDCKZWQPP4SGCWBEY
Run Details
Model:
Qwen 3.5 Plus (2026-02-15)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
12.7s
Cost:
$0.000792
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her tiny plastic tiara, glaring at the colorful streamers. "This is the worst target we have ever picked, Bill." Bill shrugged, stuffing a handful of balloon animals into his oversized clown pocket. "Come on, Molls. Think of the loot. Those shiny tablets, the Lego sets, the cash envelopes taped to cards. It is a goldmine." "It is a room full of seven-year-olds with sticky fingers and zero fear," she hissed, peeking through the window. "One screams and the whole gymnasium descends on us." "Details, details," Bill waved a gloved hand dismissively. "We go in during the magic show. Everyone watches the rabbit; nobody watches the clowns stealing the mountain of wrapped boxes." "The magician is a guy named Sparkles who looks like he could bench press a minivan," Molly argued, tightening her grip on her crowbar. "And I am not fighting a toddler for a toy train." "You will not have to fight anyone," Bill promised, checking his watch. "We grab the big pile by the tree, slip out the back, and vanish before cake time. Easy money." "If I get glitter in my hair, I am quitting the crew," she warned, stepping toward the side door. "Deal," Bill grinned, pulling a fake nose from his pocket. "Now, let us go make some bad children very sad." Molly sighed deeply, pushing open the creaky exit. "Lead the way, you idiot."
created_at
"2026-02-16T11:04:57.572409+00:00"
2/16/2026, 11:04:57 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
773
moderation_latency
(null)
generation_time
12621
tokens_prompt
77
tokens_completion
348
native_tokens_prompt
78
native_tokens_completion
317
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000792
router
(null)
provider_responses
0
id
"chatcmpl-fa36c094-d3a4-9650-8d49-40566c4fbd02"
status
200
is_byok
false
latency
773
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771239884-tQPbY9q3coA7AEvOMnus"
upstream_id
"chatcmpl-fa36c094-d3a4-9650-8d49-40566c4fbd02"
total_cost
0.000792
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0002%
Matches word count
n/a
neededClean
false
words
234
0.0004%
Dialogue to Total Word Ratio
Ratio: 68.80%, Deviation: 18.80%
neededClean
false
wordsTotal
234
wordsDialogue
161
0.0003%