NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK2X7YB2R4PTBD4QVFV9DYW
Run Details
Model:
Qwen 3.5 Plus (2026-02-15)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
13.0s
Cost:
$0.000749
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the tin roof of their getaway van as Molly adjusted her fake mustache. "Why a birthday party, Bill? Really?" she whispered, eyeing the colorful bunting across the street. Bill shrugged, checking his lock-picking set with serious devotion. "It is the perfect crime, Molls. Zero security. Just clowns and exhausted parents drinking cheap punch." Molly sighed, tapping her fingers on the steering wheel. "We are stealing from toddlers. That is a new low even for us." Bill grinned, his eyes gleaming with misguided ambition. "Think of the loot! LEGO sets, video games, those huge plushies nobody ever throws away." He leaned closer, voice dropping to a conspiratorial whisper. "Plus, the distraction is built right in. Sugar crashes happen like clockwork at four PM." Molly shook her head but started the engine anyway. "If we get caught by a six-year-old, I am leaving you behind." Bill laughed, opening the van door into the downpour. "Just remember, no crying children. We are ghosts, not monsters." They crept toward the bouncing castle, silhouettes against the festive lights. "Wait," Molly hissed, freezing mid-step. "Is that a magician?" Bill paused, watching a man in a sequined hat pull a rabbit from thin air. "Perfect," he muttered. "Now he will be busy sawing ladies in half while we clean out the gift table."
created_at
"2026-02-16T11:21:10.782102+00:00"
2/16/2026, 11:21:10 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
778
moderation_latency
(null)
generation_time
12911
tokens_prompt
77
tokens_completion
337
native_tokens_prompt
78
native_tokens_completion
299
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0007488
router
(null)
provider_responses
0
id
"chatcmpl-4a483e20-31e5-934a-8acb-f7ac63e128f6"
status
200
is_byok
false
latency
778
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771240857-ehXau83wX94YT8xgEWGQ"
upstream_id
"chatcmpl-4a483e20-31e5-934a-8acb-f7ac63e128f6"
total_cost
0.0007488
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
3.6234%
Matches word count
n/a
neededClean
false
words
224
99.9996%
Dialogue to Total Word Ratio
Ratio: 49.55%, Deviation: 0.45%
neededClean
false
wordsTotal
224
wordsDialogue
111
51.8115%