NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBJX1AWSZN6QAVE764DH37B
Run Details
Model:
Qwen 3.5 Flash
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
1m 23s
Cost:
$0.006230
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
77.39%
Dialogue to Total Word Ratio
(details)
86.6878%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Okay," Molly whispered, "the balloons are definitely red today." "Red definitely means danger," Bill asked. "Red means party." "You got stuck in a piƱata last week." "That was tactical entry into the very house." "You were eating candy out of a clear plastic bag." "We go in through the front door silently." "Is the dog very asleep right now?" "The dog is a pitbull named Sparky." "Sparky loves squeaky toys terribly much." "We bring one specifically for him." "We're stealing presents from children tonight." "We're saving them from wasted plastic." "That is so weak for a thief." "Look, the cake is at ten o'clock now." "Toys scattered everywhere like shiny fish in water." "Can we fit a sack that big?" "We bring the industrial vacuum cleaner for trash." "The room is full of people shouting loudly." "Kids love surprises, but we are the greatest surprise." "Mom is watching the door." "Mom's tipsy on wine and punch." "Mom's drunk on punch entirely right now." "Move when the singing starts immediately." "Happy Birthday to who? Exactly now." "Nobody cares about the birthday boy right now." "Bill, this is literally kidnapping the joy." "It's total redistribution of wealth, Molly." "You're a total criminal clown, absolutely." "A really wealthy clown," Bill grinned.
created_at
"2026-02-25T23:42:14.37Z"
2/25/2026, 11:42:14 PM
model
"qwen/qwen3.5-flash-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
401
moderation_latency
(null)
generation_time
82649
tokens_prompt
77
tokens_completion
11123
native_tokens_prompt
76
native_tokens_completion
15557
native_tokens_completion_images
(null)
native_tokens_reasoning
15258
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0062304
router
(null)
provider_responses
0
endpoint_id
"5f6b71b5-8672-4b27-8b2d-f674429f9e63"
id
"chatcmpl-77b134bc-aa77-9277-96f2-fbc315b3f8a4"
is_byok
false
latency
401
model_permaslug
"qwen/qwen3.5-flash-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772062934-gItcSLoh05xqw0UWuu5r"
upstream_id
"chatcmpl-77b134bc-aa77-9277-96f2-fbc315b3f8a4"
total_cost
0.0062304
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
208
77.39%
Dialogue to Total Word Ratio
Ratio: 97.12%, Deviation: 7.12%
neededClean
false
wordsTotal
208
wordsDialogue
202
86.6878%