NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ6Z88N8812E69YX5AC50BWS
Run Details
Model:
Qwen 3.5 Plus (2026-04-20)
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
1m 10s
Cost:
$0.011054
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly: We need a plan. The target is Timmy's seventh birthday at the community center. Loot includes three hundred dollars worth of Legos, a Switch cartridge, and that cursed limited-edition robot dog that screams when it detects movement. Bill: A children's party? Really? Their security consists of a cardboard sword and a naptime enforcer named Susan. Molly: Don't mock Susan, Bill. She wields a juice box with lethal accuracy and knows our MO. The real threat is the perimeter defense. Bill: The bouncy castle? Molly: Bubbles the Bouncer. He patrols on a unicycle and carries a compressed air seltzer bottle like a sidearm. One hit to the eyes and you're blind until dessert. We need disguises. Bill: I packed the glitter cannons and a spare fake mustache. Molly: Excellent. You infiltrate as the hired entertainment, distract the horde with aggressive balloon animals while I breach the gift table. We bag the valuables, exit through the bouncy castle chute, and fence the haul before the piƱata swing. Bill: What about the cake? Molly: No cake, Bill. We're professionals. Bill: Triple fudge with rainbow sprinkles and ganache. Molly: Plan B. You secure the presents; I'm neutralizing the confectionery. If we get caught, we claim we're method actors filming a documentary. Bill: I look great in face paint. Just make sure the robot dog screams at the clown. Molly: The dog only reacts to red. Wear blue. Bill: I have blue overalls, but they pinch. Molly: Suffer the pinch, Bill. Lives are on the line. Our careers. Bill: Got it. Blue trousers, glitter, and a heart full of theft. Molly: And a bag big enough for the ganache. We roll in ten.
created_at
"2026-04-27T08:00:22.192Z"
4/27/2026, 8:00:22 AM
model
"qwen/qwen3.5-plus-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
680
moderation_latency
(null)
generation_time
69551
tokens_prompt
77
tokens_completion
4856
native_tokens_prompt
76
native_tokens_completion
4593
native_tokens_completion_images
(null)
native_tokens_reasoning
4203
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0110536
router
(null)
provider_responses
0
endpoint_id
"8b4a0fbb-6705-4f8d-a7f1-ad18ecc0edbc"
id
"chatcmpl-c9880349-10e8-9b04-a062-853cafaa2368"
is_byok
false
latency
678
model_permaslug
"qwen/qwen3.5-plus-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777276822-GHYp5QsroF9rzM0iTpBX"
session_id
(null)
api_type
"completions"
id
"gen-1777276822-yFdHZvmv2vX1ll3M4WKn"
upstream_id
"chatcmpl-c9880349-10e8-9b04-a062-853cafaa2368"
total_cost
0.0110536
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
279
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 90.00%
neededClean
false
wordsTotal
280
wordsDialogue
0
0.0000%