NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBG3CKW5YQ1ZVPBE3ZNTWC8
Run Details
Model:
Qwen 3.5 122B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
57.7s
Cost:
$0.028270
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.96%
Dialogue to Total Word Ratio
(details)
0.4823%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped the blueprints spread across the table. The room was dark. "The security system is a joke. It's a motion sensor light." "A light?" Bill chewed a pen cap. "That's it? High-end operation." "It turns on when you move. We move slow." "Slow? I'm a sprinter. Built for speed." "You're a liability. The kids are the threat." "Kids? They're five. Barely tie shoes." "Sticky fingers. Zero moral compass. Plus, parents." "Parents. Armed?" "Wine. Lots of it. Passed out by the piƱata." Bill nodded. "Target?" "Everything. Legos, games, plushies." "Plushies? They take space." "Don't ask. We bag them. No questions." "Cake?" "Leave it. Messy evidence. Frosting stains." "I love cake. Cake is life." "You love prison. Focus." "Extraction point?" "Back alley. Garbage bins. Hide loot." "Smells terrible. I hate garbage." "Tax write-off. Distract the DJ." "I can't distract a DJ." "Cut the music." "I don't know how." "Pull the plug. Simple." "Okay. You?" "Tree. Under the staircase." "Loaded?" "Heavily. PS5 in red foil. Maybe two." "Sweet. I want the PS5." "No. You get Legos. I need electronics." "Why? Legos are gold." "Resale. You sell Legos for pennies." "I sell to kids." "We sell to pawn shops. Cleaner." Bill frowned. He looked nervous. "Feels wrong." "It's a heist. Not charity. Professionals." "Kids cry." "Targets. In or out? Schedule." Molly stared hard, her eyes narrowing in the dim light. Her voice was low. Bill looked at blueprints. "In. Slice of cake. Big slice." "Crumb. Deal." "Deal. Work with a crumb." "Bags. Move at four." "Four? Party peaks. Loud." "Chaos is friend. Noise masks noise." "Chaos is my middle name." "Middle name is Gary." "Let's go." They stood up slowly, the chairs scraping against the floor. Molly grabbed coat. Bill grabbed bags. "Dog?" "What dog?" "Neighbor's dog. Barks." "Chihuahua. Barks at everything." "Even us?" "Especially us. Ignore it." "Ignore dog. Ignore parents." "Ignore guilt." "Don't say that word." "Sorry. Ignore noise. Silence golden." "Silence expensive." "Let's go." They walked out. The night air was cold. "Lock picks?" "In pocket." "Don't drop them." "Won't." "Bill?" "Yeah?" "Caught. You take fall." "Why me?" "You look guilty. Always guilty." "I look innocent! Saint." "You look like you stole something." "I stole your pen." "Exactly. Move." They moved into the shadows of the alleyway, blending with the darkness.
created_at
"2026-02-25T22:53:16.801Z"
2/25/2026, 10:53:16 PM
model
"qwen/qwen3.5-122b-a10b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
420
moderation_latency
(null)
generation_time
57618
tokens_prompt
77
tokens_completion
8294
native_tokens_prompt
76
native_tokens_completion
8825
native_tokens_completion_images
(null)
native_tokens_reasoning
8103
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0282704
router
(null)
provider_responses
0
endpoint_id
"33d466d1-d2f7-45ed-8ff3-fda7d2e32351"
id
"chatcmpl-345b8217-8907-9448-9848-ca308281818e"
is_byok
false
latency
417
model_permaslug
"qwen/qwen3.5-122b-a10b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772059996-2YOiHsfavk1piTRxorhZ"
upstream_id
"chatcmpl-345b8217-8907-9448-9848-ca308281818e"
total_cost
0.0282704
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
381
0.96%
Dialogue to Total Word Ratio
Ratio: 79.63%, Deviation: 9.63%
neededClean
false
wordsTotal
383
wordsDialogue
305
0.4823%