NC Bench
Run: 2026-04-27-01KQ72RCQ9A9B9R8KV9TT9JWFP
Run Details
Model:
Qwen 3.6 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 26s
Cost:
$0.036658
Provider:
Alibaba
Evaluation summary
Score | Evaluator
0.00% | Matches word count
92.75% | Dialogue to Total Word Ratio
46.3760% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly smoothed the napkin-blueprint over the greasy table, her nails clicking a rhythm of greed against the waxed paper. The single bulb flickered, casting long shadows over the scattered tools of their impending misfortune. "The target is Timmy Turner's bash. Security includes Drool the Retriever and a mom wielding a spatula like a short-sword." Bill winced as the mustache glue pulled a stray eyebrow hair, revealing a scar from a misguided encounter with a hedge trimmer. "Drool is a variable I can mitigate with squeaky toys. As for the spatula matriarch, I suggest the glitter bomb. Glitter distracts the suburban psyche." Molly smashed her palm onto the grease-stained trampoline drawing, nearly upending a cup of cold coffee. "Glitter is a nightmare. We're hitting the gift table, not the walls. We're talking LEGO sets, electric toothbrushes, and the holy grail: the Nintendo Switch. We go in during the piñata distraction. Bat cracks the donkey, chaos ensues, we grab the loot and vanish out the dog flap." Bill rubbed his aching back, the vertebrae popping like bubble wrap. "The dog flap? Molly, my waist is two sizes larger than the dog's. I tried that in Jersey and nearly left a testicle. We pose as the entertainment. I heard they hired a fire breather." Molly tossed a velvet bag onto the table with a heavy clink. "You can't breathe fire. You breathed ketchup last week and choked on a crouton. We stick to the Relatives from Out of Town ruse. You wear the wig, I carry the sacks labeled Party Favor Return Policy. Timmy opens the gifts, we swap them for hollowed-out cardboard boxes. Classic switcheroo." "And what happens when Timmy opens the box for the PlayStation and finds a heartfelt note from Santa explaining tax evasion?" Bill chewed slowly, savoring the hypothetical disappointment of a confused child. "He won't. He'll find a coupon for twenty percent off at Toys R Us. Kids love coupons. It's what they dream of."
Molly kicked the table leg, earning a stare from the window raccoon. "This is the dumbest job we've ever done. Last month was a jewelry store. This? We're robbing a toddler's dopamine factory. Why are we like this?" Molly's eyes lit up with greedy calculation. "It's wealth redistribution. Nobody gets this many gifts and keeps them all. Besides, the payout on a mint-condition Switch to a shady collector is triple retail. Plus, there's cake." "You're bribing me with cake?" She slid a photo of a sequined unicorn costume across the table. "I'm bribing you with cake and the promise that if we succeed, you don't wear the unicorn costume from the clearance bin." Bill dropped his burrito. "You wouldn't." Molly uncapped the glitter glue, oozing it toward his knee. "Try me." "Fine," Bill sighed, snatching the mustache. "But if Drool marks the sneakers, I'm leaving you to the spatula lady." "Deal. Smile. We arrive in ten, and I need you to look less like a man who eats paste and more like Uncle Barry from Ohio." Bill wiped sauce from his chin with a trembling hand. "Ohio?" "It adds credibility. Eat your burrito. We have a party to crash and a lifetime of joy to steal."
created_at
"2026-04-27T09:01:36.375Z"
4/27/2026, 9:01:36 AM
model
"qwen/qwen3.6-27b-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1293
moderation_latency
(null)
generation_time
145744
tokens_prompt
77
tokens_completion
9520
native_tokens_prompt
76
native_tokens_completion
10170
native_tokens_completion_images
(null)
native_tokens_reasoning
9418
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0366576
router
(null)
provider_responses
0
endpoint_id
"82097366-3a73-4790-ac36-f96ba9f6f577"
id
"chatcmpl-817304e0-53c8-9ffb-a5ce-ab942461c223"
is_byok
false
latency
1293
model_permaslug
"qwen/qwen3.6-27b-20260422"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777280496-TfWigVoqYA0utLGcUXPM"
session_id
(null)
api_type
"completions"
id
"gen-1777280496-HsJ5ck83HjWcM1SxMHKp"
upstream_id
"chatcmpl-817304e0-53c8-9ffb-a5ce-ab942461c223"
total_cost
0.0366576
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false; words: 533
92.75% | Dialogue to Total Word Ratio | Ratio: 68.77%, Deviation: 1.23% | neededClean: false; wordsTotal: 538; wordsDialogue: 370
46.3760% | Overall
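The figures in the Dialogue to Total Word Ratio row can be checked by hand. The sketch below recomputes the ratio and deviation from the reported wordsDialogue and wordsTotal counts, assuming the deviation is the absolute distance from the 70% target named in the scenario. It also checks that the Overall figure is close to an unweighted mean of the two evaluator scores. That mean is an inference from the numbers shown, not something the page documents, and the function name `dialogue_ratio` is hypothetical.

```python
from statistics import mean


def dialogue_ratio(words_dialogue: int, words_total: int, target: float = 0.70):
    """Return (ratio, deviation) as fractions.

    Assumption: deviation is the absolute gap between the measured
    ratio and the target ratio from the scenario (70%).
    """
    ratio = words_dialogue / words_total
    deviation = abs(target - ratio)
    return ratio, deviation


# Counts reported in the evaluation metadata.
ratio, deviation = dialogue_ratio(words_dialogue=370, words_total=538)
summary = f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}"
print(summary)  # Ratio: 68.77%, Deviation: 1.23%

# Assumption: Overall is an unweighted mean of the evaluator scores.
# The displayed scores are rounded, so the result only approximates 46.3760.
overall = mean([0.00, 92.75])
print(f"Overall (approx.): {overall:.3f}%")
```

How the 92.75% score itself is derived from the 1.23% deviation is not stated on the page, so it is not reproduced here.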