NC Bench
Run: 2026-04-27-01KQ73K8PVT6Y1QYSZAZH4SN6Q
Run Details
Model: Qwen 3.6 27B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 2m 23s
Cost: $0.031610
Provider: Alibaba
Evaluation summary
Evaluator: Matches word count
Score: 0.03%

Evaluator: Dialogue to Total Word Ratio
Score: 58.45%

Overall: 29.2424%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her crooked clown wig, which kept slipping over her left eye. "Bill, please tell me honestly the grappling hook isn't attached to a piñata." Bill didn't look up from his cracked tablet, which was covered in sticky fingers and fruit snacks. "It's an adaptive grip system, Molls. Plus, the piñata adds camouflage. Kids love piñatas. If we swing down holding one, they'll think we're early party animators." Molly unrolled a napkin sketch that smelled of pepperoni grease. The Henderson backyard was a fortress of inflatable tunnels and a sandpit rigged with sprinklers. "We're stealing a LEGO Star Wars Ultimate Millennium Falcon, Bill. Not winning a trust fund." Bill finally looked up, his grin as wide as a birthday cake. He patted his chest, causing a hidden compartment to rattle suspiciously. "Exactly! The Falcon is the jewel of the Henderson bash. Do you know what seven-year-old Timmy Henderson did for charity last year? He walked three miles on his hands. That kid has focus. His parents spared no expense." She tugged at her tight collar. The spandex itched with the promise of failure. It was an outfit that screamed incompetence, which Bill apparently considered the height of security bypass. "The party starts in ten minutes. If we're seen wearing spandex with cupcake patterns, we're done for." Bill tapped his helmet, which was shaped like a cupcake. He pulled a checklist written in crayon from his pocket and squinted at it. "Wiggle room, Molls. Cupcakes hide the gear. Besides, I practiced my 'Gotcha!' sound effect. It's terrifying." Outside, a mariachi band was tuning their saxophones, and the scent of frosting wafted through the cracks in the van doors. Molly kicked the tires of their van, which was painted pink and plastered with stickers reading 'Tots & Trucks Moving Services.' "You sound like a very damp sponge." "I like to keep them guessing. Now, grab the foam swords. We might need to parry a juice box attack." 
Molly groaned loudly, pulling a grappling line from a tube labeled 'Juice Puffs.' The line snapped back and hit her in the forehead with a soft thwump. "This is the absolute stupidest plan we've ever executed." Bill adjusted his goggles, smearing jelly across the lenses. "And yet, here we are, aiming for the glitter and the gold. Ready to make some memories? And by memories, I mean an empty party favor bin?" "Just don't trip over the cake, Bill." "I never trip. I stumble strategically. Now, let's bounce into action." Molly grabbed the heavy door, pointing at the house through her distorted vision. "They're opening the gates! Move, or we're serving finger sandwiches for free!" Bill scrambled for the door, knocking over a hollowed-out drum set. The Falcon box was wedged inside, labeled 'Return to Sender.' "Wait for my glitter bomb signal!" "Go! Quickly!"
created_at: "2026-04-27T09:16:16.994Z" (4/27/2026, 9:16:16 AM)
model: "qwen/qwen3.6-27b-20260422"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 7067
moderation_latency: (null)
generation_time: 143132
tokens_prompt: 77
tokens_completion: 8749
native_tokens_prompt: 76
native_tokens_completion: 8768
native_tokens_completion_images: (null)
native_tokens_reasoning: 8105
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0316104
router: (null)
provider_responses:
  0:
    endpoint_id: "82097366-3a73-4790-ac36-f96ba9f6f577"
    id: "chatcmpl-8385258e-417f-9e8e-b67b-11df0b1324ec"
    is_byok: false
    latency: 7067
    model_permaslug: "qwen/qwen3.6-27b-20260422"
    provider_name: "Alibaba"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777281376-s6hmVMazPuRiKfGY3XuB"
session_id: (null)
api_type: "completions"
id: "gen-1777281376-vHt3xHK4hYeKgzDun8Su"
upstream_id: "chatcmpl-8385258e-417f-9e8e-b67b-11df0b1324ec"
total_cost: 0.0316104
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
response_cache_source_id: (null)
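The metadata above can be cross-checked against the run details with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the page does not state: that generation_time is in milliseconds and is the source of the displayed Duration, and that native_tokens_reasoning is counted inside native_tokens_completion. The function names are hypothetical.

```python
def format_duration(generation_time_ms: int) -> str:
    """Render a millisecond count as 'Xm Ys', dropping the fractional second."""
    minutes, seconds = divmod(generation_time_ms // 1000, 60)
    return f"{minutes}m {seconds}s"


def reasoning_breakdown(completion_tokens: int, reasoning_tokens: int) -> tuple[int, float]:
    """Return (non-reasoning tokens, reasoning share of completion tokens).

    Assumes reasoning tokens are a subset of completion tokens.
    """
    visible = completion_tokens - reasoning_tokens
    share = reasoning_tokens / completion_tokens
    return visible, share


# Values copied from the metadata above.
duration = format_duration(143132)
visible, share = reasoning_breakdown(8768, 8105)

print(duration)                      # 2m 23s
print(visible, f"{share:.1%}")       # 663 92.4%
```

Under those assumptions, the 143132 ms generation time agrees with the reported 2m 23s, and roughly 663 of the 8768 native completion tokens were spent on the visible scene rather than on reasoning.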
Evaluation details
Evaluator: Matches word count
Result: 0.03%
Details: n/a
Meta Data:
  neededClean: false
  words: 470

Evaluator: Dialogue to Total Word Ratio
Result: 58.45%
Details: Ratio: 53.28%, Deviation: 3.28%
Meta Data:
  neededClean: false
  wordsTotal: 473
  wordsDialogue: 252

Overall: 29.2424%
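The Ratio and Deviation figures reported for the dialogue evaluator follow directly from wordsDialogue and wordsTotal. The sketch below reproduces them, assuming the target is the 50% named in the scenario and that deviation is the absolute difference from that target. The function name is hypothetical, and the mapping from deviation to the 58.45% score is not documented on this page, so it is not modelled here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int, target: float = 0.5) -> tuple[float, float]:
    """Return (dialogue share of all words, absolute deviation from target)."""
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target)
    return ratio, deviation


# Values copied from the evaluator's Meta Data above.
ratio, deviation = dialogue_ratio(words_dialogue=252, words_total=473)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 53.28%, Deviation: 3.28%
```

Note that this evaluator counts 473 total words while the word-count evaluator reports 470. The page does not explain the difference, so the sketch uses the figures from the dialogue evaluator only.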