NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ727MMJH0Q8JCTS58HDERZ6
Run Details
Model:
Qwen 3.6 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
2m 18s
Cost:
$0.030401
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
99.92%
Matches word count
(details)
83.66%
Dialogue to Total Word Ratio
(details)
91.7919%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted night-vision goggles stuck on sunset mode and scanned the backyard. A bouncy castle pulsed like a mechanical lung near the patio. She checked her taser, found it set to tickle, and fumbled for a cartridge. "Perimeter check, Mol. What's the lay of the land?" Bill whispered, clutching a grappling hook secured with aggressive amounts of duct tape. Molly leaned against the splintered fence, her tactical vest creaking. Through the lens, a clown juggled pies with malicious precision, his ruffled collar twitching in the breeze. "Disaster zone. We've got a clown. The sewer kind with a pie-launcher. He'll blind us from three blocks. Check your goggles. If you trip, you take the presents out." Bill swallowed hard, his neck brace squeaking as he reached for a flask labeled Eye Drops and took a nervous gulp. "Pie-launcher? Nasty. Does he have the keycard?" Molly tapped her temple, smoothing the frayed edge of her disguise wig. "Forget the door. The slide exits three feet from the loot table. We use the slide. We body-surf straight to the prize table. No touching the plastic." She pointed toward a plastic chute where a toddler was wedged. "The slide. I'm in, but if we have to do the shimmy, I'm out. My knees hate me, and my dignity hates me more." Bill jingled against a belt heavy with bent spoons and a cheese grater, shifting his weight on the uneven grass. "No shimmies. Just body-surfing. We're professionals. Look at the haul. Drones, limited figures, enough Lego to cause structural damage. That red box is a Switch. The fence will pay cash for that instantly. That's rent money and steak for a week." Bill's stoic face broke into a grin, his gold tooth flashing under the moonlight as he adjusted his elf costume, which strained against his torso. "Lego means wealth. What about the parents? Are they armed? I saw a barbecue fork earlier." Molly glanced at the house. Silhouettes moved behind blinds, holding iced lattes and radiating the sheer exhaustion of suburban life. A piñata dragon swayed from a tree. "They're armed with caffeine and anxiety. Their focus is the piñata. When that dragon falls, the candy rush creates a thirty-second window of pure chaos. The parents won't react fast enough. Panic moves slower than sugar. We'll be a blur in elf green." Bill nodded, testing the seal on a garbage bag that reeked of fermented hot sauce. "Thirty seconds. Tight. Extraction is vital, Mol." Molly stepped aside, revealing a sanitation truck where the driver idled with the bin lid open, chewing a sandwich. "The truck. We slide, grab, and roll. But Bill, leave the clown wig in the van. You looked like a peacock losing a war. And Bill, watch the sprinklers. I am not leaving you for the clown. Do you hear me?" Bill blushed beneath his elf hat, tugging the tight fabric from his sweaty neck. "Copy. Elf suit. I promise no sprinklers. Let's make some kids cry."
created_at
"2026-04-27T08:52:27.418Z"
4/27/2026, 8:52:27 AM
model
"qwen/qwen3.6-27b-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
644
moderation_latency
(null)
generation_time
137902
tokens_prompt
77
tokens_completion
8134
native_tokens_prompt
76
native_tokens_completion
8432
native_tokens_completion_images
(null)
native_tokens_reasoning
7730
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0304008
router
(null)
provider_responses
0
endpoint_id
"82097366-3a73-4790-ac36-f96ba9f6f577"
id
"chatcmpl-6d578509-1373-9275-bb83-599e2cc93f9c"
is_byok
false
latency
644
model_permaslug
"qwen/qwen3.6-27b-20260422"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777279947-skcaXfBIAvkNyEmaSqbk"
session_id
(null)
api_type
"completions"
id
"gen-1777279947-Y2yurGaNIWRK2EgD57ha"
upstream_id
"chatcmpl-6d578509-1373-9275-bb83-599e2cc93f9c"
total_cost
0.0304008
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.92%
Matches word count
n/a
neededClean
false
words
497
83.66%
Dialogue to Total Word Ratio
Ratio: 51.89%, Deviation: 1.89%
neededClean
false
wordsTotal
503
wordsDialogue
261
91.7919%