NC Bench
Run: 2026-04-27-01KQ73BCGZMM1V81EZME0VVCFB
Run Details
Model: Qwen 3.6 27B
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 2m 16s
Cost: $0.030116
Provider: Alibaba
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
The stolen delivery van smelled of burnt popcorn, industrial floor wax, and the distinct metallic tang of suppressed anxiety. Molly peered through the van blinds at the suburban target, a pastel house strangled by balloon arches. Beside her, Bill vibrated with stress, juggling lockpicks and a bagel, his knuckles white as he tapped a rhythm that threatened to shake the dashboard tablet. A duffel bag on the floor pulsed with the faint blue light of the jamming emitter, its zipper stained with chocolate. The heat waves shimmering off the driveway distorted the view, but Molly's enhanced optics compensated, revealing the security loop cycling every twelve seconds. "Focus on the thermal overlay," Molly murmured, tracking a procession of tiny heat signatures circling an inflatable dinosaur. "We're looking at forty distinct biological threats. Small, fast, high-pitched, and prone to screaming at randomized intervals. The acoustic profile is a nightmare." Bill swallowed bagel. "Small targets, Molls. Quick hit. We grab the loot, vanish before they miss the pony bags. Soft job. They live on a walking path if you dodge sprinklers." Molly held the optics. "Surgical extraction. Check the guardian. Golden retriever dismantling a loafer. That dog has combat discipline." Bill produced a neon silicone duck. "Decoy of Doom. Throw this, dog frenzies, we bypass to the gift table. Physics irrefutable." Molly stared him down, looking like a bomb tech watching someone use nail clippers on C-4. "You can't throw a toy at a hound and bypass the parents. You see the mother? She hasn't slept since 2019 and will end you with a tiara." "Variables," Bill dismissed, nearly launching a pick. "We use the goodie bag chaos. Tiny predators focus on plastic, we strike. Legos are worth three grand to buyers with storage issues and trauma." "We take everything," Molly tapped her tablet. "Especially the atrocities. That hand-knitted sweater is black market gold. Check your rig. 
Clown nose appears, I leave you to Fluffy." Bill panicked, hiding a red foam circle. "Negative nose. Dark ops. Matte black, non-reflective soles. Winch rigged to helium. We ascend, net the gifts, exfiltrate by buoyancy. Reverse Icarus." Molly fogged the glass with a sigh. "Balloon lift is Phase Two. We're pizza delivery. My disguise is absolute. Hat, pepperoni, aura of mediocrity. With me?" Bill checked a paper-plate watch. "Locked and loaded. Let's fracture innocence. For the mission." "For the payout," Molly kicked the door open. "And the Lego. Mostly the Lego."
created_at: "2026-04-27T09:11:58.757Z" (4/27/2026, 9:11:58 AM)
model: "qwen/qwen3.6-27b-20260422"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 655
moderation_latency: (null)
generation_time: 135911
tokens_prompt: 77
tokens_completion: 8907
native_tokens_prompt: 76
native_tokens_completion: 8353
native_tokens_completion_images: (null)
native_tokens_reasoning: 7758
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0301164
router: (null)
provider_responses
0
endpoint_id: "82097366-3a73-4790-ac36-f96ba9f6f577"
id: "chatcmpl-89c0c0a8-0c7a-93c7-bbaf-da49a976036d"
is_byok: false
latency: 655
model_permaslug: "qwen/qwen3.6-27b-20260422"
provider_name: "Alibaba"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777281118-qaIqgzgD8wISuD8jTVcd"
session_id: (null)
api_type: "completions"
id: "gen-1777281118-i2a0w1O8RotPxYC8yoGZ"
upstream_id: "chatcmpl-89c0c0a8-0c7a-93c7-bbaf-da49a976036d"
total_cost: 0.0301164
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
response_cache_source_id: (null)
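The metadata fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the benchmark itself. It assumes that generation_time is reported in milliseconds (which agrees with the 2m 16s duration shown in the run details) and that native_tokens_reasoning is counted inside native_tokens_completion. The variable names are illustrative only.

```python
# Values copied from the metadata listing above.
generation_time_ms = 135_911        # assumed to be milliseconds
native_tokens_completion = 8_353
native_tokens_reasoning = 7_758

# Tokens left for the visible answer, assuming reasoning tokens are
# included in the completion total.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Share of the completion spent on reasoning, and overall throughput.
reasoning_share = native_tokens_reasoning / native_tokens_completion
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

# Convert the generation time to minutes and seconds for comparison
# with the "Duration" field.
minutes, seconds = divmod(round(generation_time_ms / 1000), 60)

print(f"visible tokens: {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
print(f"duration: {minutes}m {seconds}s")
```

Under these assumptions, most of the completion tokens went to reasoning rather than to the scene itself.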
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 405
0.00% | Dialogue to Total Word Ratio | Ratio: 55.26%, Deviation: 25.26% | neededClean: false, wordsTotal: 409, wordsDialogue: 226
0.0000%
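The reported ratio and deviation follow from the word counts in the evaluation details. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. It is an assumption, not the evaluator's actual code: the function names, the whitespace-based word split, and the rule that dialogue is any text between straight double quotes are all hypothetical. How the benchmark turns a deviation into a 0.00% score is not shown in the report and is not modeled here.

```python
import re

TARGET_RATIO = 0.30   # from the scenario: 30% dialogue
TARGET_WORDS = 500    # from the scenario: 500 words


def dialogue_stats(text: str) -> tuple[int, int]:
    """Return (total_words, dialogue_words).

    Hypothetical counting rule: words are whitespace-separated tokens,
    and dialogue is any text between straight double quotes.
    """
    total = len(text.split())
    spoken = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(segment.split()) for segment in spoken)
    return total, dialogue


def ratio_and_deviation(words_total: int, words_dialogue: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return the dialogue ratio and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts taken from the evaluation details above.
ratio, deviation = ratio_and_deviation(words_total=409, words_dialogue=226)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Shortfall of the word-count evaluator's 405 words against the 500 requested.
word_shortfall = (TARGET_WORDS - 405) / TARGET_WORDS
print(f"Word shortfall: {word_shortfall:.0%}")
```

With the counts from the report, the computed values match the "Ratio: 55.26%, Deviation: 25.26%" line, and the response is about 19% short of the requested length.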