NC Bench
Run: 2026-04-27-01KQ74MMDFRFTBJNN01M0BMG08
Run Details
Model: Qwen 3.6 27B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 3m 31s
Cost: $0.051540
Provider: Alibaba
Evaluation summary
Score      Evaluator
99.98%     Matches word count
0.00%      Dialogue to Total Word Ratio
49.9920%   Overall
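The overall figure is consistent with an unweighted mean of the two evaluator scores. That aggregation rule is an inference from the numbers on this page, not something the page states, so the snippet below is only a sanity check using the rounded scores as displayed.

```python
# Evaluator scores as displayed on this page (rounded to two decimals).
scores = {
    "Matches word count": 99.98,
    "Dialogue to Total Word Ratio": 0.00,
}

# Assumption: the overall score is the unweighted mean of the evaluator scores.
overall_estimate = sum(scores.values()) / len(scores)

reported_overall = 49.9920

# A score rounded to two decimals can be off by up to 0.005;
# averaging two scores halves that to 0.0025.
tolerance = 0.0025
consistent = abs(overall_estimate - reported_overall) <= tolerance

print(f"estimated {overall_estimate:.4f}%, reported {reported_overall:.4f}%")
print(f"consistent with an unweighted mean: {consistent}")
```

If the benchmark weights its evaluators differently, the same check can be rerun with a weighted sum in place of the mean.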
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain tapped an impatient rhythm against the warehouse window as Molly spread the blueprints across a crate of expired beans. She adjusted her crooked glasses and tapped a red marker against her chin. The target was a sprawling suburban split level hosting the Henderson birthday extravaganza. Five year old Timmy was expecting plastic dinosaurs, but he was walking into a coordinated extraction of wrapped goods. Molly traced the perimeter fence with a calloused finger and looked up at her partner. We need to bypass the automatic sprinklers before they soak the wrapping paper. Bill leaned against a rusted support beam, polishing a pair of wire cutters on his sleeve. He had spent twenty years perfecting vault cracks, yet here he was mapping a route around a plastic pony playground. You want to risk a trip hazard over a trampoline. Molly rolled her eyes and slid a folded map toward him. She adjusted her gloves. The air smelled damp near the metal shelves. The sprinklers pulse on thirty second intervals, which gives us a perfect window to sprint across the lawn. We slide under the patio door when the mother steps out to grill burgers, then we clear the gift table before the cake arrives. Bill wiped his forehead with a grease stained handkerchief. That implies we are competing against nap time. Children collapse at unpredictable moments. One sneeze and the operation ends before we touch a single box. Molly grabbed a roll of black duct tape and began wrapping it around her wrists. Panic is never a component in my playbooks. We treat the gifts like bonds. We move with surgical precision, and we leave before the first juice box pops open. Bill finally pushed off the beam and stepped toward the crate. He picked up a crumpled grocery list and squinted at it. This operation is fundamentally flawed. Who even buys birthday gifts for their own child. Molly snatched the paper back and smoothed it flat. They are guests, Bill. Cousins, neighbors, the dentist office. 
It is a cultural phenomenon wrapped in polyester and ribbons. You have never attended a weekend gathering. Bill crossed his arms over his faded flannel shirt. I attended exactly one when I was twenty two. A man threw a balloon animal at my head and I had to negotiate hostage terms with a toddler holding a crayon. Molly laughed softly and tapped the blueprint twice. You have survived worse than pastel chaos. We slip in, we bag the loot, and we vanish into the night while they argue over who gets the last stuffed monkey. Bill stared at the rain streaked glass and sighed deeply. Just promise me we leave the cake. I have a strict carbohydrate policy before midnight. Molly already was shoving the blueprints into her leather satchel. Deal. Now grab the vacuum seals and stop looking like a man facing his final exam. Molly zipped the bag shut and checked her watch twice. The moon was rising early above the city skyline.
created_at: "2026-04-27T09:34:30.324Z"
model: "qwen/qwen3.6-27b-20260422"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 787
moderation_latency: (null)
generation_time: 211039
tokens_prompt: 77
tokens_completion: 11106
native_tokens_prompt: 76
native_tokens_completion: 14304
native_tokens_completion_images: (null)
native_tokens_reasoning: 13711
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.05154
router: (null)
provider_responses:
  0:
    endpoint_id: "82097366-3a73-4790-ac36-f96ba9f6f577"
    id: "chatcmpl-25d2a1e5-688a-9db8-87b2-c2f23d4c92b5"
    is_byok: false
    latency: 781
    model_permaslug: "qwen/qwen3.6-27b-20260422"
    provider_name: "Alibaba"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777282470-H1tn9nzdWoGvXP7M0KCy"
session_id: (null)
api_type: "completions"
id: "gen-1777282470-rLtR5nTlLVXLLJEGDArj"
upstream_id: "chatcmpl-25d2a1e5-688a-9db8-87b2-c2f23d4c92b5"
total_cost: 0.05154
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
response_cache_source_id: (null)
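A few of the metadata values are easier to read after some arithmetic. The sketch below assumes that generation_time is in milliseconds (which matches the reported 3m 31s duration) and that native_tokens_reasoning is counted inside native_tokens_completion; neither is stated on this page.

```python
# Values copied from the run metadata.
generation_time_ms = 211039  # assumed to be milliseconds
native_tokens_completion = 14304
native_tokens_reasoning = 13711

# Convert the generation time to whole minutes and seconds.
total_seconds = generation_time_ms // 1000
minutes, seconds = divmod(total_seconds, 60)

# Assumption: reasoning tokens are a subset of the native completion tokens,
# so the remainder is the visible response.
visible_tokens = native_tokens_completion - native_tokens_reasoning
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Average throughput over the whole generation, in native tokens per second.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"duration: {minutes}m {seconds}s")
print(f"visible completion tokens: {visible_tokens}")
print(f"reasoning share of completion: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Under those assumptions, most of the completion budget went to reasoning rather than to the roughly 500-word scene itself.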
Evaluation details
Evaluator: Matches word count
  Result: 99.98%
  Details: n/a
  Meta Data: neededClean: false, words: 502
Evaluator: Dialogue to Total Word Ratio
  Result: 0.00%
  Details: Ratio: 0.00%, Deviation: 50.00%
  Meta Data: neededClean: false, wordsTotal: 502, wordsDialogue: 0
Overall: 49.9920%
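The model response contains no quotation marks, which would explain a dialogue word count of 0 if the evaluator identifies dialogue by quoted spans. The evaluator's actual implementation is not shown on this page; the following is a minimal sketch of that kind of check, with the function name `dialogue_ratio` and the quote-matching rule both being assumptions made for illustration.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a passage.

    Assumption: dialogue is any text between straight or curly double quotes.
    """
    total_words = len(text.split())
    # Non-greedy match of each quoted span, straight or curly quotes.
    quoted_spans = re.findall(r'["“](.*?)["”]', text, flags=re.DOTALL)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


# Hypothetical sample sentences, adapted from the scene for illustration.
quoted = 'Molly looked up. "We need to bypass the sprinklers," she said.'
unquoted = 'Molly looked up. We need to bypass the sprinklers, she said.'

print(dialogue_ratio(quoted))
print(dialogue_ratio(unquoted))
```

The two samples differ only in punctuation, yet the unquoted one registers zero dialogue words. A response can therefore read as dialogue-heavy and still score 0.00% on a check of this kind.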