NC Bench
Run: 2026-04-27-01KQ72W7S41E18WD0NJQ5HK2A5
Run Details
Model: Qwen 3.6 27B
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 1m 31s
Cost: $0.022164
Provider: Alibaba
Evaluation summary
Matches word count: 100.00%
Dialogue to Total Word Ratio: 100.00%
Overall: 100.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her velvet cape and peered through the dusty venetian blinds at the suburban chaos beyond, her stomach rumbling with greed. Bill was currently hopelessly tangled in a structural balloon arch he hadn't realized was coated in superglue, his face turning the violent shade of a bruised plum as he mouthed frantic apologies to the rubbery entrapment. Their target stood imposing ahead: the Henderson bungalow, currently hosting Timmy's seventh birthday, a fortress of sticky juice boxes and shrieking toddlers guarding the Mother Lode. The treasure trope lay on a reinforced table: three remote-controlled dinosaurs, a bioluminescent slime kit, and a tablet disguised as a ham sandwich. Bill managed to extricate one arm, waving a grappling hook fashioned from twisted licorice, though the tensile strength appeared comically insufficient. He attempted a tactical stealth roll but immediately tripped over a concealed glitter mine, detonating a cloud of sparkles that drifted toward a nearby cupcake tray with lethal precision. Molly winced, recalling the strict no-mess protocols, and gripped her grappling line with white knuckles, sweat beading on her brow. Finally. She signaled. "Bill, you idiot, drop the licorice and dive! Watch the glitter fly! Focus on the dinosaurs! Don't touch the cake!"
created_at
"2026-04-27T09:03:42.378Z"
4/27/2026, 9:03:42 AM
model
"qwen/qwen3.6-27b-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
682
moderation_latency
(null)
generation_time
90587
tokens_prompt
77
tokens_completion
5927
native_tokens_prompt
76
native_tokens_completion
6144
native_tokens_completion_images
(null)
native_tokens_reasoning
5866
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.022164
router
(null)
provider_responses
0
endpoint_id
"82097366-3a73-4790-ac36-f96ba9f6f577"
id
"chatcmpl-674c3344-7d40-90c3-b625-52323e27c7ca"
is_byok
false
latency
682
model_permaslug
"qwen/qwen3.6-27b-20260422"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777280622-epc3x4I4KBGH93tG7Ljg"
session_id
(null)
api_type
"completions"
id
"gen-1777280622-Q4rPXH6lsKA8z3sKmnmA"
upstream_id
"chatcmpl-674c3344-7d40-90c3-b625-52323e27c7ca"
total_cost
0.022164
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
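The timing and token fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on assumptions the page does not state: that latency and generation_time are in milliseconds, and that native_tokens_reasoning is counted inside native_tokens_completion. The function name summarize_run is hypothetical.

```python
# Figures copied from the run metadata above.
LATENCY_MS = 682
GENERATION_TIME_MS = 90587
NATIVE_TOKENS_COMPLETION = 6144
NATIVE_TOKENS_REASONING = 5866


def summarize_run(latency_ms, generation_time_ms, completion_tokens, reasoning_tokens):
    """Derive a few summary figures from the raw metadata.

    Assumes millisecond units and that reasoning tokens are a subset
    of completion tokens; neither is confirmed by the source page.
    """
    generation_s = generation_time_ms / 1000
    return {
        # Tokens left for the visible answer once reasoning is subtracted.
        "visible_tokens": completion_tokens - reasoning_tokens,
        # Fraction of the completion spent on reasoning.
        "reasoning_share": reasoning_tokens / completion_tokens,
        # Average throughput over the generation phase.
        "tokens_per_second": completion_tokens / generation_s,
        # Latency plus generation time, in seconds.
        "wall_clock_s": (latency_ms + generation_time_ms) / 1000,
    }


summary = summarize_run(
    LATENCY_MS, GENERATION_TIME_MS, NATIVE_TOKENS_COMPLETION, NATIVE_TOKENS_REASONING
)
for key, value in summary.items():
    print(key, value)
```

Under these assumptions, most of the completion budget went to reasoning rather than to the 200-word scene, and the summed latency and generation time agree with the reported Duration of 1m 31s.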
Evaluation details
Matches word count
Result: 100.00%
Details: n/a
Meta Data: neededClean false, words 200
Dialogue to Total Word Ratio
Result: 100.00%
Details: Ratio: 9.90%, Deviation: 0.10%
Meta Data: neededClean false, wordsTotal 202, wordsDialogue 20
Overall
Result: 100.0000%
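The reported ratio and deviation follow from the two word counts: 20 dialogue words out of 202 total, compared against the 10% target. The sketch below reproduces that arithmetic. It is not the benchmark's evaluator: the function names are hypothetical, and the counting rule (whitespace-separated words, dialogue taken as text between straight double quotes) is an assumption that may differ from how NC Bench tokenizes.

```python
import re


def dialogue_ratio(text):
    """Return (dialogue_words, total_words, ratio).

    Assumption: words are whitespace-separated and dialogue is any
    text between straight double quotes.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio, target=0.10):
    """Absolute distance between the measured ratio and the target."""
    return abs(ratio - target)


# Reproduce the reported figures from the counts in the evaluation details.
reported_ratio = 20 / 202
print(f"Ratio: {reported_ratio:.2%}")                             # Ratio: 9.90%
print(f"Deviation: {deviation_from_target(reported_ratio):.2%}")  # Deviation: 0.10%
```

How the evaluator converts a given deviation into a percentage score is not shown on this page, so the sketch stops at the ratio and deviation.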