NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ76TQE1FZX2NQ6H3NM78H5N
Run Details
Model:
Qwen 3.6 Flash
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
51.6s
Cost:
$0.014894
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
92.91%
Dialogue to Total Word Ratio
(details)
94.4462%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her fake mustache in the cracked rearview mirror of the stolen minivan. Bill wiped grease from his goggles as he adjusted the confetti cannon. Molly kicked the dashboard with her combat boot. She smirked, tapping a finger against the wheel. Bill sighed, resigning himself to the sprinkles. "Bill, focus," Molly said, gesturing violently toward the yard. Bill swallowed hard. "I'm focused, Mol. But what's the distraction?" Molly glanced at her phone. "Juice box labels switched. Sugar rush in three minutes. Table seven holds the stash." Bill squinted at the unicorn piƱata. "That looks dangerous. What about extraction?" "Magic slide, loaded in the diaper bag," Molly commanded. "Grab the suction cups." "Are you sure the bag fits the loot?" Bill asked, voice trembling. Molly slapped his knee. "It's tactical. Expandable fabric. Stop whining." "Do we really have to risk the glitter?" Bill whispered. Molly checked her watch. "Glitter is currency. Keep moving." Bill gripped the door handle. "What if the baby wakes up?" "Earplugs. We have earplugs." Bill looked at the cat lurking by the fence. "And the cat?" Molly shrugged, checking her mask fit. "The cat loves money. Let's go."
created_at
"2026-04-27T10:12:47.18Z"
4/27/2026, 10:12:47 AM
model
"qwen/qwen3.6-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
639
moderation_latency
(null)
generation_time
51420
tokens_prompt
77
tokens_completion
8974
native_tokens_prompt
76
native_tokens_completion
9917
native_tokens_completion_images
(null)
native_tokens_reasoning
9613
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0148945
router
(null)
provider_responses
0
endpoint_id
"2435bec9-c808-4f27-9fc8-12f3460d7b66"
id
"chatcmpl-a2a804c7-8c50-9df9-8b59-4e7f165ddd9f"
is_byok
false
latency
639
model_permaslug
"qwen/qwen3.6-flash"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777284767-04JbZUAaBIHWYXj5X5lh"
session_id
(null)
api_type
"completions"
id
"gen-1777284767-90xgpECgvRLSjH7DaUoV"
upstream_id
"chatcmpl-a2a804c7-8c50-9df9-8b59-4e7f165ddd9f"
total_cost
0.0148945
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
192
92.91%
Dialogue to Total Word Ratio
Ratio: 44.79%, Deviation: 5.21%
neededClean
false
wordsTotal
192
wordsDialogue
86
94.4462%