NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBDTZ06EA45SYBC6ST88CED
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
1m 42s
Cost:
$0.025333
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
99.9995%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon tutu while Bill checked his laser cutter. The suburban house glowed brightly with colorful birthday balloons. Shadows moved inside. Bill nodded nervously. The driveway was empty except for a single red tricycle. Moonlight hit the grass. Bill wiped sweat. They were. Tonight was big. Molly signaled the team. They wore matching pajamas to blend in perfectly. No one suspected them. Bill sighed, tightening his grip on the rusty crowbar. His hands shook slightly. They crept toward the front door slowly. The lock clicked open. "Is it actually ethical to do this?" Bill whispered softly, peeking through the window. "Money doesn't care about ethics, Bill. It cares about plastic dinosaurs only." Molly grinned, adjusting her mask. "What about the clown?" "We distract him with glitter bombs. Easy." "How many presents do you think?" "Enough to retire us to a very tropical island. Maybe Bali." "Bali sounds nice. Less humidity than here." "Focus, Bill. We hit the garage first." "What if the parents wake up?" "Then we blame the birthday boy. Classic misdirection." "This feels wrong. Stealing from kids?" "They'll get lots more next year. It's renewable energy." "You're twisted, Molly." "I'm efficient. Now move." "Ready?" "Yes." "Go." "Now."
created_at
"2026-02-25T22:13:43.564Z"
2/25/2026, 10:13:43 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
424
moderation_latency
(null)
generation_time
101668
tokens_prompt
77
tokens_completion
8687
native_tokens_prompt
76
native_tokens_completion
10546
native_tokens_completion_images
(null)
native_tokens_reasoning
10246
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0253332
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-dfccd9ce-1c78-9dbe-aa9a-58308b02ed16"
is_byok
false
latency
424
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772057623-OmxmHIlvOJ9QMmhY26Yi"
upstream_id
"chatcmpl-dfccd9ce-1c78-9dbe-aa9a-58308b02ed16"
total_cost
0.0253332
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
201
100.00%
Dialogue to Total Word Ratio
Ratio: 50.25%, Deviation: 0.25%
neededClean
false
wordsTotal
201
wordsDialogue
101
99.9995%