NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBHBBDSWBEHX790YBRDRXTW
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
1m 42s
Cost:
$0.024248
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
99.92%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
99.9578%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her ski mask, which was slightly too tight around her small ears. She stared at the glowing house across the street. Confetti cannons were visible through the window. "Are you sure about this?" Bill whispered, clutching a crowbar like a teddy bear. "It's a birthday party. Kids are inside." "Maximum chaos, minimum security. The parents are too busy blowing up balloons to notice the van." Molly replied softly, checking her watch. Bill grimaced, rubbing his chin nervously. "I don't know. Stealing from children feels wrong. What if I get a conscience?" "You won't," she said, tossing him a bag. "We're not stealing the joy. We're redistributing the plastic dinosaurs." "Plastic dinosaurs?" Bill asked, eyeing the house. "What about the cake?" "Leave the cake. We're here for the loot." Molly pointed at the driveway. "Wait for the clown to leave." "The clown?" Bill looked terrified. "There's a clown?" "Pink nose. Green wig. He's the lookout." Molly grinned. Bill sighed slowly, tightening his grip on the crowbar. "If I get caught, I'm telling them you made me." "Deal." They slipped quietly into the shadows as the music swelled. The big heist began. Night air smelled of sugar.
created_at
"2026-02-25T23:15:06.305Z"
2/25/2026, 11:15:06 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
501
moderation_latency
(null)
generation_time
101582
tokens_prompt
77
tokens_completion
8526
native_tokens_prompt
76
native_tokens_completion
10094
native_tokens_completion_images
(null)
native_tokens_reasoning
9798
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0242484
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-0bbab0f8-1c27-99a4-b768-afcfb9ffd0c4"
is_byok
false
latency
501
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772061306-NCMhMUDy0yJ5fT3YyvpT"
upstream_id
"chatcmpl-0bbab0f8-1c27-99a4-b768-afcfb9ffd0c4"
total_cost
0.0242484
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
99.92%
Matches word count
n/a
neededClean
false
words
197
100.00%
Dialogue to Total Word Ratio
Ratio: 49.24%, Deviation: 0.76%
neededClean
false
wordsTotal
197
wordsDialogue
97
99.9578%