NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBE9RVFSH3Y3598DEAN814Z
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 13s
Cost:
$0.016842
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her night vision goggles, though it was broad daylight. She squinted at the suburban house across the street. Balloons bobbed in the wind like trapped helium prisoners. She tapped the tablet in her lap, scrolling through security schematics drawn on napkins. The van hummed quietly, a diesel beast disguised as a delivery truck parked behind a bush. Target confirmed, Molly said. Seventh birthday. High value. Bill chewed on a toothpick, leaning against the hood of their unmarked van. He looked at the house with skepticism. High value? It's mostly plastic dinosaurs and glow sticks. Glow sticks are liquid light, Bill. Liquid light sells for top dollar on the black market. Right. And the dinosaurs? Vintage T-Rexes. Collectible. Rare edition. Bill nodded slowly, though he clearly knew she was lying. He checked his watch, the metal ticking loudly. Party starts in twenty. The parents are distracted by the punch bowl. That's our window. I've got the grapple hook ready, Molly whispered, tapping the grappling device strapped to her thigh. Just need to scale the fence. Fence is four feet tall. You don't need a grapple hook for a four-foot fence. It's about the principle, Bill. We are professionals. We never compromise on the gear. We're stealing a box of crayons. We're stealing potential, Molly corrected. That crayon could be the one that draws the next Mona Lisa. You can't put a price on art. Bill sighed, pulling a crowbar from the back seat. What's the extraction plan? Do we run? Do we hide in the cake? Cake is a trap. Too much sugar. We'll load the van and drive off. What if the birthday boy sees us? Then we offer him a trade. A trade? I have a shiny rock. He'll be too busy playing with the rock to notice the missing presents. He's seven, Molly. He won't care about the rock. He'll care if we call it a dragon egg. Bill chuckled. You're insane. I'm a visionary. Now, get the bag. We need to move fast before the piñata breaks. The piñata? Distraction. When it breaks, everyone looks at the candy. That's when we sweep the gift table. What if the parents are watching? They won't be. They'll be fighting over who gets the last cupcake. You're counting on parental negligence. I'm counting on human nature. Bill hopped out of the van. Alright. Let's go steal some plastic toys. Not toys. Treasures. Molly opened the van door. Remember, no touching the cake. Only the cake? Only the presents. The cake is evidence. Right. Evidence. Bill cracked his knuckles. Ready? Born ready. They sprinted toward the house, blending in with the flow of children. Molly grabbed a party hat. Bill grabbed a bag. The heist began.
created_at
"2026-02-25T22:21:48.79Z"
2/25/2026, 10:21:48 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
708
moderation_latency
(null)
generation_time
73134
tokens_prompt
77
tokens_completion
7251
native_tokens_prompt
76
native_tokens_completion
7008
native_tokens_completion_images
(null)
native_tokens_reasoning
6351
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.016842
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-720d64ff-f924-9d5d-b65d-02e23af6ed18"
is_byok
false
latency
708
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772058108-pKquk6YBAIQiPHZUxMC8"
upstream_id
"chatcmpl-720d64ff-f924-9d5d-b65d-02e23af6ed18"
total_cost
0.016842
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
457
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
458
wordsDialogue
0
0.0000%