NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBE1Y0CESFWJW5XE12YHPAZ
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
57.8s
Cost:
$0.013343
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
4.09%
Dialogue to Total Word Ratio
(details)
2.0452%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her night vision goggles, squinting at the suburban house across the street. The lawn was littered with colorful balloons and a bouncy castle. The moon hung low above the oak trees. "Bill, look at the perimeter. It's just a plastic fence and a trampoline." Bill checked his clipboard, tapping a pen against his chin inside the van. The engine idled softly in the background. "Security is lax. But the prize is high value. We're talking action figures, dolls, and probably a lot of cheap plastic swords." "It's about the thrill, Bill. The sheer audacity of emptying a gift table while the parents are inside screaming at a clown." "Right. The clown is the wildcard. What's the extraction plan?" "We wear the costumes. The ones from the trunk." "The inflatable dinosaur suits? You're joking." "No joke. We walk in, grab the loot, and walk out. Who stops a T-Rex at a birthday party?" "Nobody. But what about the cake?" "Leave the cake. It's a distraction. We hit the presents hard. Focus on the wrapped boxes." "What if a kid recognizes us?" "They won't. They'll think we're the early gift delivery service. We've got the timing down." "Okay. I'm worried about the balloon arch. It's a tripping hazard." "Watch your step. I'll handle the latch on the back door." "You sure about the back door? The front is open." "Front is for guests. Back is for professionals. Plus, there's a dog." "A golden retriever?" "A chihuahua. Don't underestimate the chihuahua." "Right. So, we suit up, slip past the chihuahua, grab the loot, and vanish." "Exactly. And Bill?" "Yeah?" "If we get caught, we say we're lost." "We're not lost. We're precise." "Fine. But if I get glitter in my hair, I'm leaving you behind." "Deal. Now check your gear. Do you have the loot bags?" "Got them. Black, silent, and spacious. Enough for three hundred dollars of LEGO." "Good. What about the getaway vehicle?" "The ice cream truck." "You bought an ice cream truck?" "It's the perfect cover. Nobody suspects the ice cream man of theft. Plus, the music distracts the neighbors." "You're a genius, Molly. Truly." "I know. Now, listen to the music. It's starting." "Happy Birthday to You? That's our cue." "That's our cue. Let's move out." "Wait, one more thing." "What?" "Do I look like a dinosaur or a pterodactyl?" "Does it matter?" "It matters to the kids." "Fine. You look like a T-Rex. Now go." "Moving out." Bill opened the van door, the wind blowing his hair. Molly stepped out into the cool night air. The party was in full swing. She pulled the dinosaur head over her face. Bill grabbed the other suit. They moved toward the fence. The grass crunched under their boots.
created_at
"2026-02-25T22:17:31.922Z"
2/25/2026, 10:17:31 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
431
moderation_latency
(null)
generation_time
57394
tokens_prompt
77
tokens_completion
5626
native_tokens_prompt
76
native_tokens_completion
5550
native_tokens_completion_images
(null)
native_tokens_reasoning
4861
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0133428
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-81aa7c5f-3e1f-979b-87bc-893491c9d731"
is_byok
false
latency
430
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772057851-QGPVlFYh3ckzZIj1Xnpe"
upstream_id
"chatcmpl-81aa7c5f-3e1f-979b-87bc-893491c9d731"
total_cost
0.0133428
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
459
4.09%
Dialogue to Total Word Ratio
Ratio: 78.00%, Deviation: 8.00%
neededClean
false
wordsTotal
459
wordsDialogue
358
2.0452%