NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBFVZTM3DRNHBHP7261SDJ9
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
2m 1s
Cost:
$0.028688
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
0.18%
Dialogue to Total Word Ratio
(details)
48.0819%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van idled silently behind the hedges. Confetti balloons bobbed against the twilight sky like colorful, plastic tumors. Molly adjusted her night vision goggles while Bill polished a grappling hook. The engine ticked as it cooled. The smell of cut grass drifted through the open window. "Okay, look at the layout," Molly whispered, tapping the blueprint with a grease-stained finger. "The kid turns six tomorrow. That means the gifts are still under the tree. Unwrapped." Bill adjusted his tie, which was slightly too large for his neck. He looked nervous. "Six years old? Molly, that’s a lot of LEGOs. Do we have space for the LEGOs?" "We have space for everything, Bill. We are professionals." Molly folded the map. "I was thinking more about the emotional damage. If we take the presents, who gets the joy?" "The joy is ours. We sell the toys on the black market. Think of the profit margins on unopened action figures." Bill nodded slowly, chewing on a toothpick. A sprinkler system hissed nearby. "What about the cake?" "Leave the cake. It’s too messy. We’re after the plastic, Bill. The shiny, wrapped plastic." Molly checked her watch. The hands moved past ten. "The parents are inside watching the piñata. That’s our window. You take the window, I take the chimney." "The chimney? It’s summer, Molly. There’s no fireplace." Bill pointed at the roof. "Metaphorically, Bill. I mean the ventilation shaft. It’s right above the nursery." "Right. Ventilation. Got it." Bill pulled a grappling hook from his belt. It clattered loudly. "Do I need to wear a mask?" "Yes. A clown mask. It’s a party. Blend in." "I have a jester mask. Is that okay?" "No, Bill. It’s too specific. Just a clown mask. Red nose, white face." "I don’t have a red nose." "Take a tomato from the grocery bag. Crush it." Bill grimaced. The tomato looked fresh. "That seems unsanitary." "It’s a heist. Nothing is sanitary. Now, look at the security system. It’s a motion sensor." "Can we disable it?" "No. We have to dance through it. Literally." "Dance?" "The sensor triggers on sudden movement. Slow, rhythmic motion confuses the algorithm. It thinks we’re part of the decor." Bill sighed. The wind rustled the leaves. "So, we’re dancing thieves." "We’re artistic thieves. There’s a difference." "Focus, Bill. The van is locked. The bags are ready. We move in three, two, one." They opened the van door. The grass was wet with sprinkler water. A dog barked in the distance. They stepped out into the cool night air. Molly moved silently across the lawn while Bill struggled with his boots. The house loomed ahead, glowing with warm light. They crept toward the back door. The music inside grew louder. "Do you think the kid will cry?" "Probably. But he won’t remember us." "That’s the worst part." "It’s the best part." Molly kicked the door. It unlocked. "Easy entry." "Too easy." "Trust me." "I do." They stepped inside.
created_at
"2026-02-25T22:49:14.332Z"
2/25/2026, 10:49:14 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
412
moderation_latency
(null)
generation_time
121050
tokens_prompt
77
tokens_completion
11701
native_tokens_prompt
76
native_tokens_completion
11944
native_tokens_completion_images
(null)
native_tokens_reasoning
11209
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0286884
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-d675de81-decd-92f4-a8d9-be26520caa74"
is_byok
false
latency
412
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772059754-zGYa7Vf8IeabuieKTH5c"
upstream_id
"chatcmpl-d675de81-decd-92f4-a8d9-be26520caa74"
total_cost
0.0286884
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
492
0.18%
Dialogue to Total Word Ratio
Ratio: 61.26%, Deviation: 11.26%
neededClean
false
wordsTotal
493
wordsDialogue
302
48.0819%