NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBH6Q5VABAF962MTHRFW4QW
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 7s
Cost:
$0.030143
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
99.92%
Matches word count
(details)
2.36%
Dialogue to Total Word Ratio
(details)
51.1396%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her night-vision goggles, squinting at the suburban house below. Confetti cannons were primed on the lawn like tactical weaponry waiting for a signal. Bill leaned against the van door, eating a granola bar with the intensity of a man inspecting a live bomb. The target was a seventh birthday party, but the loot potential was astronomical. Plastic dinosaurs, glitter glue, and boxed electronics waited inside the living room. The moonlight reflected off the plastic wrapping, signaling high-value assets. The van hummed quietly, engine cold but ready. The engine was silent, a ghost in the night. Stealing from children felt wrong, but the market was strong. "You know the parents are the real security detail," Bill said, crumbs falling onto his tactical vest. "They are distracted, but they are not stupid." Molly checked her watch. They are on cake duty. That buys us twelve minutes before the sugar crash hits. The clock was ticking louder than the bass from the stereo inside. Time was the enemy. Every second counted. "Twelve minutes is tight for a full sweep," Bill replied, wiping his hands on his pants. "We need a distraction." Molly rolled her eyes. Fine. We go in the back. The dog is asleep. The shadows stretched long across the pavement. The grass was wet with dew. "The dog is a golden retriever named Buster," Bill corrected. "He likes treats." "You have a granola bar," Molly said. "That is not a bribe." Molly sighed and checked her lockpicks. Just get the bag ready. If we get caught, we say we are lost. "We are not lost," Bill said. "We are undercover." They stepped into the darkness. They descended the ladder silently. The party roared with noise inside. Molly kicked the back door open, but it was locked. Bill pulled out a slim jim. He jimmied the lock while Molly scanned the perimeter for motion sensors. A single string of lights buzzed overhead. The sound was rhythmic, like a heartbeat. The air was still. "That is not a sensor," Bill whispered. "That is a decoration." Bill popped the lock. The door swung open. Inside, the smell of frosting and excitement filled the air. They moved like shadows through the kitchen. The presents were stacked in the corner. A mountain of joy wrapped in paper. Molly grabbed the bag. It was heavy with potential. "How many?" Bill asked. "Forty," Molly counted. "Heavy lift." They loaded the bag. The timer on Molly’s watch ticked. Five minutes left. They slipped out the back door just as the cake was cut. The parents cheered. Molly and Bill drove away into the night. The city lights faded behind them. "Did we get the Play-Doh?" Bill asked. "All of it," Molly said. "Next job," Molly said. "A wedding." "Too much security," Bill said. "Stick to birthdays." "We need to be quick," Bill said. "I know the plan," Molly said. "What about the cake?" Bill asked. "Leave the cake," Molly said. "Okay," Bill said. "Yes," Molly said. They drove very fast.
created_at
"2026-02-25T23:12:34.5Z"
2/25/2026, 11:12:34 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
478
moderation_latency
(null)
generation_time
126091
tokens_prompt
77
tokens_completion
13038
native_tokens_prompt
76
native_tokens_completion
12550
native_tokens_completion_images
(null)
native_tokens_reasoning
11862
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0301428
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-425ed2ef-f2da-9484-99ff-22d7a3e004be"
is_byok
false
latency
478
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772061154-ADTZfr8ghfDIxiXA2e93"
upstream_id
"chatcmpl-425ed2ef-f2da-9484-99ff-22d7a3e004be"
total_cost
0.0301428
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
99.92%
Matches word count
n/a
neededClean
false
words
503
2.36%
Dialogue to Total Word Ratio
Ratio: 21.34%, Deviation: 8.66%
neededClean
false
wordsTotal
506
wordsDialogue
108
51.1396%