NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ72V9BQ05BVNR7Y1H1MRFEZ
Run Details
Model:
Qwen 3.6 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
1m 9s
Cost:
$0.009658
Provider:
AtlasCloud
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprint was a crumpled party invitation. Bill adjusted his glasses and tapped a grease pencil against the paper. Across the table, Molly unfolded a neighborhood map. They had done this before, but never at a children’s party. The stakes were different when the targets were under ten. The fluorescent light hummed overhead, casting long shadows across the scattered blueprints. A cold cup of coffee sat untouched beside a stack of numbered envelopes. Neither of them blinked as they calculated wind speed, door weight, and the exact decibel level required to mask a rolling dolly. Okay, listen. We go in at six-oh-five. The parents will be distracted by the balloon animal guy. He takes twelve minutes to make a giraffe. Twelve minutes is generous. You know how nervous he gets when he runs out of pink balloons. We have eight minutes of absolute chaos. Perfect. That leaves us a two-minute window to breach the living room, secure the gift table, and vanish before the cake arrives. Sugar rush creates chaos. Agreed. We prioritize the big ticket items. The trampoline, the scooter, the giant plush dinosaur. We strap them to our backs like military rations. No dinosaur straps, Molly. It will throw off our center of gravity. We will trip over a spilled juice box. Then we use the dolly. The one with the squeaky wheel we oiled last Tuesday. It sounds like an alarm. We mask it with the piñata music. I told the DJ to cue the Macarena early. If we hit the right chord, the noise will drown out the acoustics of our getaway. You bribed a man named Greg with a coupon for free churros to play the Macarena at six-thirteen. I negotiated. Greg wanted out of his day job. Now he is a professional distraction specialist. We meet him at the back door. He will be wearing the sombrero. We double-check the timepieces. The party starts at six, but the parents linger for small talk. That gives us a buffer. What about the mother of the birthday boy? The one with the clipboard. I clocked her at the last job. She notices everything. We deploy the snack table. We leave a trail of fruit kebabs and mini muffins leading straight to the recycling bins. She will follow the evidence. She always does. Perfect. I will signal when the first slice of cake hits the table. That means the sugar has fully engaged. I learned from the best. Now, what is the drop zone? The alley behind the auto shop. The dumpster is clear until seven-thirty. We will load everything into the van, cover it with the tarp, and we are gone. Gone like a unicorn at a petting zoo. Quick, loud, and completely unbelievable. Exactly. So, are we in, or are we going home to explain to our landlord why we sold his lawnmower again? I am in. But if the giraffe takes longer than ten minutes, we are leaving with the socks. Deal. Pack the dolly. And remember, if we get caught, we are just enthusiastic gift delivery volunteers. Practice the smile. The one that shows teeth but not desperation. We move fast. We move quiet. We leave nothing but confetti and confusion behind. Molly traced a red line from the front door to the living room, her finger hovering over the gift table. The layout was deceptively simple. She reached into her coat pocket and pulled out foam earplugs, setting them beside masking tape. Bill watched her arrange the equipment. Outside, the van idled, fueled and ready to swallow a mountain of wrapped cardboard.
created_at
"2026-04-27T09:03:11.229Z"
4/27/2026, 9:03:11 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1014
moderation_latency
(null)
generation_time
68821
tokens_prompt
77
tokens_completion
9848
native_tokens_prompt
76
native_tokens_completion
9993
native_tokens_completion_images
(null)
native_tokens_reasoning
9211
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.009657994
router
(null)
provider_responses
0
endpoint_id
"179c6985-3a1c-48f1-a009-9f3e3e402cdd"
id
"a67ee35c5ed647b5973c3328ff134fad"
is_byok
false
latency
1014
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"AtlasCloud"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777280591-CM9cWLbTwjOtaCZK2tzB"
session_id
(null)
api_type
"completions"
id
"gen-1777280591-v13ANtUHkSseoIjw70lE"
upstream_id
"a67ee35c5ed647b5973c3328ff134fad"
total_cost
0.009657994
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
600
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
606
wordsDialogue
0
0.0000%