NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ7405X37CFFN20Z7WY3Z6QB
Run Details
Model:
Qwen 3.6 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 37s
Cost:
$0.011965
Provider:
AtlasCloud
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
71.63%
Dialogue to Total Word Ratio
(details)
35.8170%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly traced the grease pencil line across the butcher paper with surgical precision. The living room of the Henderson estate was now a grid of entry points, ventilation shafts disguised as helium balloons, and a clear path to the mantle where the wrapped casualties awaited. Bill leaned against the peeling wallpaper, chewing the end of a red marker like it owed him money. He adjusted his tactical vest, which was actually a repurposed fanny pack stitched together with baling wire. Shadows stretched across the hardwood. A cold draft slipped through the cracked foundation, carrying the scent of damp earth. The fluorescent lights hummed overhead, casting long shadows across the scattered blueprints. Molly smoothed the creased paper against the scarred wooden table while Bill reviewed the timing charts. The air grew thick with anticipation as they finalized every detail of the operation. "Tell me again why we are targeting a six year old birthday." Molly didn't look up from her diagram. "The security is pathetic. One motion sensor, dogs that sleep through thunderstorms, and a mother who thinks supervision means shouting over the inflatable bounce house." Bill tapped the diagram near the tree. "We go in at two twenty. They scatter for sugar. That is our window." Molly finally straightened, wiping grease onto her jeans. "The vault is twenty steps from the back door. That includes navigating the laser grid they call a jump rope maze." Bill reached into his duffel and pulled out a coil of thick nylon. "We have the jump ropes. I dyed them to match the birthday banner. We swing over, drop the net, and bounce out before the first frosting gets on the carpet." Molly crossed her arms. "And the distraction? I am not wearing the unicorn onesie again. The last time you said it was aerodynamic, I nearly tripped over my own tail during the getaway." Bill adjusted his visor. "Aerodynamic is relative. We need a spectacle. I was thinking the chocolate fountain. We tip it near the front porch. They clean for hours. Enough time to bag everything and vanish through the laundry chute." Molly tapped her chin. "The laundry chute is two feet wide. We cannot fit three hundred pounds of wrapped merchandise through a porcelain drain." Bill shrugged. "We leave the jumbo sized robot. The kids will cry anyway. Target the toys. We split at the alley, use the bicycle delivery vans for transport, and meet at the warehouse by four." Molly pulled a small vial from her pocket. "And the dogs? I synthesized a sedative that smells like bacon and costs forty dollars to produce." Bill grinned. "Perfect. We toss it near the gate, we are ghosts. Pack the zip ties, Molly. We move at eighteen hundred hours." She stepped back to examine the structural weaknesses of the false wall. He unclipped a small pair of wire cutters from his belt. The concrete floor felt cold beneath their boots as they organized the equipment. The walls trembled. Sunlight filtered through the dusty blinds, illuminating floating particles of chalk dust. They double checked every knot and verified the weight distribution on each strap. The rain began to fall against the cracked windowpane, washing away the chalk lines on the floor. The plan was flawless.
created_at
"2026-04-27T09:23:20.105Z"
4/27/2026, 9:23:20 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
703
moderation_latency
(null)
generation_time
96857
tokens_prompt
77
tokens_completion
10922
native_tokens_prompt
76
native_tokens_completion
12383
native_tokens_completion_images
(null)
native_tokens_reasoning
11714
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.011964941
router
(null)
provider_responses
0
endpoint_id
"179c6985-3a1c-48f1-a009-9f3e3e402cdd"
id
"76c958c606534e969855755e3ea04e4d"
is_byok
false
latency
702
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"AtlasCloud"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777281800-96uzgiNlIcFczx9Gvouh"
session_id
(null)
api_type
"completions"
id
"gen-1777281800-Ow57EL7b2qgAH1snjcom"
upstream_id
"76c958c606534e969855755e3ea04e4d"
total_cost
0.011964941
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
542
71.63%
Dialogue to Total Word Ratio
Ratio: 47.42%, Deviation: 2.58%
neededClean
false
wordsTotal
542
wordsDialogue
257
35.8170%