NC Bench
Run: 2026-04-27-01KQ749P1Z1QR477Z0X6H9HVSC
Run Details
Model: Qwen 3.6 35B
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 1m 39s
Cost: $0.014774
Provider: AtlasCloud
Evaluation summary
Score | Evaluator | Details Link
98.71% | Matches word count | (details)
0.00% | Dialogue to Total Word Ratio | (details)
49.3562% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Rain tapped against the windshield like impatient fingers. Molly adjusted her visor while Bill studied a crumpled flyer. The target sat across the street, a modest suburban home draped in plastic streamers. They had spent three nights researching juvenile psychology and gift distribution logistics. The van smelled faintly of old maps and ambition. Neither spoke immediately, knowing silence built tension. Bill traced the perimeter of the driveway with a gloved finger, calculating sightlines and escape routes. Molly reviewed the schedule, noting when the birthday boy would inevitably crash from sugar exhaustion. Success depended on precision, patience, and perfect timing. They were professionals, after all. We enter through the garage, Molly said, pointing at the architectural diagram. I prefer the side yard, Bill replied, adjusting his rubber gloves. The dog sleeps in the kitchen, Molly noted. We feed him cheese, Bill countered. Exactly. The presents cluster near the fireplace. We grab the boxes, leave the wrapping paper, and exit before the singing begins. What if a parent notices us? Bill asked, checking his reflection in the window. They will assume we are hired entertainers, Molly explained. I hope the children do not suspect us, Bill murmured. They never suspect the adults, Molly said, starting the engine. I still feel guilty, Bill confessed. Guilt is inefficient, Molly replied. I agree completely, Bill said. We move at eight, Molly instructed. Understood. Bill checked his watch. No sudden movements, Molly warned. I will remain perfectly still, Bill promised. Remember the blue hats, Molly added. They are in the bag, Bill confirmed. Do not touch the cake, Molly commanded. I respect the cake, Bill assured her. Check the bags, Molly instructed. Locked tight, Bill replied. Focus on the target, Molly said. Eyes forward, Bill acknowledged. Ready, Molly asked. Always, Bill responded. Proceed, Bill stated. Go now, Molly said. Excellent. Molly shifted into gear. 
We will be legendary, Bill whispered. Fame requires persistence, Bill reasoned. Molly agreed. Indeed. Molly pressed the accelerator. This is our moment, Bill declared. It is our Tuesday, Molly noted. Still magnificent, Bill said. Drive carefully, Molly instructed. I always do, Bill replied. The van rolled forward. The plan remained sound. Tomorrow, they would vanish with every wrapped box in existence. Professional thieves rarely targeted minors, but the payoff justified the ethical compromises. They had survived worse heists, including the infamous cheese warehouse incident. Bill checked his pulse. Molly checked her mirrors. The street remained quiet. Children slept upstairs while adults gathered below. The perfect storm of opportunity and vulnerability. They would slip through the walls like ghosts. Nothing would stop them now. Sirens wailed in the distance, harmless background noise for seasoned criminals. They practiced breathing exercises to maintain steady heart rates. The mission required absolute focus and zero hesitation. Every second counted toward their inevitable success. They had rehearsed this exact sequence until it felt automatic. Professional conduct demanded unwavering discipline and flawless execution. The neighborhood awaited their arrival with oblivious peace. They prepared to transform chaos into calculated victory. The moon watched silently from above.
created_at
"2026-04-27T09:28:31.557Z"
4/27/2026, 9:28:31 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
686
moderation_latency
(null)
generation_time
99036
tokens_prompt
77
tokens_completion
12732
native_tokens_prompt
76
native_tokens_completion
15293
native_tokens_completion_images
(null)
native_tokens_reasoning
14600
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.014773819
router
(null)
provider_responses
0
endpoint_id
"179c6985-3a1c-48f1-a009-9f3e3e402cdd"
id
"2f15c8028d874ce7b133b29fd3b51002"
is_byok
false
latency
686
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"AtlasCloud"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777282111-Kb7PGMEiCfaaKGFJEEKF"
session_id
(null)
api_type
"completions"
id
"gen-1777282111-Hg8XTncm2UpZPu6ayEID"
upstream_id
"2f15c8028d874ce7b133b29fd3b51002"
total_cost
0.014773819
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
98.71% | Matches word count | n/a | neededClean: false; words: 506
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 30.00% | neededClean: false; wordsTotal: 506; wordsDialogue: 0
49.3562% | Overall | |
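The Dialogue to Total Word Ratio result above (wordsDialogue 0 of wordsTotal 506) fits an evaluator that only counts words inside quotation marks, since the model response contains no quotation marks at all. The sketch below illustrates that idea. It is not NC Bench's actual evaluator: the quote pattern, the whitespace word count, and the function names `count_words`, `dialogue_ratio`, `deviation`, and `overall` are all assumptions made for illustration. The unweighted mean in `overall` is likewise an assumption, chosen because the two reported scores (98.71% and 0.00%) average to the reported 49.3562% up to rounding.

```python
import re

# Assumption: dialogue is any text between straight or curly double quotes.
QUOTED = re.compile(r'["\u201c]([^"\u201d]*)["\u201d]')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (an assumed word definition)."""
    return len(text.split())


def dialogue_ratio(text: str) -> float:
    """Fraction of all words that fall inside quotation marks."""
    total = count_words(text)
    if total == 0:
        return 0.0
    spoken = sum(count_words(span) for span in QUOTED.findall(text))
    return spoken / total


def deviation(ratio: float, target: float = 0.30) -> float:
    """Absolute distance from the target ratio (30% in this scenario)."""
    return abs(ratio - target)


def overall(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (assumed aggregation)."""
    return sum(scores) / len(scores)


# Same sentence with and without quotation marks.
unquoted = "We enter through the garage, Molly said."
quoted = '"We enter through the garage," Molly said.'

print(dialogue_ratio(unquoted))             # 0.0, no quoted spans found
print(dialogue_ratio(quoted))               # 5 of 7 words are quoted
print(deviation(dialogue_ratio(unquoted)))  # full 0.30 miss from target
print(overall([0.987124, 0.0]))             # mean of the two evaluator scores
```

Under these assumptions, a response that marks speech only with tags such as "Molly said" and never with quotation marks scores a ratio of 0.00% and a deviation of 30.00%, which matches the figures reported for this run.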