NC Bench
Run: 2026-04-27-01KQ73QWHK2JJ3CYP9ZBQNWV1X
Run Details
Model: Qwen 3.6 35B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 1m 21s
Cost: $0.010680
Provider: AtlasCloud
Evaluation summary
Score | Evaluator | Details Link
99.92% | Matches word count | (details)
99.95% | Dialogue to Total Word Ratio | (details)
99.9348% | Overall |
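The page does not say how the Overall figure is derived. As a rough check, the sketch below assumes it is an unweighted mean of the two evaluator scores (an assumption, not something the report confirms). The mean of the rounded scores comes out near 99.935, which is consistent with the reported 99.9348% if the unrounded scores are slightly lower than displayed.

```python
# Evaluator scores as displayed in the summary (already rounded by the page).
scores = {
    "Matches word count": 99.92,
    "Dialogue to Total Word Ratio": 99.95,
}

# ASSUMPTION: Overall is an unweighted mean; the report does not state this.
overall_estimate = sum(scores.values()) / len(scores)

reported_overall = 99.9348
print(f"estimate: {overall_estimate:.4f}%  reported: {reported_overall}%")
print(f"difference: {abs(overall_estimate - reported_overall):.4f} points")
```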
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Rain slicked the alleyway behind the suburban residence. Molly adjusted her tactical visor, checking the thermal imaging display. The backyard glow suggested a heavily guarded operation. Bill fumbled with his utility belt, nearly dropping a coil of bungee cords. He wiped condensation from his goggles and nodded toward the sliding glass doors. Through the window, they observed the chaotic staging area. Wrapping paper strewn across polished hardwood floors formed a colorful obstacle course. Several deflated streamers hung from the ceiling fan like surrender flags. The target sat beneath a plastic pine tree, stacked in precarious towers. Parents moved through the haze of cheap fog machines, carrying paper plates and juice boxes. The atmosphere reeked of synthetic vanilla and desperate cheerfulness. Molly tapped her boots against the wet concrete, calculating the approach vector. She traced a route through the flowerbeds, avoiding the sprinkler system. Bill checked his wrist chronometer, syncing it with the neighborhood security schedule. He adjusted his grip on the velvet rope spools, ensuring the tension would hold. They crouched behind a plastic flamingo, watching the host mother fill a punch bowl. The liquid swirled with floating marbles and candy pieces. Everything was positioned for maximum impact. They waited for the perfect moment to breach the perimeter. The wind carried distant birthday music. Molly brushed a leaf from her shoulder and checked her gloves. Bill nodded silently, his breath fogging the cold air. They prepared to execute the operation. The evidence pointed toward immediate action. She checked the weather forecast. "The inflatable dinosaur is our perimeter, is it deflating?" "Slightly. We have a ten minute window before the cake arrives." "Cake means chowdows, which means guards are drunk on sugar and juice." "Exactly. The mountain of wrapped boxes sits under that terrible tinsel tree." "Why do parents force children to watch this much glitter?" 
"It disorients them. Perfect for our approach. You take the left flank, I will handle the gift bags." "What about the puppy?" "The puppy is in the kitchen. We avoid the kitchen." "Agreed. What is the extraction route?" "Through the laundry room, over the fence, and into my cargo van." "My van smells like old gym socks and regret." "It is reliable. And it has extra blankets for the merchandise." "What merchandise?" "The presents, Bill. We are stealing the presents." "Right. The presents. Should we leave the drawing of a stick figure horse?" "Leave it. It is worthless paper." "And the giant stuffed walrus?" "That is high value. Secure it with the bungee cords." "What if the host mother wakes up?" "She will not. I swapped her decaf beans for regular grounds earlier." "You poisoned her?" "I accelerated her sleep cycle. There is a difference." "Noted. Ready when you are." "Check your pockets for the velvet ropes." "I have three rolls and a spare key." "Good. The back door has a pressure sensor." "We will use the rubber mallets." "Just tap gently." "Loud and clear." "Do not drop anything." "I never drop anything." "Very lucky." "Come on then."
created_at
"2026-04-27T09:18:48.377Z"
4/27/2026, 9:18:48 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
860
moderation_latency
(null)
generation_time
81213
tokens_prompt
77
tokens_completion
10239
native_tokens_prompt
76
native_tokens_completion
11052
native_tokens_completion_images
(null)
native_tokens_reasoning
10366
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.010680194
router
(null)
provider_responses
0
endpoint_id
"179c6985-3a1c-48f1-a009-9f3e3e402cdd"
id
"81e9c9b579d44b0d87be7bb017ed4b01"
is_byok
false
latency
860
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"AtlasCloud"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777281528-dwtT7AQ41n9lP59oAQQe"
session_id
(null)
api_type
"completions"
id
"gen-1777281528-9lHleDOsMXcAaWctbkBx"
upstream_id
"81e9c9b579d44b0d87be7bb017ed4b01"
total_cost
0.010680194
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
response_cache_source_id
(null)
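A few derived figures make the metadata above easier to read. The sketch below uses only values copied from this run; it assumes `generation_time` is in milliseconds (81213 ms lines up with the reported duration of 1m 21s) and that `native_tokens_completion` includes the reasoning tokens. Both are assumptions about the fields, not documented facts. The `summarize` helper is hypothetical.

```python
# Values copied from the run metadata above.
meta = {
    "generation_time": 81213,           # assumed to be milliseconds
    "native_tokens_completion": 11052,  # assumed to include reasoning tokens
    "native_tokens_reasoning": 10366,
}

def summarize(meta: dict) -> dict:
    """Derive duration, visible output tokens, and throughput from the raw fields."""
    seconds = meta["generation_time"] / 1000
    completion = meta["native_tokens_completion"]
    reasoning = meta["native_tokens_reasoning"]
    return {
        "duration": f"{int(seconds // 60)}m {int(seconds % 60)}s",
        "visible_tokens": completion - reasoning,    # tokens left for the scene itself
        "reasoning_share": reasoning / completion,   # fraction spent on reasoning
        "tokens_per_second": completion / seconds,
    }

summary = summarize(meta)
for key, value in summary.items():
    print(key, value)
```

Under these assumptions, roughly 94% of the native completion tokens went to reasoning, leaving 686 tokens for the scene.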
Evaluation details
Result | Evaluator | Details | Meta Data
99.92% | Matches word count | n/a | neededClean: false, words: 503
99.95% | Dialogue to Total Word Ratio | Ratio: 50.10%, Deviation: 0.10% | neededClean: false, wordsTotal: 503, wordsDialogue: 252
99.9348% | Overall | |
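The reported ratio and deviation follow directly from the two counts: 252 / 503 is about 50.10%, which is about 0.10 percentage points from the 50% target. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. The counting rule (whitespace-separated words, with dialogue being anything inside straight double quotes) and the function names are assumptions; the evaluator's actual tokenization and its score formula are not shown on this page.

```python
import re

def dialogue_stats(text: str) -> dict:
    """Count total words and words inside straight double quotes (assumed rule)."""
    total = len(text.split())
    spoken = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(segment.split()) for segment in spoken)
    ratio = dialogue / total if total else 0.0
    return {"wordsTotal": total, "wordsDialogue": dialogue, "ratio": ratio}

def deviation_from_target(words_dialogue: int, words_total: int, target: float = 0.5) -> float:
    """Absolute distance between the dialogue ratio and the target ratio."""
    return abs(words_dialogue / words_total - target)

# Counts reported in the evaluation details above.
reported_ratio = 252 / 503
reported_deviation = deviation_from_target(252, 503)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {reported_deviation:.2%}")

# Small illustrative sample (not taken from the model response).
sample = 'Molly nodded. "We are stealing the presents." Bill sighed.'
stats = dialogue_stats(sample)
print(stats)
```

This does not attempt to reproduce the 99.95% score itself, since the mapping from deviation to score is not given.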