NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ794V49YCS0QFZNFG9SARHN
Run Details
Model:
Qwen3.6 Max Preview
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
5m 3s
Cost:
$0.083652
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
99.92%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.9595%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The laminated blueprints covered the kitchen table, secured by a heavy wrench and a crumpled fast food bag. Molly adjusted her magnifying glass while studying the suburban backyard layout. Bill polished his tactical sneakers with obsessive dedication. The neighborhood looked quiet. Rain tapped gently against the glass. Coffee grew cold in chipped ceramic mugs. Are we completely sure about this target? Bill whispered, leaning over the schematic. Absolutely, Molly answered, tapping a red circle near the patio. Thirty premium gifts. Gaming consoles, remote control vehicles, and one mysteriously heavy box marked for Timothy. We strike during the puppet show. The wooden alligator provides cover. Expect zero resistance. Bill nodded reluctantly, fastening his utility belt across his waist. He studied the fence measurements while calculating wind resistance. The suburban fortress demanded precision. Molly organized her lockpicks into neat rows, checking each metallic tip for flaws. Streetlights flickered outside their window, casting long shadows across the planning documents. I prefer waiting for the cake cutting, Bill muttered. Sugar distractions work wonders. Too dangerous, Molly countered. Hyperactive children move like startled gazelles. We stick to the original timeline. What about the hired entertainer? Reports mention a magician with suspiciously deep pockets. Ignore him. Watch the fence line carefully. She slid a velvet pouch across the scratched wood, revealing custom suction cups. Bill inspected the rubber grips with professional skepticism. The equipment looked surprisingly legitimate for a weekend operation. He tested the tension springs, listening for mechanical defects. Satisfied, he arranged the climbing ropes into perfect coils. Silence filled the cramped kitchen. I engineered these for silent ascents, Molly explained. Clean entry, zero footprints. We operate like shadows. Shadows who rob first graders, Bill sighed. My grandmother would weep with disappointment. Your grandmother pickpocketed tourists at the county fair. This is merely aggressive recycling. Timothy already owns three bicycles. We help. Molly unrolled the escape route diagram, highlighting the drainage ditch with yellow ink. The neighboring properties offered excellent concealment. She calculated travel times between checkpoints, accounting for potential obstacles like garden hoses or aggressive terriers. Precision mattered more than speed. Amateur mistakes triggered neighborhood watch patrols instantly. We are balancing the economic scales, she continued. By storing everything in our garage? Bill asked. By converting toys into retirement funds. Now memorize the exit path. We slip past the rose bushes, crawl through the drainage pipe, and meet the transport vehicle. I detailed the station wagon yesterday. The hinges are completely silent, he confirmed, packing spare batteries into his vest. Molly approved his thoroughness with a rare smile. She checked her wristwatch, synchronizing the countdown. Forty-eight hours remained until the celebration began. Every minute demanded preparation. Failure meant facing disappointed children and furious parents. Breaths hitched nervously. Wear the microfiber gloves, she ordered. Leave no evidence on the ribbons. Do not sample the cupcakes. Your sweet tooth ruined Cleveland. Those were lemon frosted, Molly. I possess refined tastes. You possess a criminal lack of discipline. Stay sharp. We rehearse at dawn. The presents await us, my friend.
created_at
"2026-04-27T10:53:15.793Z"
4/27/2026, 10:53:15 AM
model
"qwen/qwen3.6-max-preview-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1355
moderation_latency
(null)
generation_time
302850
tokens_prompt
77
tokens_completion
11027
native_tokens_prompt
76
native_tokens_completion
10712
native_tokens_completion_images
(null)
native_tokens_reasoning
10052
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0836524
router
(null)
provider_responses
0
endpoint_id
"ca7fc0c7-f23d-4ac5-b2e0-945be5b61787"
id
"chatcmpl-c6491ba8-dcd3-9c45-871f-069ed36145b1"
is_byok
false
latency
1355
model_permaslug
"qwen/qwen3.6-max-preview-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777287195-QdKRvHOGkTjSa4KXKQqG"
session_id
(null)
api_type
"completions"
id
"gen-1777287195-4MiaqkrjzfRQMCJlBKja"
upstream_id
"chatcmpl-c6491ba8-dcd3-9c45-871f-069ed36145b1"
total_cost
0.0836524
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.92%
Matches word count
n/a
neededClean
false
words
503
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
504
wordsDialogue
0
49.9595%