NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS87F4WP3C6Y87GY1RBJCWJ0
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
2m 49s
Cost:
$0.096610
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crinkled blueprint across the sticky diner table. She tapped a red marker against the layout of the suburban mansion. Bill slurped his milkshake loudly, ignoring the glare from the waitress. He wiped a thick milk mustache with his dirty sleeve and leaned over the large paper map. "We enter through the bouncy castle because it provides excellent cover from the neighbors and the loud music will completely mask the sound of us cutting the back fence while the parents are busy eating cake and drinking cheap wine near the big brick patio area out back right now." Bill choked on his plastic straw and coughed violently. He slammed his tall glass down, spilling chocolate syrup everywhere. He pointed a trembling finger at the backyard section of the detailed drawing while shaking his head in total and complete disbelief at her very ridiculous and absurd new tactical plan. "That is a terrible idea because the clown is guarding it with a very sharp sword made of shiny balloons and he looks like a crazy guy who would gladly attack anyone trying to ruin his magic show for the little kids today my dear old friend so think again." Molly rolled her eyes and drew a circle around the patio. She clicked her pen twice to emphasize her absolute authority. She knew the security guards were just bored teenagers who wanted to play video games instead of watching the gate for any sneaky little party crashers trying to enter. "The clown is just a guy named Steve who smokes behind the garage all day long while ignoring his actual job duties and he will not notice us sneaking past the giant inflatable slide to grab the wrapped gifts from the dining room table inside the big fancy brick house." Bill crossed his arms and slumped back into the vinyl booth. He stared at the ceiling, contemplating his life choices. He wondered if he could still become an accountant or maybe a dentist because this criminal lifestyle was getting far too weird for his fragile mental health and general sanity. "Steve has a radio and he will definitely call for backup when he sees two adults crawling through a bouncy castle filled with screaming toddlers holding sticky juice boxes while we try to steal a bunch of cheap plastic toys that are probably made in a giant overseas sweatshop factory." Molly sighed and folded the map neatly. She tossed it into her leather bag and stood up. She dropped a five dollar bill on the table for the drinks before grabbing her dark sunglasses and marching toward the glass exit door with absolute and unwavering professional criminal confidence and grit. "We are doing this, Bill, and you are wearing the red clown nose because it provides excellent camouflage in the ball pit, plus it really brings out your eyes while we sneak past the magician and grab all the wrapped gifts from the big wooden dining room table tonight, buddy."
created_at
"2026-05-22T16:14:34.407Z"
5/22/2026, 4:14:34 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
933
moderation_latency
(null)
generation_time
168339
tokens_prompt
77
tokens_completion
8837
native_tokens_prompt
76
native_tokens_completion
12856
native_tokens_completion_images
(null)
native_tokens_reasoning
12292
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.09661
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-294bfb6d-300e-9fbb-be1c-6d33376ea611"
is_byok
false
latency
933
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779466474-ugATmCvOoFFS1aumTBTV"
session_id
(null)
api_type
"completions"
id
"gen-1779466474-60Y97hx7oZskVlqG1dku"
upstream_id
"chatcmpl-294bfb6d-300e-9fbb-be1c-6d33376ea611"
total_cost
0.09661
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 50.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
250
100.0000%