NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS8659CS8KZBXSXNX5GXBQ20
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4m 6s
Cost:
$0.132550
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
99.98%
Matches word count
(details)
99.97%
Dialogue to Total Word Ratio
(details)
99.9759%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly smoothed the crumpled blueprint across the sticky diner table, weighing down the corners with heavy salt shakers. The schematic detailed a sprawling suburban backyard, complete with a manicured lawn and a brick patio. Bill sat opposite her, carefully balancing a sugar packet on his spoon, completely ignoring the meticulous plan today. "The bouncy castle is our primary distraction." Molly tapped the paper impatiently, trying to draw his attention to the main entry point. "I absolutely despise bouncy castles." Bill just sighed loudly and looked away, clearly uninterested in the logistical challenges of their upcoming operation. "Just listen to my instructions." Molly glared at him, her patience wearing incredibly thin. "What about the clown?" She pointed directly at the wooden gate, emphasizing the perimeter breach. "The clown is a trained professional. We avoid him." Bill merely shrugged his shoulders and picked at his fingernails, acting like a bored teenager. "He makes balloon animals." Molly slammed her fist down on the table, making the silverware rattle. "Focus on the presents." Bill flinched slightly, finally looking at the map. "They are wrapped in shiny paper." Molly took a deep breath to calm her rising anger. "We grab the gifts and run." Bill smiled weakly, offering a silent apology. "What if a child cries?" Molly rubbed her temples, feeling a massive headache forming. "We do not care about crying children." Bill sipped his cold coffee, trying to appear busy. "That seems incredibly mean." Molly traced the property line with her index finger, outlining the escape route. "We are literal thieves." Bill yawned widely, showing zero enthusiasm. "I still feel slightly guilty." Molly shook her head in pure disbelief. "Just bring the large duffel bag." Bill checked his wristwatch, eager to leave. "I forgot the bag." Molly leaned forward to explain the final phase. "You forgot the bag?" Bill leaned back in his vinyl booth. "I brought a basket instead." Molly sighed in deep frustration. "A wicker basket?" Bill grinned foolishly at her reaction. "It has a nice handle." Molly crossed her arms, waiting for him to speak. "We cannot steal fifty presents in a basket." Bill tapped his foot against the linoleum floor. "I will make two trips." Molly uncrossed her arms and grabbed her pen. "You will get caught." Bill stopped tapping and grabbed his paper napkin. "I am very fast." Molly drew a large circle around the garage. "You have bad knees." Bill folded his napkin into a tiny triangle. "My knees are perfectly fine." Molly drew an arrow pointing to the side door. "Just stay by the van." Bill unfolded his napkin to start over. "I want to see the magic show." Molly circled the exact location of the gift table. "There is no magic show." Bill crumpled his napkin into a small ball. Molly tapped the garage again for emphasis. Bill tossed the paper ball across the table. Molly watched it land near her glass. Bill watched it fall short. Molly picked it up. Bill looked down.
created_at
"2026-05-22T15:51:42.76Z"
5/22/2026, 3:51:42 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1538
moderation_latency
(null)
generation_time
245796
tokens_prompt
77
tokens_completion
13152
native_tokens_prompt
76
native_tokens_completion
17648
native_tokens_completion_images
(null)
native_tokens_reasoning
16948
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.13255
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-33d6a73a-0e43-97ff-9eca-6a6ee267ce00"
is_byok
false
latency
1538
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779465102-f0ByLfRsbcVCaEWl8cik"
session_id
(null)
api_type
"completions"
id
"gen-1779465102-LW8BmBFYyC25b3d3u7Yl"
upstream_id
"chatcmpl-33d6a73a-0e43-97ff-9eca-6a6ee267ce00"
total_cost
0.13255
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.98%
Matches word count
n/a
neededClean
false
words
498
99.97%
Dialogue to Total Word Ratio
Ratio: 29.92%, Deviation: 0.08%
neededClean
false
wordsTotal
498
wordsDialogue
149
99.9759%