NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS86NAQN23E227RX3WST1G5E
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
3m 10s
Cost:
$0.093310
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crumpled map across the sticky diner table, smoothing out the creases with her palm. She tapped a red circle near the backyard fence. Bill leaned forward, squinting at the drawing while chewing on a plastic stirrer. He pointed a thick finger at the shaded rectangle in the center. "We need a blueprint of the bouncy castle." Molly sighed, rubbing her temples to ward off the impending headache. She had worked with him for five years, yet his logic still baffled her. "It is an inflatable structure Bill. There are no supporting walls." Bill crossed his arms, defensive and entirely serious about his tactical concerns. He glanced toward the window, checking for imaginary surveillance vans. "What if the clown is armed? Those squeaky shoes hide deadly secrets." Molly took a slow sip of her cold coffee, staring at him with profound exhaustion. The diner smelled of burnt grease and stale pancakes. "He makes balloon animals. His only weapon is mild entertainment." Bill tapped his chin, considering his options for stealth and infiltration. He reached into his jacket pocket and pulled out a fuzzy brown fabric swatch. "I will wear a disguise. Maybe a giant teddy bear costume." Molly shook her head, visualizing the absolute disaster that would unfold. She gestured to his broad shoulders and towering height. "You are six foot four. You will look like a mutated bear." Bill grinned, clearly pleased with his own terrifying ingenuity. He stuffed the fabric back into his pocket and leaned closer to the map. "Perfect. Kids love monsters. I will grab the gifts while they scream." Molly pinched the bridge of her nose, questioning every life choice that led her to this specific moment. She tapped the table for emphasis. "They are just plastic toys. We are supposed to be professional thieves." Bill looked down at his hands, suddenly adopting a deeply sentimental expression. He swallowed hard, thinking of his sister and her demanding offspring. "My nephew wants the robotic dinosaur. I am a family man." Molly rolled her eyes, accepting defeat against his bizarre familial devotion. She folded the map and slid it into her leather satchel. "Fine. We infiltrate during musical chairs. Grab the loot." Bill perked up, his mind already racing with unnecessary tactical gadgets. He patted his belt, checking for gear that he definitely did not need. "Should I bring a grappling hook for the gym?" Molly stood up, tossing a crumpled five dollar bill onto the table. She adjusted her coat and gave him one final warning look. "Just bring a duffel bag and try not to cry." Bill looked offended, crossing his arms tightly over his chest. He remembered the rooftop incident very clearly and still felt unjustly judged. "I never cry on the job. Except that time with pigeons." Molly walked toward the diner exit, pushing the heavy glass door open. She paused, looking back at her ridiculous partner in crime. "Focus. We strike at two. Pin the tail is our final cue."
created_at
"2026-05-22T16:00:28.419Z"
5/22/2026, 4:00:28 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
902
moderation_latency
(null)
generation_time
190186
tokens_prompt
77
tokens_completion
8858
native_tokens_prompt
76
native_tokens_completion
12416
native_tokens_completion_images
(null)
native_tokens_reasoning
11792
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.09331
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-c22d0246-ea04-9747-b181-35a0ee094742"
is_byok
false
latency
902
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779465628-zlvR9eHUjsXsTvTyOmKp"
session_id
(null)
api_type
"completions"
id
"gen-1779465628-9ojeEeDn6ui7wvZRXX1C"
upstream_id
"chatcmpl-c22d0246-ea04-9747-b181-35a0ee094742"
total_cost
0.09331
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 30.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
150
100.0000%