NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS872JESD8CTKZHTC415FQBA
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 28s
Cost:
$0.085060
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crayon drawing across the hood of their rusted sedan. She tapped a sticky finger against a poorly sketched rectangle. The afternoon sun beat down on the suburban driveway, baking the asphalt and melting the chocolate chips in her cookie. Bill leaned against the bumper, sipping a lukewarm soda and questioning every life choice that had led him to this exact moment. He watched a minivan drive past, hoping the neighbors were not paying attention to the two suspicious adults studying a toddler artwork. Molly adjusted her sunglasses, completely oblivious to his growing despair. She traced a line from the driveway to the backyard with a purple marker. The smell of sunscreen and cut grass filled the air, masking the faint scent of her cheap perfume. Bill crushed his empty aluminum can and tossed it into the backseat. He rubbed his temples, feeling a headache blooming behind his eyes. "The bouncy castle is our primary obstacle," Molly said. "It is a balloon. A giant inflatable balloon," Bill replied. "Exactly. A tactical air trap. If we step wrong, we bounce into the punch bowl," she explained. "We are stealing wrapped gifts from young children. Why do we need a tactical plan?" he asked. "Because Timmy is a monster. He saw me at the grocery store. He knows my face," Molly insisted. "You wore a fake mustache," Bill pointed out. "It was a brilliant disguise. Anyway, we enter through the petting zoo enclosure," she whispered. "The petting zoo?" he questioned. "Yes. The goats provide excellent cover. We blend in with the livestock," Molly stated. "I am not dressing as a goat," Bill groaned. "You will wear the goat suit, Bill. It has tactical pockets for the presents," she commanded. "This is the worst heist ever planned," he muttered. "Just grab the Lego sets and run," Molly instructed. "Fine. But I get to keep the robot dog," Bill negotiated. "Deal. Now memorize the clown patrol routes," she ordered. "There are no clown patrols," he sighed. Molly finally looked up, her expression entirely serious despite the juice stain on her collar. She pointed at the garage, then at the side gate, outlining their infiltration path with absolute confidence. Bill sighed heavily, realizing there was no escaping this ridiculous endeavor. He pushed himself off the bumper and walked over to examine the crude map. The neighborhood was quiet, save for the distant sound of children screaming in pure joy. Molly folded the paper carefully and shoved it into her jacket pocket. She checked her watch, noting they had exactly forty minutes before the large paper donkey was completely destroyed. Bill opened the trunk, revealing a pile of absurd costumes and an oversized burlap sack. He pulled out the furry white suit, staring at it with profound disgust. Molly patted his shoulder, offering a sympathetic but firm smile while the distant sound of laughter echoed down the quiet suburban street where they stood waiting for the perfect moment to begin their very silly heist.
created_at
"2026-05-22T16:07:42.312Z"
5/22/2026, 4:07:42 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
876
moderation_latency
(null)
generation_time
147187
tokens_prompt
77
tokens_completion
8746
native_tokens_prompt
76
native_tokens_completion
11316
native_tokens_completion_images
(null)
native_tokens_reasoning
10683
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.08506
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-668772f6-f6fa-9cfd-a863-cfe2b5cb267d"
is_byok
false
latency
876
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779466062-7C3V1WPjTYKJbQOxE6Br"
session_id
(null)
api_type
"completions"
id
"gen-1779466062-3HIYE54BrPTrz1QIgHDE"
upstream_id
"chatcmpl-668772f6-f6fa-9cfd-a863-cfe2b5cb267d"
total_cost
0.08506
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 30.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
150
100.0000%