NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS85Y8NCK6098T00DTPVPTM3
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 25s
Cost:
$0.075573
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly smoothed the crayon drawing of the Henderson residence on the sticky diner table. She tapped a greasy fingernail against the poorly rendered backyard, right where a bouncy castle was sketched in aggressive purple wax. Bill leaned in, squinting through his thick spectacles while nervously shredding a paper napkin into tiny, snowy confetti pieces. This was supposed to be their masterpiece, yet they were studying a map drawn by a child. "We enter through the pet door. The golden retriever is easily bribed with premium liver treats. He loves them." Molly shook her head, her expression deadpan as she pulled a laminated schedule from her trench coat pocket. She pointed to a block of time labeled cake and ice cream, highlighting the chaotic window they needed to exploit. The sheer absurdity of planning a tactical insertion around a magic show was not lost on her, but professionalism demanded absolute focus. "The dog is irrelevant. We strike during the rabbit trick. While the kids are distracted, we rappel from the oak tree and bag the Lego sets. It is a solid plan." Bill frowned, adjusting his glasses as he mentally calculated the structural integrity of the oak tree branches. He had never been particularly athletic, and the thought of dangling above a swarm of hyperactive children fueled by pure sugar made his stomach perform uncomfortable acrobatics. He looked back down at the crayon map, tracing the route from the patio to the dining room. "What about the pinata? If the candy scatters, it creates a slip hazard. I cannot afford another torn meniscus. My chiropractor says my lumbar is already compromised. I need surgery." Molly sighed, pinching the bridge of her nose as she stared at her overly cautious partner. She reached across the table and snatched the shredded napkin from his trembling hands, tossing it into a half empty mug of cold coffee. They had robbed armoured trucks and bypassed laser grids, yet a papier mache donkey filled with chocolate coins was threatening to derail the entire operation. "I will neutralize the pinata. Just grab the wrapping paper. We are stealing birthday gifts, not defusing nuclear warheads. Stay focused." Bill nodded slowly, though his pale complexion suggested he was not convinced. He folded the crayon map and tucked it into his breast pocket, right next to his lockpicks and a canister of Silly String he had acquired for crowd control. The diner waitress walked past, refilling their coffee mugs. "I brought the Silly String. If the clown attacks, I will spray him directly in the eyes. It is highly effective and completely nonlethal. Trust me." Molly slowly closed her eyes, silently praying for patience. She had trained under the greatest cat burglars in Europe, learning the delicate art of silent entry. Now she was coordinating a strike team armed with aerosol toys against a suburban birthday celebration. "Put the Silly String away. We do not engage the clown. We secure the loot and vanish before the parents cut the cake."
created_at
"2026-05-22T15:47:52.799Z"
5/22/2026, 3:47:52 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1052
moderation_latency
(null)
generation_time
144724
tokens_prompt
77
tokens_completion
8546
native_tokens_prompt
76
native_tokens_completion
10051
native_tokens_completion_images
(null)
native_tokens_reasoning
9417
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0755725
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-00e287b4-5cb6-9afa-8409-020bafcc9fc9"
is_byok
false
latency
1052
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779464872-xXqNz4iPG4hjx5WyCcCo"
session_id
(null)
api_type
"completions"
id
"gen-1779464872-7sDH37LRJ3GIiLrnfaPY"
upstream_id
"chatcmpl-00e287b4-5cb6-9afa-8409-020bafcc9fc9"
total_cost
0.0755725
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 30.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
150
100.0000%