NC Bench
Run: 2026-05-22-01KS860X3FZ1AB7CZCD3GX64SZ
Run Details
Model: Qwen3.7 Max
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 3m 21s
Cost: $0.102527
Provider: Alibaba
Evaluation summary
Score        Evaluator
100.00%      Matches word count
100.00%      Dialogue to Total Word Ratio
100.0000%    Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crayon drawing across the dashboard of their rusted sedan. She tapped a greasy finger against the poorly sketched bouncy castle, ignoring the cookie crumbs embedded in the paper. Bill adjusted his binoculars, staring through the windshield at the chaotic suburban backyard. A dozen hyperactive toddlers were currently tearing across the manicured lawn, wielding plastic swords and screaming at a terrifyingly cheerful magician. The afternoon sun beat down on the vinyl roof of their vehicle, making the interior smell like stale fries and bad decisions. Molly checked her watch, calculating the precise moment the sugar crash would hit the tiny hostages. She reached into the backseat and pulled out two black ski masks, tossing one onto his lap. Bill caught it clumsily, nearly dropping his binoculars onto the floorboards. He looked at the mask, then back at the sprawling fortress of inflatable entertainment. The magician was now pulling an endless stream of colorful handkerchiefs from his sleeve, mesmerizing the small crowd. Molly sighed heavily, recognizing the familiar hesitation in the posture of her own partner. "The bouncy castle is the primary distraction. Once the kids are inside, we breach the patio." She leaned over and snatched the mask from his hands, pulling it over her own head. "What about the clown? He looks like he knows martial arts." The scratchy wool immediately made her nose itch, but she maintained her fierce glare. "He makes balloon animals. He is not a threat. We grab the wrapped boxes and exit through the petting zoo area." Bill finally picked up his own mask, turning it over in his hands as he watched a golden retriever chase a rogue frisbee past the gift table. "The goats will eat our getaway rope." The presents were piled high, wrapped in shiny paper and topped with elaborate bows, completely unguarded. "Then we use the ladder. Just focus on the pinata. When it breaks, we move." 
He took a deep breath, steeling himself for the ridiculous operation ahead. "I still think we should wear mascot costumes." Molly opened her door, letting the sounds of shrieking children and upbeat carnival music flood the quiet interior. "We are stealing Lego sets. We are not robbing a bank. Keep it simple." She stepped out onto the pavement, adjusting her dark jacket. "What if the pinata is a decoy?" Bill followed, shutting the door behind him with a soft click. "It is a cardboard donkey filled with candy. It is not a decoy." They walked toward the wooden fence, moving with the exaggerated stealth of cartoon villains. "I am just saying the target is seven. These kids are absolutely ruthless." The wooden gate creaked loudly as they pushed it open, revealing the glorious treasure trove waiting just beyond the slippery slide. "Stick to the plan. Grab the gifts. Ignore the juice boxes." Molly grinned beneath her wool mask. "Fine, but I get to keep the giant pink teddy bear." Bill gulped very nervously in sheer absolute fear. "Drive the van."
created_at: "2026-05-22T15:49:19.102Z" (5/22/2026, 3:49:19 PM)
model: "qwen/qwen3.7-max-20260520"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1088
moderation_latency: (null)
generation_time: 200837
tokens_prompt: 77
tokens_completion: 10258
native_tokens_prompt: 76
native_tokens_completion: 13645
native_tokens_completion_images: (null)
native_tokens_reasoning: 13013
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.1025275
router: (null)
provider_responses: 0
endpoint_id: "6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id: "chatcmpl-3ccff259-7a95-9442-9d19-b8627876c0b8"
is_byok: false
latency: 1088
model_permaslug: "qwen/qwen3.7-max-20260520"
provider_name: "Alibaba"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779464959-oYuXV9MRkH5KwlzCw61F"
session_id: (null)
api_type: "completions"
id: "gen-1779464959-Y7HbFTpBBd44kjQL01Da"
upstream_id: "chatcmpl-3ccff259-7a95-9442-9d19-b8627876c0b8"
total_cost: 0.1025275
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
response_cache_source_id: (null)
Evaluation details
Result: 100.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 500

Result: 100.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 30.00%, Deviation: 0.00%
Meta Data: neededClean: false, wordsTotal: 500, wordsDialogue: 150

Overall: 100.0000%
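
The ratio and deviation figures reported above can be reproduced with a short sketch. This is an illustration under stated assumptions, not the evaluator's actual code: the function names `dialogue_ratio` and `deviation_from_target` are hypothetical, words are assumed to be whitespace-separated tokens, and dialogue is assumed to be any text between straight double quotes. The page does not document how NC Bench tokenizes words or how it maps a deviation to a score, so neither is modeled here.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (total words, dialogue words, dialogue/total ratio).

    Assumption: a "word" is a whitespace-separated token and dialogue is
    everything enclosed in straight double quotes.
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = words_dialogue / words_total if words_total else 0.0
    return words_total, words_dialogue, ratio


def deviation_from_target(words_dialogue: int, words_total: int,
                          target: float = 0.30) -> float:
    """Absolute difference between the observed ratio and the target ratio."""
    return abs(words_dialogue / words_total - target)


if __name__ == "__main__":
    # Counts taken from the Meta Data of this run: wordsDialogue and wordsTotal.
    print(f"Ratio: {150 / 500:.2%}")
    print(f"Deviation: {deviation_from_target(150, 500):.2%}")
```

With the counts from this run (150 dialogue words out of 500), the sketch gives a ratio of 30.00% and a deviation of 0.00%, matching the Details column.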