NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS86EM0B86TDM9SD60DKA9GG
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
2m 52s
Cost:
$0.082668
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crayon drawing across the dashboard. She pointed at a poorly sketched bouncy castle. "The target is the dining room," she said. "Presents are stacked near the sliding glass door." Bill squinted at the map, adjusting his binoculars. "What about the bouncy castle?" he asked. "It blocks the sightline from the street." "Exactly," Molly replied, tapping the paper. "We use the clown as a distraction. When he starts making balloon animals, the parents will look away." Bill rubbed his chin, looking extremely nervous. "Clowns are unpredictable," he whispered. "What if he makes a balloon sword and attacks us?" Molly rolled her eyes and sighed heavily. "He is a minimum wage entertainer, Bill. Focus on the pinata. That is our primary breach point." Bill leaned closer to the dirty windshield. "The pinata is shaped like a donkey," he noted. "It looks heavily reinforced." "It is cardboard," Molly explained patiently. "Once the kids break it open, the sugar rush will cause chaos. That is when we strike." Bill nodded slowly, finally understanding the plan. "So we grab the gifts while the children are vibrating from candy," he said. "Precisely," Molly said, starting the engine. "But we must avoid the juice boxes. Spilled fruit punch ruins our stealth approach." Bill checked his watch and then adjusted his tactical vest. "What about the parents?" he asked while checking his gear. "They will be busy cutting the large chocolate cake." Molly pulled out a pair of dark sunglasses. "The mother is highly distracted by the photographer today." She checked her reflection in the rearview mirror carefully. "We only have three minutes before the magic show begins." Bill grabbed his canvas bag from the back seat. "Should I bring the grappling hook just in case?" Molly shook her head and opened the van door. "No grappling hooks near the inflatable play structures today." She stepped out into the bright afternoon suburban sunlight. "A sharp hook would pop the castle and ruin everything." Bill followed her out and closed the door quietly. "I still think the clown is a major security risk." They walked across the perfectly manicured green lawn together. "He keeps looking directly at our parked van suspiciously." Molly ignored his paranoid complaints and approached the gate. "Just smile and pretend we are distant relatives visiting." She unlatched the wooden gate and slipped inside smoothly. "Remember to grab the large boxes first," she whispered. "The small boxes are probably just cheap clothing items." Bill nodded and wiped sweat from his forehead quickly. "Got it," he replied while staring at the massive pile of brightly wrapped gifts sitting near the patio table. "I will grab the biggest one," he added with a greedy smile on his face. "Hurry," she hissed while watching the children run around the yard. "They are finishing," she added while pointing toward the large patio table. "Go." Bill sprinted toward the house while Molly distracted everyone. She loudly asked the clown to make a very large swan. He nodded happily.
created_at
"2026-05-22T15:56:48.538Z"
5/22/2026, 3:56:48 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
972
moderation_latency
(null)
generation_time
171440
tokens_prompt
77
tokens_completion
8764
native_tokens_prompt
76
native_tokens_completion
10997
native_tokens_completion_images
(null)
native_tokens_reasoning
10320
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0826675
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-56adda54-b172-97b5-a439-4237caa6039e"
is_byok
false
latency
972
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779465408-70xp0U9Rhzkb8aDScNf1"
session_id
(null)
api_type
"completions"
id
"gen-1779465408-HPKPJQxHwg1khlOaZ2kp"
upstream_id
"chatcmpl-56adda54-b172-97b5-a439-4237caa6039e"
total_cost
0.0826675
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 50.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
250
100.0000%