NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS86WHNSSFRFC3SD3MF7QE02
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
2m 45s
Cost:
$0.087227
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
1.04%
Matches word count
(details)
26.78%
Dialogue to Total Word Ratio
(details)
13.9092%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crayon drawing across the sticky kitchen table, smoothing out wrinkles with her bare palms. Bill squinted at the messy blueprint, adjusting his thick glasses to make sense of the chaotic scribbles. "Is this a house or a giant potato?" Bill asked. "It is the primary target location," Molly said. "Timmy turns seven tomorrow at exactly two," she added. Bill tapped a large purple circle drawn on the thick paper. "What exactly is this purple blob?" Bill asked. "That is the inflatable bouncy castle," Molly explained. "We must avoid it at all possible costs," she warned. "Why should we avoid it?" Bill asked. "It is all about static electricity," Molly replied. She shuddered at the thought of their expensive gear failing during the critical extraction phase. "It will completely ruin our tactical stealth suits," she said. Bill nodded slowly and pointed to a bright yellow square near the back door. "And what about the yellow box?" Bill asked. "That is the main present table," Molly whispered. "That is where the valuable loot sits," she said. "Are there any armed guards?" Bill asked. Molly took a sip of her cold coffee. "Only a clown named Bozo," Molly said. "He makes balloon animals for the guests," she added. "I really hate those stupid balloon animals," Bill muttered. He remembered a childhood birthday where a latex poodle popped in his face. "Focus on the mission," Molly snapped. "We enter through the kitchen window," she said. "Then we neutralize the juice boxes," she added. Bill looked confused and scratched his bare chin while staring at the map. "Why the juice boxes?" Bill asked. "Yes, that is absolutely correct," Molly said. "Sticky fingers compromise the physical evidence," she explained. "We swap them for plain water bottles," she said. Bill sighed deeply and leaned back in his wooden chair, rubbing his tired eyes. "This whole plan just feels very wrong," Bill said. "We are literally robbing small children," he added. "We are robbing incredibly spoiled children," Molly corrected. "Timmy already has a tablet," she said. "He does not need a remote control car," she added. Molly firmly believed redistributing wealth from greedy toddlers was a moral obligation. Bill crossed his arms over his chest, clearly doubting the operational strategy. "What about the giant paper piƱata?" Bill asked. "It is a clever decoy," Molly said. "It is filled with cheap chocolate candy," she warned. "Do not touch the paper donkey," she ordered. She knew the structural integrity of the cardboard beast was highly unpredictable. The room grew quiet as the gravity of the situation settled between them. Bill grabbed his black beanie from the table and pulled it down over his ears. "That is absolutely fine with me," Bill said. "I will bring the plastic water bottles," he offered. "And I will handle the clown," Molly said. "Just do not make him cry," Bill said. "Clowns cry very loudly in public spaces," he warned. Molly folded the crayon drawing very carefully and slipped it into her hidden jacket pocket. "I never make anyone cry on purpose," Molly lied. She checked her watch and signaled for them to leave the safehouse immediately.
created_at
"2026-05-22T16:04:24.904Z"
5/22/2026, 4:04:24 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
870
moderation_latency
(null)
generation_time
164552
tokens_prompt
77
tokens_completion
10258
native_tokens_prompt
76
native_tokens_completion
11605
native_tokens_completion_images
(null)
native_tokens_reasoning
10911
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0872275
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-d06c6730-7fb2-9448-99a5-b0d6b00aaddf"
is_byok
false
latency
870
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779465864-0IdtDvWvpUWFxTxfHMMm"
session_id
(null)
api_type
"completions"
id
"gen-1779465864-FULv5AoSEJYaNExeohJK"
upstream_id
"chatcmpl-d06c6730-7fb2-9448-99a5-b0d6b00aaddf"
total_cost
0.0872275
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
1.04%
Matches word count
n/a
neededClean
false
words
526
26.78%
Dialogue to Total Word Ratio
Ratio: 44.87%, Deviation: 5.13%
neededClean
false
wordsTotal
526
wordsDialogue
236
13.9092%