NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS8745HE7NCBR1HYK8C3AH06
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
2m 59s
Cost:
$0.106090
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
50.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly smoothed the crayon blueprint across the sticky diner table. She tapped a heavily shaded square near the patio while looking very closely at Bill. "The bouncy castle is our primary infiltration point today. We drop in from the oak tree, bypass the ball pit, and quickly secure the perimeter." Bill squinted at the drawing, wiping a smear of ketchup off the edge. He adjusted his glasses and sighed very heavily at the crazy plan. "You want us to repel into an inflatable fortress? I am a professional safecracker, Molly. I just do not do ball pits or scary clowns." Molly leaned forward, her eyes narrowing with intense focus. She traced a line toward the garage, ignoring his loud complaints about the big bouncy castle. "The garage is where they are staging the loot. Forty wrapped boxes. I heard the birthday kid is getting the brand new gaming console today." Bill crossed his arms, looking entirely unconvinced by the crudely drawn map. He shook his head slowly, still thinking about the giant inflatable fortress outside. "There will be a clown. You cannot account for the clown. They are highly unpredictable and they carry balloon animals that could be dangerous weapons." Molly pulled a small vial of liquid from her jacket pocket and set it gently on the map, right over the poorly drawn clown figure. "I have synthesized a compound that dissolves latex. The clown will be neutralized before he can twist a single poodle or make any silly jokes." Bill picked up the vial, inspecting the murky liquid inside. He set it down and rubbed his temples, trying to understand her weird criminal logic. "What about the parents? There will be at least twenty adults drinking cheap wine and watching us suspiciously while we carry the heavy gift boxes." Molly smiled, revealing a perfectly executed plan that she had spent all week perfecting. She pointed to the driveway on the colorful crayon map drawing. "We hire a fake magician to perform in the front yard. While they watch the disappearing dove trick, we quickly load up the stolen goods." Bill stared at her, finally seeing the sheer brilliance of the ridiculous operation. He nodded in quiet agreement, accepting his vital role in this heist. "Fine. But I am not wearing the pointy hat. I have my dignity, and I refuse to look foolish in front of young toddlers today." Molly rolled up the crayon drawing and tucked it safely into her bag. She stood up and grabbed her coat from the empty wooden chair. "You will wear whatever I tell you to wear. Now let us go buy some invisible string and a rabbit for the big magic trick."
created_at
"2026-05-22T16:08:34.62Z"
5/22/2026, 4:08:34 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
910
moderation_latency
(null)
generation_time
179326
tokens_prompt
77
tokens_completion
10869
native_tokens_prompt
76
native_tokens_completion
14120
native_tokens_completion_images
(null)
native_tokens_reasoning
13574
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.10609
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-018be07b-ee38-9463-99f0-147a3dd2eb9f"
is_byok
false
latency
910
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779466114-8EE19XvVL3w7PQ2QfeWE"
session_id
(null)
api_type
"completions"
id
"gen-1779466114-YcXySQpVpN1Kf1ac8kBk"
upstream_id
"chatcmpl-018be07b-ee38-9463-99f0-147a3dd2eb9f"
total_cost
0.10609
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
450
100.00%
Dialogue to Total Word Ratio
Ratio: 50.00%, Deviation: 0.00%
neededClean
false
wordsTotal
450
wordsDialogue
225
50.0000%