NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS879M6JJX79AZFJM3JBP094
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 39s
Cost:
$0.088270
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
99.98%
Dialogue to Total Word Ratio
(details)
99.9905%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crayon drawing across the wooden table, smoothing the edges while Bill leaned close. "This is the main perimeter. The bouncy castle is a distraction. We ignore the inflatable dragon." Bill squinted at the messy paper, tracing the crude lines before shaking his head. "What about the scary clown? I hate clowns. They have unpredictable movement patterns." Molly tapped her pen against the table, emphasizing the strict timeline she had calculated. "The scary clown is a mercenary. He leaves at four. We strike at four fifteen." Bill crossed his arms, his leather jacket squeaking as he shifted his weight nervously. "Four fifteen is cake time. The sugar crash will not hit until four forty five. The little kids will be feral." Molly smiled broadly, her eyes gleaming with the thrill of the heist as she pointed outside. "Exactly. Feral kids ignore wrapped boxes. They only want the cake. We slip through the side gate, bypass the ball pit, and bag all the gifts." Bill rubbed his chin, staring at the sheer volume of loot they needed to extract. "How do we carry them? There are at least thirty presents. Some are huge. I saw a giant box the size of a refrigerator." Molly reached under the table and pulled out a faded plastic toy, slamming it right down. "We use the wagon. The red wagon. It has big all terrain wheels." Bill scoffed loudly, throwing his hands up in utter disbelief at her proposed extraction vehicle today. "A wagon? We are professional thieves, Molly. We use tactical duffel bags, not a stupid toddler wagon." Molly shook her head, patiently explaining the psychological camouflage required to operate in broad daylight safely. "The wagon blends in. If a neighbor sees us pulling a wagon full of toys, we look exactly like exhausted parents. If we carry tactical duffels, we look exactly like real burglars." Bill sighed and tapped the drawing. "Fine. What about the dog? The paper blueprint shows a dog." Molly grinned. "It is a goldendoodle named Barnaby. I have a squeaky toy and a slice of cheddar. He will be fully compromised in seconds." Bill stared. "You are seriously bribing a guard dog with cheese?" Molly nodded slowly. "He is a doodle, Bill. He has no loyalty, only appetite. Now, memorize the blind spots. The patio camera is blocked by the paper volcano." "The volcano?" "Timmy loves dinosaurs. It erupts baking soda at three thirty. We use the thick smoke to breach the doors." "Baking soda smoke. We are risking actual prison for a baking soda volcano." "We are risking prison for a pristine Mega Fortress playset and twenty gift cards. Focus, Bill. This is the very big leagues." "I really think we should bring the duffel bags in case the wagon breaks." "The wagon will not break. It is built for rugged play. Just kindly smile if the mother offers you juice." "I really do not want juice." "You will drink the juice. That is a direct order."
created_at
"2026-05-22T16:11:33.474Z"
5/22/2026, 4:11:33 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
822
moderation_latency
(null)
generation_time
158560
tokens_prompt
77
tokens_completion
9417
native_tokens_prompt
76
native_tokens_completion
11744
native_tokens_completion_images
(null)
native_tokens_reasoning
11073
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.08827
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-260afb40-f545-9ff2-8800-5e4aa214a4a6"
is_byok
false
latency
822
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779466293-7Kh5vo1YSeFG8JhW4JtB"
session_id
(null)
api_type
"completions"
id
"gen-1779466293-DWy6cH3ykcx1iRRpUci0"
upstream_id
"chatcmpl-260afb40-f545-9ff2-8800-5e4aa214a4a6"
total_cost
0.08827
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
501
99.98%
Dialogue to Total Word Ratio
Ratio: 70.06%, Deviation: 0.06%
neededClean
false
wordsTotal
501
wordsDialogue
351
99.9905%