NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS867501TGNE0WJCAA00ZMD8
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
1m 50s
Cost:
$0.058322
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
36.79%
Dialogue to Total Word Ratio
(details)
68.3940%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the hedges, studying the chaotic backyard through her binoculars. Seven toddlers were currently destroying a paper donkey while a stressed father desperately tried to distribute juice boxes. It was the perfect cover for their grand larceny. The sheer volume of wrapped toys piled on the patio table was staggering, a treasure trove of electronic gadgets and plastic monstrosities just waiting to be fenced on the black market. Bill adjusted his suspenders, sweating profusely in the afternoon heat. He checked his utility belt, ensuring they had enough juice to bribe any wandering toddlers. Molly turned to her partner, her eyes gleaming with criminal ambition. We need a giant cartoon bag. Bill nodded enthusiastically, wiping his brow with a stained handkerchief. He reached into the rusted trunk of their getaway sedan and pulled out their gear. I brought the fuzzy clown wig. Molly smiled, admiring his commitment to the disguise. She pointed toward the patio where the gifts were unguarded. Grab the wooden stick. Bill grinned, hoisting the heavy timber over his shoulder while eyeing the massive inflatable structure nearby. And the bouncy castle. They both marched forward, ready to execute the greatest birthday heist in suburban criminal history.
created_at
"2026-05-22T15:52:43.792Z"
5/22/2026, 3:52:43 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
856
moderation_latency
(null)
generation_time
109977
tokens_prompt
77
tokens_completion
5686
native_tokens_prompt
76
native_tokens_completion
7751
native_tokens_completion_images
(null)
native_tokens_reasoning
7493
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0583225
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-b785e61b-0267-9eb5-a291-639654f4a61f"
is_byok
false
latency
856
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779465163-7FlPrQiqpnv7uZkCetQx"
session_id
(null)
api_type
"completions"
id
"gen-1779465163-SBsC7T78kDU5qbBiX6Si"
upstream_id
"chatcmpl-b785e61b-0267-9eb5-a291-639654f4a61f"
total_cost
0.0583225
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
36.79%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 10.00%
neededClean
false
wordsTotal
200
wordsDialogue
0
68.3940%