NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS86R7QY0BM7TA4YX50GSWN4
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 44s
Cost:
$0.082818
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
75.97%
Dialogue to Total Word Ratio
(details)
87.9827%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped the crayon drawing of the suburban fortress. "The bouncy castle is the primary choke point. We go in through the ball pit." Bill squinted at the paper. "A ball pit? Molly, it is a backyard birthday party, not a casino. There is no ball pit." Molly slid a juice box across the table. "There is a bounce house. We use the structural integrity of the inflatable walls to mask our thermal signature." Bill rubbed his temples. "We are stealing wrapped boxes from children. Thermal signatures are not our biggest problem. The actual problem is the clown." Molly nodded slowly. "The clown is a mercenary. I ran his plates. He drives a rented sedan and owes money to a guy named Knuckles. He will be distracted by the cake." Bill leaned back in his chair. "You want to bribe a clown with cake?" Molly shook her head. "I want to neutralize the clown with a sugar crash. While he is passed out behind the tool shed, we breach the patio doors." Bill pointed at the map. "What about the parents? There will be at least fifteen enraged mothers armed with hot glue guns and intense judgment." Molly smiled confidently. "We wear camouflage." Bill frowned in confusion. "Camouflage? In a suburban backyard?" Molly stood up and demonstrated. "We dress as inflatable tube men. Nobody looks directly at an inflatable tube man. It is a proven psychological blind spot." Bill stared at her for a long moment. "You want us to flap our arms in the wind while carrying fifty pounds of plastic toys." Molly sat back down. "Exactly. Now, the extraction. We load the loot into the large pinata." Bill sighed heavily. "The pinata is made of paper and hangs from an oak tree." Molly tapped the map again. "Which is why we reinforce it with titanium struts and attach a winch to the minivan. When the kids hit it, it drops the toys into our getaway vehicle." Bill crossed his arms. "And if a kid hits it too hard and breaks the titanium struts?" Molly took a loud sip from her juice box. "Then we blame the clown. Are you in or out, Bill?" Bill closed his eyes and surrendered. "I will buy the tube man pants." Molly grabbed a purple marker and circled the driveway. "We also need a distraction for the neighborhood watch. Martha Higgins is always watching from her porch." Bill looked out the window. "Martha Higgins is blind and uses a walker." Molly capped the marker aggressively. "She has a very loud dog. We will throw a steak over the fence to keep the animal quiet." Bill nodded and grabbed his jacket. "Fine. If the dog eats my steak, I am stealing the shoes of the clown." Molly smiled and packed the crayons. "Deal. Let us go buy inflatable pants before the party store closes." Bill opened the heavy wooden door and stepped outside into the sun. "I hope they have my exact size."
created_at
"2026-05-22T16:02:03.661Z"
5/22/2026, 4:02:03 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1107
moderation_latency
(null)
generation_time
163844
tokens_prompt
77
tokens_completion
9366
native_tokens_prompt
76
native_tokens_completion
11017
native_tokens_completion_images
(null)
native_tokens_reasoning
10393
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0828175
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-12aa8292-f7cf-9484-a371-2dbf23c13fed"
is_byok
false
latency
1104
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779465723-3wKT3iBAmCywOh8Nfkvs"
session_id
(null)
api_type
"completions"
id
"gen-1779465723-I8jAb8hyeCATTbexbb5h"
upstream_id
"chatcmpl-12aa8292-f7cf-9484-a371-2dbf23c13fed"
total_cost
0.0828175
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
499
75.97%
Dialogue to Total Word Ratio
Ratio: 72.34%, Deviation: 2.34%
neededClean
false
wordsTotal
499
wordsDialogue
361
87.9827%