NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS85Y8NFE2CNX02312ESAHZ5
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
2m 12s
Cost:
$0.079345
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
9.01%
Dialogue to Total Word Ratio
(details)
49.7446%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the blueprint across the sticky diner table, weighing down the corners with salt shakers to reveal the target details. "The target is Timmy. High value assets, mostly building blocks." Bill squinted at the crude crayon drawing she had bribed from the babysitter. "You brought a map drawn by a toddler. How do we read this?" Molly tapped a purple circle near the kitchen area. "That is the living room. The presents are stacked by the fireplace. We go in through the bouncy castle." Bill leaned back, crossing his arms over his leather jacket. "A bouncy castle. We are elite cat burglars, Molly. We do not do stupid bouncy castles." She slid a juice box across the table toward him. "You will do what the job requires. The castle masks our acoustic footprint. We bounce in and bounce out." Bill poked the straw into the box with unnecessary aggression. "What about the parents? There will be at least twenty adults drinking cheap wine and watching us." Molly pulled a stack of colorful invitations from her trench coat. "We blend in. I am Aunt Susan from Ohio. You are the hired magician. I bought you a top hat." Bill choked on his apple juice, coughing loudly into his napkin. "I am not doing magic tricks. I do not know any magic tricks for children." She reached into her bag and produced a plastic wand with a star on the end. "You just wave this and say the magic word. It buys me three minutes to sweep the gifts into the duffel bags." Bill stared at the wand, his professional pride visibly fracturing. "If anyone asks me to pull a rabbit out of a hat, I am shooting the ceiling." Molly patted his hand sympathetically, careful not to smudge the crayon map. "There are no rabbits. Just twenty screaming kids hopped up on blue frosting. Keep them distracted." He sighed, adjusting his collar as he looked back at the purple circle. "Fine. But I get to keep the giant stuffed bear. I need it for my couch." Molly rolled up the blueprint and tucked it into her coat. "Deal. Now go practice your sleight of hand. We hit the ball pit at noon sharp." Bill grabbed the plastic wand and gave it a halfhearted flick, knocking over a salt shaker. "What if the pinata gets out of hand? I am not getting hit with a bat by a crazed first grader." Molly righted the shaker and brushed the spilled salt onto the floor. "You will confiscate the bat during your grand entrance. Tell them it is part of the illusion." He rubbed his temples, already feeling a headache forming behind his eyes. "This is the worst plan we have ever made. We should just rob a bank." Molly smiled warmly, picking up her lukewarm coffee cup taking a slow, deliberate sip. "Banks have guards. Toddlers have short attention spans. Drink up."
created_at
"2026-05-22T15:47:52.834Z"
5/22/2026, 3:47:52 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
956
moderation_latency
(null)
generation_time
131652
tokens_prompt
77
tokens_completion
8012
native_tokens_prompt
76
native_tokens_completion
10554
native_tokens_completion_images
(null)
native_tokens_reasoning
9908
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.079345
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-8e9d0fac-9435-9065-b7a7-24ca4c232339"
is_byok
false
latency
956
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779464872-aW5QvBz8kSC77UQZLe4U"
session_id
(null)
api_type
"completions"
id
"gen-1779464872-RkyZduZ1DehYtGNlbxkl"
upstream_id
"chatcmpl-8e9d0fac-9435-9065-b7a7-24ca4c232339"
total_cost
0.079345
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
490
9.01%
Dialogue to Total Word Ratio
Ratio: 56.94%, Deviation: 6.94%
neededClean
false
wordsTotal
490
wordsDialogue
279
49.7446%