NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS86RQ0Y3W2AJRGD94A6DKA8
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 30s
Cost:
$0.077268
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped her pen against the crumpled blueprint of the suburban house. The living room was clearly marked with a large red circle indicating the exact location of the birthday presents. Bill sat across from her in the cramped surveillance van, nervously chewing on a stale pretzel. Crumbs tumbled down his stained shirt, landing on the steering wheel. He brushed them away with a clumsy swipe of his hand, smearing butter across the plastic. Molly sighed and rubbed her temples, feeling a familiar headache forming behind her eyes. She had worked with professionals, safecrackers, and getaway drivers, but Bill was something else entirely. He was a liability wrapped in a polyester jacket. The target was a seventh birthday party for the son of a wealthy local tech executive. The security was surprisingly light, mostly because the parents assumed no one would rob a birthday event for kids. Molly knew better. Opportunities existed everywhere if you looked hard enough. She adjusted the binoculars and peered through the tinted window at the brightly colored balloons tied to the mailbox. A minivan pulled into the driveway, dropping off three screaming children. The chaos was beginning. Perfect cover for their infiltration. She checked her watch, noting the time. They had exactly forty minutes before the parents started serving pizza. Bill shifted in his seat, the leather squeaking loudly in the quiet van. He reached for his coffee cup, knocking it over and spilling lukewarm liquid across the dashboard. Molly closed her eyes and took a deep breath, counting to ten in her head. When she opened them, Bill was frantically wiping the spill with a dirty rag. She snatched the rag from his hands and threw it into the back. There was no room for error today. The score was too big, and her patience was too thin. She grabbed her earpiece and tested the microphone one last time. The static cleared, leaving only the sound of heavy breathing. She turned to him, her expression hardening into a mask of pure determination. The game was about to begin. She smiled and then nodded slowly. "Are you sure about the bouncy castle?" "It blocks the patio. We use the kitchen." "What about the clown?" "He smokes at three. We strike then." "I hate clowns. They have crazy eyes." "Focus. The target is the gift pile." "Twenty kids means twenty presents. Maybe more." "We grab the big boxes first. Electronics." "What if it is just clothes?" "Then we fence them. Keep up." "I need a disguise. A mascot suit." "No suits. You will trip and crush a toddler." "I could be a magician." "You do not know magic tricks." "I can pull a coin from an ear." "That will not fool those little kids." "Fine. What do I wear?" "A catering uniform. You carry the dummy cake." "Is the dummy cake edible?" "It is styrofoam. Just carry it." "And do not eat the real cake." "I am allergic to fondant anyway." "Good. Now start the van."
created_at
"2026-05-22T16:02:19.309Z"
5/22/2026, 4:02:19 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
915
moderation_latency
(null)
generation_time
149928
tokens_prompt
77
tokens_completion
7303
native_tokens_prompt
76
native_tokens_completion
10277
native_tokens_completion_images
(null)
native_tokens_reasoning
9635
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0772675
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-7b8f509c-2639-97da-bbc6-2ab0b2b556b9"
is_byok
false
latency
915
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779465739-pRHvDpOCUj1R2MU12u73"
session_id
(null)
api_type
"completions"
id
"gen-1779465739-KBhsFDzC87rF4eQnYiym"
upstream_id
"chatcmpl-7b8f509c-2639-97da-bbc6-2ab0b2b556b9"
total_cost
0.0772675
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 30.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
150
100.0000%