NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS86R5P6R0XJD9RPB95VPZAP
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
1m 58s
Cost:
$0.060422
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the inflatable bouncy castle, carefully studying the chaotic birthday party through her binoculars. Toddlers screamed in pure joy while a stressed clown juggled flaming pins near the buffet table. The target was a massive pile of brightly wrapped gifts sitting vulnerably on the patio. Bill adjusted his fake mustache and checked his watch, looking entirely out of place in his oversized mascot costume. The furry orange paws made holding his blueprint incredibly difficult. He sighed, wiping sweat from his forehead before glancing at his partner. The plan was simple, yet incredibly risky. They would wait for the musical chairs game to reach its chaotic climax, slip past the distracted parents, and grab every single box. Security consisted of one exhausted grandmother and a sleeping golden retriever. Still, the sheer volume of wrapping paper made extraction complicated. Molly signaled with two fingers, gesturing toward the primary escape route near the slip and slide. "We definitely need a much bigger bag for all those large Lego sets." "I already brought the giant tactical sack." Bill nodded, adjusting his heavy canvas gear. He took a deep breath, preparing himself for the most ridiculous heist of their entire criminal careers today.
created_at
"2026-05-22T16:02:01.556Z"
5/22/2026, 4:02:01 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
963
moderation_latency
(null)
generation_time
118374
tokens_prompt
77
tokens_completion
5418
native_tokens_prompt
76
native_tokens_completion
8031
native_tokens_completion_images
(null)
native_tokens_reasoning
7783
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0604225
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-65fb99f6-6997-93ff-a3b6-5f5ffc97996a"
is_byok
false
latency
963
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779465721-UvV3LCsqyuXF6MW1LNPw"
session_id
(null)
api_type
"completions"
id
"gen-1779465721-1iFjPgIU638PltnqtU93"
upstream_id
"chatcmpl-65fb99f6-6997-93ff-a3b6-5f5ffc97996a"
total_cost
0.0604225
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
100.00%
Dialogue to Total Word Ratio
Ratio: 10.00%, Deviation: 0.00%
neededClean
false
wordsTotal
200
wordsDialogue
20
100.0000%