NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-22-01KS86P6PB0K5Y3BQ19RKJNMBE
Run Details
Model:
Qwen3.7 Max
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
1m 27s
Cost:
$0.041432
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crayon drawing across the sticky diner table and tapped her greasy finger on the bouncy castle schematic. "We enter through the ball pit, Bill, and you must disable the clown before he makes any distracting balloon animals." Bill adjusted his dark sunglasses while nervously chewing on a plastic stirring straw, clearly worried about the notorious birthday entertainer. "That clown is a wildcard, Molly, so what happens if he starts juggling flaming torches near the giant gift table?" She sighed loudly and pointed at the heavily fortified dessert station, knowing the sugar rush would create absolute chaotic cover. "Ignore the cake completely and focus on the presents, because those toddlers guarding the wrapping paper are absolutely ruthless monsters." He nodded slowly and pulled a crumpled inventory list from his jacket pocket to review the expected electronic toy haul. "I will bring the juice boxes to bribe the kids, but you must secure the biggest remote control truck immediately." Molly folded the crude map carefully and slid it into her purse, confident their elaborate heist would finally succeed today. "Let us go steal those birthday gifts before the parents arrive and ruin our perfectly planned criminal operation with vegetables."
created_at
"2026-05-22T16:00:57.05Z"
5/22/2026, 4:00:57 PM
model
"qwen/qwen3.7-max-20260520"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
910
moderation_latency
(null)
generation_time
86557
tokens_prompt
77
tokens_completion
4355
native_tokens_prompt
76
native_tokens_completion
5499
native_tokens_completion_images
(null)
native_tokens_reasoning
5250
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0414325
router
(null)
provider_responses
0
endpoint_id
"6d30fdc6-f2a7-4bdf-b6ba-32353496d723"
id
"chatcmpl-108407ff-bc81-9085-bf7c-6cc7aa073819"
is_byok
false
latency
910
model_permaslug
"qwen/qwen3.7-max-20260520"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779465657-LRdRoRK5UncGfR1TpOeh"
session_id
(null)
api_type
"completions"
id
"gen-1779465657-YeQpob81TUl1LG509Qkg"
upstream_id
"chatcmpl-108407ff-bc81-9085-bf7c-6cc7aa073819"
total_cost
0.0414325
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
100.00%
Dialogue to Total Word Ratio
Ratio: 50.00%, Deviation: 0.00%
neededClean
false
wordsTotal
200
wordsDialogue
100
100.0000%