NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBKE5WAXTC6D5RJHHHQGHWW
Run Details
Model:
Qwen 3.5 Flash
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
1m 46s
Cost:
$0.007955
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
90.01%
Dialogue to Total Word Ratio
(details)
45.0037%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his coat. The fabric felt heavy. Molly checked the time. She frowned at the watch. "Time to go," Bill said. "Okay," Molly said. The fabric felt very heavy. "We need to hide," Bill said. She frowned at the time. "I am hiding," Molly said. Bill looked at the window. "You are visible," Bill warned. She checked her phone screen. "I have a hat," Molly said. "It is orange," Bill said. "It is a clown hat," Molly smiled. "We are thieves," Bill noted. "Thieves wear clowns," Molly grinned. "We steal toys," Bill said. Molly stood by the gate. "Yes, toys," Molly agreed. The night was cold. "Not money," Bill said. "Toys are money," Molly said. Bill wiped his sweat. "Go inside," Bill ordered. The door opened slowly. "Through the glass," Molly said. "Or the door," Bill said. "The door is locked," Molly said. "I have a pick," Bill said. "Show me the pick," Molly said. "Here it is," Bill said. "Is it big?" Molly asked. "It is small," Bill replied. "It works well," Molly nodded. "Open it now," Bill said. The key turned. "I hear dogs," Molly said. "They are asleep," Bill noted. "Good luck then," Molly said. "We steal fast," Bill said. "The presents are loud," Molly said. "I hate loud," Bill said. "I love loud," Molly laughed. "Let us jump," Bill said. "Jump now," Molly said. They jumped. "Wait," Molly said. They entered. End.
created_at
"2026-02-25T23:51:36.078Z"
2/25/2026, 11:51:36 PM
model
"qwen/qwen3.5-flash-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
481
moderation_latency
(null)
generation_time
106284
tokens_prompt
77
tokens_completion
15242
native_tokens_prompt
76
native_tokens_completion
19869
native_tokens_completion_images
(null)
native_tokens_reasoning
19502
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0079552
router
(null)
provider_responses
0
endpoint_id
"5f6b71b5-8672-4b27-8b2d-f674429f9e63"
id
"chatcmpl-c32e9fe7-7e06-93c1-9eb2-b3f8a18d0f7e"
is_byok
false
latency
481
model_permaslug
"qwen/qwen3.5-flash-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772063496-ZWM2kjKxjnBr8bIG2joe"
upstream_id
"chatcmpl-c32e9fe7-7e06-93c1-9eb2-b3f8a18d0f7e"
total_cost
0.0079552
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
237
90.01%
Dialogue to Total Word Ratio
Ratio: 44.30%, Deviation: 5.70%
neededClean
false
wordsTotal
237
wordsDialogue
105
45.0037%