NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFG6MJYT5KZDK6JZ66D9AYE
Run Details
Model:
Qwen 3.5 9B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 31s
Cost:
$0.001164
Provider:
Together
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
19.58%
Dialogue to Total Word Ratio
(details)
9.7911%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The storm arrived with a violence that shook the siding of the house. Bill could not stop shivering, not from the cold, but from the adrenaline. He adjusted his swim cap, which he had bought on sale at Five Below. The cap was blue, but it usually turned green under humidity. He looked at the window, which was open. Molly held the silence, waiting for the intruder. The house was empty of adults, but the kids were inside. Bill walked toward the gate. He did not know how he could get in. He needed a tool. He had one. It was a lighter. The lighter was silver. Bill wore a tuxedo suit, which was torn at the shoulder. Molly wore a suit, which was blue with buttons. They looked like guests, not thieves. The garden was full of spider webs. The webs were sticky. Bill stepped on a twig. It snapped. They were caught. "Is it secure?" Bill asked. Molly checked the perimeter fence. It was carved from wicker, making it look like furniture. "Nailed it," she whispered. "You didn't put a lock on the back door," Bill noted. "Because you didn't tell me the exit route," Molly snapped. "We need the presents. Do you know how hard it is to find hard candy this late at night?" Bill kicked the gate and the lock broke. "We're stealing the candy?" Bill asked. "Yes," Molly said. "That's illegal," Bill said. "It's commerce," Molly replied. "Still illegal, but thrilling." Bill kicked the gate. It opened. "We did it," Bill said. "No," Molly said. "Yes," Bill said. "I mean it," Molly said. The wind blew. The ground shook. Bill ran. Molly ran. They ran. They jumped. They stopped. "We are safe," Bill said. "No," Molly said. "Yes," Bill said. "I mean it," Molly said. "We need to go home," Bill said. "No," Molly said. "Yes," Bill said. "I mean it," Molly said. Bill looked at the gate. The gate was closed. The night was dark. "It's time," Bill said. "No," Molly said. "Okay," Bill said. "Do what?" Molly asked. "Stop," Bill said. "I mean it," Molly said. They stayed quiet. They watched the street. No one was there. "We are ready," Bill said. "Yes," Molly said. They left. They walked away. They were safe. They were thieves. They were friends.
created_at
"2026-03-11T22:27:42.826Z"
3/11/2026, 10:27:42 PM
model
"qwen/qwen3.5-9b-20260310"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
9986
moderation_latency
(null)
generation_time
90269
tokens_prompt
77
tokens_completion
8007
native_tokens_prompt
76
native_tokens_completion
7710
native_tokens_completion_images
(null)
native_tokens_reasoning
7409
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0011641
router
(null)
provider_responses
0
endpoint_id
"00d506d9-a5e9-4382-a2b9-d54732365594"
id
"oagdLTX-3pDw3Z-9dadfcf0d1f12c4e"
is_byok
false
latency
9986
model_permaslug
"qwen/qwen3.5-9b-20260310"
provider_name
"Together"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773268062-zv63Ecc1tjhmFcCiiOsw"
upstream_id
"oagdLTX-3pDw3Z-9dadfcf0d1f12c4e"
total_cost
0.0011641
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
387
19.58%
Dialogue to Total Word Ratio
Ratio: 24.29%, Deviation: 5.71%
neededClean
false
wordsTotal
387
wordsDialogue
94
9.7911%