NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFDAEF2VFD8TED8WG9Z734V
Run Details
Model:
Qwen 3.5 9B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 35s
Cost:
$0.000970
Provider:
Together
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.01%
Dialogue to Total Word Ratio
(details)
0.0062%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The smell of vanilla frosting and stale popcorn hung heavy in the air, masking the metallic tang of Bill’s criminal adaptability. Molly adjusted her collar, which was already too tight for the confetti she had been dusting in her hair. They stood before the swinging doors marked with a giant plastic star. The keycard beeped green, a sound that cut through the roar of the balloons. "Target acquired," Molly hissed, pressing her ear against the wood. She pulled out a high-powered listening device that she claimed was actually just a keychain talisman from her grandmother. "It’s promising." "Molly, we cannot play with grounding stakes here," Bill whimpered, clutching his disposable fingerless gloves. "It’s an opportunity, Bill. High-value, low-security. Kindergarten gold." She cracked the latch with a rusted coat hanger she’d salvaged from a dumpster behind a bakery. "The sound of children’s laughter rose from inside like a haunting choir." Little ones squealed with delight over plastic snakes. Bill's eyes darted toward the stage where a DVD was currently scrubbing through a soundtrack at maximum volume. "My hands are shaking," Bill admitted, watching the clock. "If they find us in a clown suit, they’ll put us in a cage." "No cages," Molly corrected, peeling back the canvas. "We hit the buffet table. That is where the big bags are. The VR toys are too heavy." She paused to listen to the hum of the projector in the dark corner. "There," she pointed to a window. "I clear the vision feed camera blindspot with a laser sight." Bill nodded, pulling a bag over his head. It was a plastic grocery sack with red glitter. "I look like a reckless chef," Bill said confidently. "Perfectly fitted," Molly smiled back. They slipped into the hallway. The air inside was filled with colorful streamers hanging from the ceiling like nooses. A toddler in one corner screamed while bouncing on a ball pit. It was a sauna of energy. "Stay in the dark corner," Bill warned as they crept forward. "We need gloves," Molly replied immediately. "I have these," he said, holding up rubber gloves. "Great." "Wait, those gloves are for cake decorating? You’re stealing food?" "We do not steal food, Bill. We steal desire." "I thought we were here for the discs." "Discs are noise. Children are the cube." "Okay, let’s do the cube." "Let’s stun the cubes." "There will be no stun." "Then you go get the juice." "I will gather the toys, Molly." "I will blow the fuse." "I will cut the camera." "I will watch cut the camera." "I will not watch." "I will watch you." "Okay." "It’s showtime."
created_at
"2026-03-11T21:37:21.902Z"
3/11/2026, 9:37:21 PM
model
"qwen/qwen3.5-9b-20260310"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
296
moderation_latency
(null)
generation_time
95144
tokens_prompt
77
tokens_completion
6605
native_tokens_prompt
76
native_tokens_completion
6417
native_tokens_completion_images
(null)
native_tokens_reasoning
5899
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00097015
router
(null)
provider_responses
0
endpoint_id
"00d506d9-a5e9-4382-a2b9-d54732365594"
id
"oagMrNr-3pDw3Z-9dadb33005e56e1a"
is_byok
false
latency
296
model_permaslug
"qwen/qwen3.5-9b-20260310"
provider_name
"Together"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773265041-reKptRe07lO5ejR0iUQk"
upstream_id
"oagMrNr-3pDw3Z-9dadb33005e56e1a"
total_cost
0.00097015
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
437
0.01%
Dialogue to Total Word Ratio
Ratio: 43.41%, Deviation: 13.41%
neededClean
false
wordsTotal
440
wordsDialogue
191
0.0062%