NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBK79DZBBTYW59RH3FEZM7K
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
1m 42s
Cost:
$0.022309
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
99.58%
Dialogue to Total Word Ratio
(details)
99.6609%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted goggles while Bill checked the lockpick set very carefully. The house glowed very brightly with fairy lights. Colorful balloons bobbed against the windowpane. Bill asked, shaking his head slowly. "Are you sure?" "It's a party." Molly corrected, pointing at the pile under the tree. "It's a treasure vault," "Look at those tags. Designer toys." Bill sighed heavily. "Those are for kids, Molly. Not us." she said, pulling a shiny van key from her pocket. "We aren't keeping them," "We're reselling them on eBay. For charity." "You always say that." "This time it's true. Orphanages need Legos." Bill looked at the sliding glass door. "The security system is just a dog." "A golden retriever. He likes treats." Molly tossed a bag of jerky to him. "You distract, I crack." "I hate this job." "You love the money." "I love the money more than I hate this job." They moved slowly toward the patio. The music inside was loud. Laughter echoed through the walls. Molly signaled Bill to stop. A shadow moved inside. "Is that the dad?" "Asleep." "What if he wakes up?" "Then we run." Bill kicked the door. It opened. They slipped inside quietly.
created_at
"2026-02-25T23:47:50.343Z"
2/25/2026, 11:47:50 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
459
moderation_latency
(null)
generation_time
102048
tokens_prompt
77
tokens_completion
7624
native_tokens_prompt
76
native_tokens_completion
9286
native_tokens_completion_images
(null)
native_tokens_reasoning
9001
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0223092
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-5614ff3e-0ae1-9065-bcd1-da54d3c651ed"
is_byok
false
latency
459
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772063270-o1pzeQneh5cXHEDrO29L"
upstream_id
"chatcmpl-5614ff3e-0ae1-9065-bcd1-da54d3c651ed"
total_cost
0.0223092
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
196
99.58%
Dialogue to Total Word Ratio
Ratio: 47.45%, Deviation: 2.55%
neededClean
false
wordsTotal
196
wordsDialogue
93
99.6609%