NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ8YXNBQ015RNFEDX1Q5805
Run Details
Model:
Qwen3 235B A22B Instruct 2507
Test:
N-Length Sentences
Scenario:
Write sentences with 20 words each
Duration:
7.7s
Cost:
$0.000279
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
12.97%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The rain fell steadily on the cobblestone path leading to the old Victorian mansion surrounded by overgrown ivy and shadowy trees. Detective Clara Hale stepped from her car, black coat soaked, flashlight cutting through the fog as she approached the iron gate slightly ajar. A single scream had summoned her, though dispatch claimed no emergency call had ever been recorded from this desolate part of the county. She pushed the gate open wider, its hinges screaming in protest, revealing a crooked front porch and shattered lantern light spilling onto the steps. A man in a butler’s uniform lay facedown near the doorway, one hand clutching his chest, the other reaching toward broken glasses. Clara knelt, checked for a pulse, found none, then noticed the silver tray beside him, overturned, with a single bloodied teacup. The house loomed above, windows dark except for one flickering room on the second floor, curtains twitching as if someone had just passed. She stood, drew her sidearm, and advanced slowly, her boots echoing on the wooden planks leading into the silent, gaping front doorway. Inside, the air smelled of damp wallpaper, old cigars, and something metallic—fear, or perhaps fresh blood seeping from beneath a locked door. A grandfather clock in the hall ticked unevenly, stopping suddenly at ten past midnight, though her watch read exactly nine thirty-one p.m. She called out, identified herself, and waited; only silence answered, broken by a faint drip, drip, drip from somewhere deep in the house. Then, from upstairs, a floorboard creaked, followed by the soft closing of a door, deliberate, almost mocking in its precision and timing. Clara moved toward the staircase, each step groaning under her weight, her flashlight beam trembling slightly with the tension in her hand. She reached the top landing, where three doors stood closed, one slightly open, revealing a sliver of red carpet and a woman’s pearl earring. She entered the room, gun raised, and found a vanity cluttered with perfume bottles, a stopped watch, and a single typed note: “You’re too late.”
created_at
"2026-03-25T10:34:35.055Z"
3/25/2026, 10:34:35 AM
model
"qwen/qwen3-235b-a22b-07-25"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
386
moderation_latency
(null)
generation_time
7573
tokens_prompt
46
tokens_completion
544
native_tokens_prompt
44
native_tokens_completion
456
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000279266
router
(null)
provider_responses
0
endpoint_id
"043eb28f-fc28-4b0d-9800-b249fcfcfbf9"
is_byok
false
latency
8
model_permaslug
"qwen/qwen3-235b-a22b-07-25"
provider_name
"Parasail"
status
429
1
endpoint_id
"813ce28a-1334-47e3-881e-a7f8833b34b4"
id
"chatcmpl-9c065c50-6255-909d-b49d-def541aa6cd7"
is_byok
false
latency
386
model_permaslug
"qwen/qwen3-235b-a22b-07-25"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434875-c3nFlU9fqztv16GViMak"
upstream_id
"chatcmpl-9c065c50-6255-909d-b49d-def541aa6cd7"
total_cost
0.000279266
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
12.97%
Matches word count
n/a
neededClean
false
sentences
15
wordCounts
0
21
1
23
2
23
3
24
4
22
5
21
6
23
7
22
8
23
9
22
10
23
11
22
12
22
13
24
14
25