NC Bench
Run: 2026-03-25-01KMJ8DCGNM4KMBS5X4WWRM1P5
Run Details
Model: Qwen3 235B A22B Instruct 2507
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 10.2s
Cost: $0.000220
Provider: Novita
Evaluation summary
Score     Evaluator
0.00%     Matches word count
0.00%     Dialogue to Total Word Ratio
0.0000%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind a plastic jungle gym, binoculars pressed to her face. "Still no sign of the piñata guard." Bill adjusted his ski mask, which read "I ♥ BALLOONS" in glittery letters. "Are we *sure* this is the best time? Kids are unpredictable, Molly. One sneeze and we're exposed." “Relax, Bill. They’re doing the ‘Hokey Pokey’ right now. Peak distraction.” She scribbled notes on a cupcake napkin. "We hit during 'Musical Chairs. That's when the adults start drinking the punch." Bill chewed his lip. "And the cake? Do we disarm before or after?" "After. Never interrupt dessert. Rule number seven." "But what if Timmy opens his Dinosaur Dig Set early?" "Then we improvise. You distract them with a magic trick. You’ve got that rubber chicken, right?" “Lost it at the dentist heist. But I’ve got a whoopee cushion.” Bill sighed. “It's not the same.” “No, it’s *better*. Deploy on the parents’ table. While they’re embarrassed, I’ll vacuum the gift pile with the stroller.” “The stroller’s got a squeaky wheel, Molly.” “So grease it with frosting. That’s why we brought the spatula.” Bill stood, adjusting his tiny party hat. “You realize we’re robbing children, right?” Molly gave him a flat look. "We’re stealing from capitalism, Bill. These toys fund evil empires of plastic." “…Does that make us anti-heroes?” “Only if we split the loot with the janitor.” She zipped open her duffel. “Ready?” Bill saluted. "Let’s party."
created_at: "2026-03-25T10:25:00.44Z" (3/25/2026, 10:25:00 AM)
model: "qwen/qwen3-235b-a22b-07-25"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1037
moderation_latency: (null)
generation_time: 10184
tokens_prompt: 77
tokens_completion: 412
native_tokens_prompt: 72
native_tokens_completion: 369
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0002205
router: (null)
provider_responses:
  0:
    endpoint_id: "cf186489-d252-4fb7-aca9-87ef7b557eff"
    id: "dc363e710989ae8372ae8d3d1862a970"
    is_byok: false
    latency: 1036
    model_permaslug: "qwen/qwen3-235b-a22b-07-25"
    provider_name: "Novita"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1774434300-NMRrU1DGK2vX7XxtQDBJ"
upstream_id: "dc363e710989ae8372ae8d3d1862a970"
total_cost: 0.0002205
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Novita"
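As a rough illustration of how the timing fields relate to the run's reported Duration, the sketch below converts generation_time to seconds and derives a completion throughput. It assumes generation_time is in milliseconds (consistent with the 10.2s Duration shown in Run Details, but not stated on the page); the function name is hypothetical.

```python
def completion_throughput(native_tokens_completion: int, generation_time_ms: int) -> float:
    """Tokens generated per second, assuming generation_time is in milliseconds."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return native_tokens_completion / (generation_time_ms / 1000.0)


# Values copied from the metadata above.
generation_time_ms = 10184
native_tokens_completion = 369

duration_s = generation_time_ms / 1000.0
tokens_per_second = completion_throughput(native_tokens_completion, generation_time_ms)

print(f"Duration: {duration_s:.1f}s")                    # Duration: 10.2s
print(f"Throughput: {tokens_per_second:.1f} tokens/s")   # Throughput: 36.2 tokens/s
```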
Evaluation details
Result    Evaluator                      Details                             Meta Data
0.00%     Matches word count             n/a                                 neededClean: false, words: 236
0.00%     Dialogue to Total Word Ratio   Ratio: 76.69%, Deviation: 26.69%    neededClean: false, wordsTotal: 236, wordsDialogue: 181
0.0000%   Overall
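The ratio and deviation figures in the details can be reproduced from the reported word counts. The sketch below is a minimal reconstruction of that arithmetic, assuming the deviation is the absolute difference from the 50% target in the scenario and that the word count target is the 200 words requested in the prompt. The function names are hypothetical, and how NC Bench maps these numbers to a 0.00% score is not shown on the page, so no scoring rule is modeled here.

```python
def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Dialogue words as a percentage of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float = 50.0) -> float:
    """Absolute distance from the target ratio, in percentage points (assumed definition)."""
    return abs(ratio_pct - target_pct)


def word_count_overshoot_pct(words: int, target_words: int = 200) -> float:
    """Relative difference from the requested length, as a percentage."""
    return 100.0 * (words - target_words) / target_words


# Values copied from the Meta Data column above.
ratio = dialogue_ratio_pct(words_dialogue=181, words_total=236)
deviation = deviation_pct(ratio)
overshoot = word_count_overshoot_pct(words=236)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")  # Ratio: 76.69%, Deviation: 26.69%
print(f"Word count overshoot: {overshoot:.1f}%")            # Word count overshoot: 18.0%
```

Under these assumptions the response runs 36 words over the requested length and devotes roughly three quarters of its words to dialogue rather than half, which is consistent with both evaluators reporting a miss.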