NC Bench
Run: 2026-03-25-01KMJ7T6K529DGSBEGYXTXWMYB
Run Details
Model: Qwen3 235B A22B Instruct 2507
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 19.5s
Cost: $0.000423
Provider: Novita
Evaluation summary
Score | Evaluator
86.38% | Matches word count
0.00% | Dialogue to Total Word Ratio
43.1902% | Overall
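The Overall figure is consistent with an unweighted mean of the two evaluator scores. The sketch below checks that arithmetic; the averaging rule is an assumption inferred from the numbers on this page, not a documented NC Bench formula, and `overall_score` is a hypothetical helper name.

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores (assumed aggregation rule)."""
    return sum(scores) / len(scores)

# Scores as displayed in the evaluation summary (rounded to two decimals).
scores = {
    "Matches word count": 86.38,
    "Dialogue to Total Word Ratio": 0.00,
}

overall = overall_score(list(scores.values()))
print(f"{overall:.2f}%")  # prints 43.19%
```

The page reports 43.1902%, which suggests the word-count score is stored with more precision than the 86.38% shown.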
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The glow of a single streetlamp flickered above the minivan parked crookedly behind Chuckles the Clown’s Pizza Palace. Inside, Molly adjusted her fake mustache in the rearview mirror while Bill gnawed on a novelty ice cream cone left behind by what he claimed was “an emotionally fragile eight-year-old.” “We’re gonna waltz into a birthday party full of sugar-raged toddlers and steal their loot,” Molly said, peeling the mustache off and inspecting it. “I still think we should’ve gone with the penguin band idea.” “The penguin band was vetoed,” Bill said, flinging the cone into a nearby trash can with a wet plop. “Mainly because penguins can’t legally perform in municipal zones after the 2019 Bylaw Incident. Besides, we’re ninjas now.” “We’re not ninjas, Bill. We’re dressed as a magician and a balloon animal guy.” “Same thing.” “No, it’s not. One does card tricks, the other makes a dog out of twisted latex. You can’t threaten people with a dachshund.” “You’d be surprised.” Bill adjusted his sequined cape and checked his watch. “Look, the clown just left. That’s our cue. Kids are in the middle of the piñata round. Chaotic. Emotional. Prime for distraction.” Molly sighed and pulled a rubber chicken from her duffel. “This better work. I spent six months getting that fake laugh down. The one that says, ‘I’m here for the children, but also possibly arson.’” “You’ll be fine. Just stick to the plan: you distract with ‘Surprise Giggle Time,’ I sneak into the gift pile, swap the presents with these.” Bill pulled out a stack of empty gift boxes labeled “World’s Okayest Thief.” “World’s Okayest Thief?” Molly raised an eyebrow. “It’s misdirection. Who’d suspect someone who owns that mug?” “Everyone. That mug is a cry for help.” “Focus.” Bill zipped up his clown-patterned fanny pack. “I go in after the monkey song, you start your bit during cake. 
We’re out before the thank-you notes are written.” They crept toward the back entrance, where a deflated inflatable T-rex drooped sadly beside a trash bin. Bill paused, nose twitching. “You smell that?” “Regret?” “Cupcake frosting. That’s not just frosting—it’s *vanilla betrayal*. They’re using store-bought.” “Priorities, Bill.” Inside, the party raged like a tiny, glitter-coated warzone. Confetti cannons exploded at irregular intervals. A child screamed over stolen juice boxes. And in the center of it all, a lopsided piñata shaped like a unicorn wept papier-mâché tears. Molly adjusted her bowtie and cleared her throat. “Right. I’m going in. If I don’t make it... tell my cat I finally forgave him for the incident with the hamster.” “That was two years ago.” “It haunts me.” Bill slipped toward the gift table, eyeing the pile like a starving raccoon. “Look at them. Naïve. Trusting. Wrapped in dinosaur paper. This is almost sad.” “Almost?” Molly whispered as she stepped into the party room, waving her rubber chicken. “Almost,” Bill muttered, stuffing a PlayStation box into his fanny pack. “But not quite.”
created_at: "2026-03-25T10:14:31.785Z" (3/25/2026, 10:14:31 AM)
model: "qwen/qwen3-235b-a22b-07-25"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 963
moderation_latency: (null)
generation_time: 19446
tokens_prompt: 77
tokens_completion: 856
native_tokens_prompt: 72
native_tokens_completion: 719
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0004235
router: (null)
provider_responses:
  0:
    endpoint_id: "cf186489-d252-4fb7-aca9-87ef7b557eff"
    id: "8eb2a268d66bf8ff41f26375e2508c9d"
    is_byok: false
    latency: 963
    model_permaslug: "qwen/qwen3-235b-a22b-07-25"
    provider_name: "Novita"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1774433671-ZUpiCItr6I1kZSD3TdeW"
upstream_id: "8eb2a268d66bf8ff41f26375e2508c9d"
total_cost: 0.0004235
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Novita"
Evaluation details
Result | Evaluator | Details | Meta Data
86.38% | Matches word count | n/a | neededClean: false, words: 489
0.00% | Dialogue to Total Word Ratio | Ratio: 56.94%, Deviation: 26.94% | neededClean: false, wordsTotal: 497, wordsDialogue: 283
43.1902% | Overall
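The reported ratio and deviation follow directly from the word counts: 283 dialogue words out of 497 total is 56.94%, which is 26.94 percentage points above the 30% target in the scenario. The sketch below reproduces that arithmetic and shows one way such counts might be obtained. The quote-matching rule and whitespace word splitting are assumptions for illustration; the evaluator's actual tokenization is not shown on this page (note that the two evaluators report different totals, 489 and 497), and `dialogue_stats` is a hypothetical name.

```python
import re

# Assumption: dialogue is any text between double quotation marks,
# either curly (“ ”) or straight (" "). The real evaluator may differ.
DIALOGUE_RE = re.compile(r'“([^”]*)”|"([^"]*)"')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_stats(text: str, target: float = 0.30) -> dict:
    """Return word counts, dialogue ratio, and absolute deviation from target."""
    total = count_words(text)
    spoken = sum(count_words(a or b) for a, b in DIALOGUE_RE.findall(text))
    ratio = spoken / total if total else 0.0
    return {
        "wordsTotal": total,
        "wordsDialogue": spoken,
        "ratio": ratio,
        "deviation": abs(ratio - target),
    }


# Arithmetic check against the counts reported in the evaluation details.
reported_ratio = 283 / 497
reported_deviation = abs(reported_ratio - 0.30)
print(f"{reported_ratio:.2%}")      # prints 56.94%
print(f"{reported_deviation:.2%}")  # prints 26.94%

# Small usage example on a two-quote sentence.
sample = "“We go in,” Molly said. “Now.”"
stats = dialogue_stats(sample)
```

Under this rule a quoted phrase inside narration also counts as dialogue, so counts from this sketch will not necessarily match the evaluator's 283.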