NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFXM5R2RXRKC8E4A2EPY0N
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
5m 6s
Cost:
$0.020961
Provider:
Novita
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
97.64%
Dialogue to Total Word Ratio
(details)
96.8136%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered the pavement outside, but inside the community hall the air crackled with the buzz of children and the sweet scent of birthday cake. A banner reading “Happy Birthday, Leo!” fluttered above a table laden with balloons, and a bouncy castle sagged in the corner like a tired elephant. In a cramped supply closet that smelled of dust and old popcorn, Molly and Bill hunched over a cardboard box that served as a desk, a map of the party scrawled on the back of a pizza flyer. Molly tapped the sketch. “We have twenty minutes before the cake. The presents are behind the DJ booth. We grab them when the lights go out.” Bill frowned, his eyebrows climbing like caterpillars. “Rob a kid's party? That's low. Parents will call the cops, and I'll end up in a clown suit.” Molly smirked. “Exactly. You take the clown outfit, distract the kids with a bad joke, and I'll slip behind the booth.” Bill sighed. “Fine. If I have to juggle, I’m blaming you.” Molly nodded. “Deal. When the candles blow, we move. Five seconds, then we’re gone.” Bill took a breath. “Got it. Let’s do this before I lose my nerve.” They slipped out of the closet, the door squeaking behind them. The hall thrummed with laughter, balloons popped, and a DJ spun a pop song that made the floor tremble. Molly slipped a bright wig over her hair and grabbed a stack of gift bags, while Bill squeezed into an oversized clown suit that smelled of cheap paint. “Hey, little heroes! Who wants a magic trick?” the clown shouted, waving a rubber chicken. Bill forced a laugh. “Uh, sure. Watch this!” The children giggled as Bill fumbled with three colored balls, missing each one spectacularly. Parents clapped politely, and the music swelled, covering the rustle of wrapping paper. In the chaos, Molly slipped behind the DJ booth, her gloved hands finding the pile of presents. She lifted a rectangular box wrapped in blue, slid it into her bag, and moved to the next. Molly whispered, “Every last one. Let’s go.” Bill glanced around, his face red beneath the clown paint. “We actually pulled it off.” Molly laughed, the sound bright against the hum of the engine. “Told you. The best heist is the one nobody sees.” They sprinted to the side door, hopped into the waiting van, and sped away, the streetlights blurring past. The rain had stopped, and a faint rainbow stretched over the city. Bill stared at the backseat, piled high with brightly wrapped boxes. “What if a kid spots us?” Molly shrugged. “We’ll claim we’re the surprise entertainment.” Bill nodded. “Alright, I’m in.” Molly grinned. “Parents love surprise acts.” Bill took a final look at the hall. “Here goes nothing.” Molly gave a quick thumbs‑up. “Five seconds. Go!” The van disappeared into the night, leaving only the echo of a child's laughter and the faint smell of cake.
created_at
"2026-02-17T09:51:57.225176+00:00"
2/17/2026, 9:51:57 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2790
moderation_latency
(null)
generation_time
306374
tokens_prompt
77
tokens_completion
17351
native_tokens_prompt
101
native_tokens_completion
17442
native_tokens_completion_images
0
native_tokens_reasoning
16797
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0209607
router
(null)
provider_responses
0
id
"139dc621bd4eec775f0c9bc6a48d34cb"
status
200
is_byok
false
latency
2789
endpoint_id
"05387b87-6e10-4520-87e9-be7c49693ba1"
provider_name
"Novita"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771321610-rdmXIKi0g6feyEy31N2U"
upstream_id
"139dc621bd4eec775f0c9bc6a48d34cb"
total_cost
0.0209607
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
492
97.64%
Dialogue to Total Word Ratio
Ratio: 30.69%, Deviation: 0.69%
neededClean
false
wordsTotal
492
wordsDialogue
151
96.8136%