NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNF9Z45B4PXDQN1TV23KDSE
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
40.9s
Cost:
$0.001806
Provider:
Fireworks
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The basement smelled like old pizza and broken dreams, which was perfect for a planning session. Molly spread a napkin across the grimy table, revealing a hand-drawn map of the Hendersons' suburban mansion. "Alright, here's the play," she whispered, tapping a finger on the crude blueprint. "Saturday afternoon, three to five PM. The kid's turning seven, so expect forty-five minutes of screaming and balloon animals." Bill leaned forward, his chair creaking in protest. "The presents are in the living room, right near the bounce house rental. I scouted it yesterday while walking my fake emotional support dog." "The golden retriever?" "Labrador, actually. Much more trustworthy looking." Molly nodded, pulling out a tiny notebook covered in cat stickers. "I've got the catering schedule. Cake arrives at four-fifteen. That's our window. The parents will be distracted by the gift-opening ceremony, and the entertainment guy—some guy named Disco Dan—will be doing his thing." "What's his thing?" "Exactly. That's our advantage." Bill scratched his chin thoughtfully. "What about the neighbors? Mrs. Patterson watches everything from her window like some kind of retiree hawk." "I've already handled her. She's getting a free spa day for two at that new salon on Fifth Street. I told her it was a church raffle prize." "You forged a church raffle ticket?" "I forged an entire church, Bill. Don't complicate things." The plan was simple—well, as simple as stealing from children could be. They'd pose as the entertainment. Molly had already rented a costume that promised "ultimate magic for ages three to seven." "I'm going as a clown?" Bill groaned. "A party clown. Very different. It's professional. I'll handle the crowd while you work the present table. When I signal, you grab the loot bags and we slip out the side door." Bill's eyes widened. "We're stealing from a seven-year-old's birthday party?" "Technically we're stealing FROM the presents, not the kid. The kid gets to keep the memory." Molly shrugged. "And honestly, that little guy has like six gaming consoles at home. He won't even notice." "That's the most morally flexible thing you've ever said." "I learned from the best. You." Bill stood up, stretching his back. "Alright, I'm in. But if we get arrested, I'm telling them everything." "Of course you will." Molly grinned. "That's why I love working with you. Now let's go steal some birthday presents from a seven-year-old. What could possibly go wrong?" As they climbed the stairs, Bill paused. "Wait. What about the piñata?" Molly's eyes lit up. "Oh, that's just an extra. Now let's move."
created_at
"2026-02-17T09:36:47.762373+00:00"
2/17/2026, 9:36:47 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4542
moderation_latency
(null)
generation_time
20670
tokens_prompt
77
tokens_completion
1841
native_tokens_prompt
97
native_tokens_completion
1481
native_tokens_completion_images
(null)
native_tokens_reasoning
1146
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0018063
router
(null)
provider_responses
0
status
(null)
is_byok
false
latency
10077
endpoint_id
"3afb00c4-6135-4c55-96e9-6befa9679b06"
provider_name
"Minimax"
model_permaslug
"minimax/minimax-m2.5-20260211"
1
status
(null)
is_byok
false
latency
10077
endpoint_id
"05387b87-6e10-4520-87e9-be7c49693ba1"
provider_name
"Novita"
model_permaslug
"minimax/minimax-m2.5-20260211"
2
id
"a2925283-1687-4e7f-96d7-a5850c3d5164"
status
200
is_byok
false
latency
4542
endpoint_id
"c61d37ec-745c-4efd-8129-a68371e056b2"
provider_name
"Fireworks"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771320966-G6i6mFseglt8wD22fPRi"
upstream_id
"a2925283-1687-4e7f-96d7-a5850c3d5164"
total_cost
0.0018063
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Fireworks"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
433
0.00%
Dialogue to Total Word Ratio
Ratio: 73.21%, Deviation: 23.21%
neededClean
false
wordsTotal
433
wordsDialogue
317
0.0000%