NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFHWZVF2Y3Q2GZJC09ANAV
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
8.2s
Cost:
$0.000962
Provider:
Fireworks
Evaluation summary
Score
Evaluator
Details Link
1.04%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.5180%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"So the target is the Henderson house on Maple Street," Molly said, tapping a photograph pinned to the corkboard. "Birthday party for some kid named... Trevor?" "Trevor. Age seven. Forty-three guests, a clown, and a bouncy castle," Bill added, not looking up from his phone. "According to my cousin's daughter who follows the family on Instagram." "Convenient. What's the security situation?" "Front door has one of those decorative wreath cameras, nothing recording. Back door leads to a kitchen that connects to the living room where all the presents will be under the tree." "Under the tree. In front of forty-three kids and their parents." "Here's the beauty part." Bill finally looked up with a grin. "The party runs from two to five. Parents will be exhausted, sugar-high kids will be screaming, and at exactly four-fifteen, the clown has a fifteen-minute magic show where everyone gathers around." "So we slip in during the magic show." "We slip in during the magic show. Classic distraction technique." Molly circled a point on the blueprint. "And the exits?" "Three. Front door, back door through the kitchen, and a side door that leads to the garage. But I say we go in through the garage, grab the presents from under the tree, and exit through the kitchen." "Garage door is automated. How do we get in?" "I've got that covered." Bill held up a small device that looked like a garage door opener. "Bought it at a garage sale last week. Turns out the Hendersons have the same model as the guy who sold it to me." "Bill, that's genuinely impressive." Molly raised an eyebrow. "What's the catch?" "The catch is the presents. We're talking about forty-three kids who each brought a gift. That's a lot of wrapping paper to juggle." "We don't need to take everything. We're not amateurs." "I thought we were taking all the presents?" "I said we were stealing the presents. I meant the good ones. The iPads, the gaming consoles, the expensive toys. We leave the clothes and the books." "So we're cherry-picking." "We're being efficient. And less conspicuous. You try carrying forty-three gift boxes out of a house in the middle of a birthday party." "Fair point." Bill scratched his chin. "What about the parents? What if someone catches us?" "We wear costumes. Clown makeup, maybe. Blend in with the chaos." "We're not exactly the clown type." "Neither am I, but I've got a wig and a nose. We go in, grab the high-value items, and leave before Trevor opens his presents." "Stealing a seven-year-old's birthday presents. This is a new low, even for us." "Bill, we stole a penguin once." "That penguin was worth eight thousand dollars and you know it." Molly laughed and shook her head. "Okay, here's the plan. Saturday, two-thirty, we arrive in the party supply van I borrowed from my cousin. We blend in, wait for the magic show, then execute." "Execute. Nice heist terminology." "I've been practicing." She grinned. "Ready to ruin a seven-year-old's birthday?" "Absolutely not. But I'm absolutely ready to steal his presents."
created_at
"2026-02-17T09:40:34.954637+00:00"
2/17/2026, 9:40:34 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
535
moderation_latency
(null)
generation_time
7965
tokens_prompt
77
tokens_completion
928
native_tokens_prompt
97
native_tokens_completion
777
native_tokens_completion_images
(null)
native_tokens_reasoning
119
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0009615
router
(null)
provider_responses
0
id
"b13cf3a0-84e3-42c2-9305-7c7039a60c49"
status
200
is_byok
false
latency
535
endpoint_id
"c61d37ec-745c-4efd-8129-a68371e056b2"
provider_name
"Fireworks"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771321226-QYAAxBKAsmApRzmEz0NE"
upstream_id
"b13cf3a0-84e3-42c2-9305-7c7039a60c49"
total_cost
0.0009615
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Fireworks"
Evaluation details
Result
Evaluator
Details
Meta Data
1.04%
Matches word count
n/a
neededClean
false
words
526
0.00%
Dialogue to Total Word Ratio
Ratio: 88.59%, Deviation: 18.59%
neededClean
false
wordsTotal
526
wordsDialogue
466
0.5180%