NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNEY3W8EF0YER51972AC2KC
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
6m 44s
Cost:
$0.022371
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
96.61%
Dialogue to Total Word Ratio
(details)
97.6631%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Moonlight flickered over the Hendersons' lawn, where pink balloons bobbed in the night. Inside, a birthday party roared with giggles and the tinny beat of a karaoke machine. Outside, Molly and Bill crouched behind a hedge, whispering. Molly: "Alright, Bill, here's the plan. The cake comes at five, the presents are stacked in the living room like a Jenga tower, and the parents will dash to the kitchen for ice cream. That's our three‑minute window." Bill: "Three minutes? I can barely juggle a single balloon in that time, let alone a mountain of toys and a glittery pink bike. My back still aches from the circus tent escapade." Molly: "That's why we're bringing the collapsible wagon. We fold it, roll it in like a magic carpet, and when the parents turn around we claim to be the 'party cleanup crew.' They'll think we belong to the event." She drew a quick sketch in the dirt, complete with an arrow labeled 'Wagon Route'. Bill: "Fine, but what about the dog? The Hendersons' golden retriever thinks he's a guard dog. He might lick the wagon and alert everyone." Molly: "We bring bacon strips. Two minutes of slobber, and the beast will be busy chewing. Then we grab the biggest box—the video‑game console—while the parents are still fighting the ice‑cream scoop." Bill rolled his eyes, but the idea was absurd enough to work. Bill: "If we get caught, I'm blaming the clown outfit." Molly: "No clown needed—just a 'Party Services' polo and a badge. The kids will be too busy fighting over cake to notice us." The ice‑cream truck jingled in the distance, its melody drifting like a siren. Molly: "That's our cue. We slip in, load the wagon, and vanish before the first 'Happy Birthday' echo fades." Bill: "Let's do it. And if anyone asks, we're the cleanup crew, not the presents thieves." The night smelled of frosting and popcorn. They slipped through the side gate, the wagon humming, the laughter of children echoing behind them. The plan was ridiculous, the stakes were trivial, but for Molly and Bill it was the perfect birthday heist. Bill: "What about the decoy? We swap the console with a box of old socks and a note that says the elves are on strike." Molly: "Perfect. The parents will think the kids stole it, and we'll be long gone." Bill: "And the other presents? We fill each bag with a mix of real toys and empty boxes, then shuffle them like a magician." Molly: "Exactly. The wagon will look like a moving birthday parade, and nobody will suspect a thing." Bill: "Alright, let's roll. The music's blasting, the kids are dancing, and the parents are busy filming. This is our moment." Bill: "On three. One, two, go!" They lunged forward, the wheels squeaking, the night alive with giggles. Molly: "Let's make this birthday unforgettable for everyone." Molly: "Ready?" Bill: "Go!" The stars twinkled above, the night promising mischief. They grinned, the wagon wheels humming, ready for the next adventure.
created_at
"2026-02-17T09:36:21.996845+00:00"
2/17/2026, 9:36:21 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2685
moderation_latency
(null)
generation_time
403479
tokens_prompt
77
tokens_completion
16878
native_tokens_prompt
101
native_tokens_completion
18617
native_tokens_completion_images
(null)
native_tokens_reasoning
17937
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0223707
router
(null)
provider_responses
0
id
"05e36801db928367e21043fb682f26dc"
status
200
is_byok
false
latency
2685
endpoint_id
"3afb00c4-6135-4c55-96e9-6befa9679b06"
provider_name
"Minimax"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771320577-4vXEHgbWoFOTXLwa0R6t"
upstream_id
"05e36801db928367e21043fb682f26dc"
total_cost
0.0223707
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Minimax"
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
506
96.61%
Dialogue to Total Word Ratio
Ratio: 69.17%, Deviation: 0.83%
neededClean
false
wordsTotal
506
wordsDialogue
350
97.6631%