NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM1G4ZYWFYXJEBZMSNHD77KW
Run Details
Model:
MiniMax M2.7
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4m 5s
Cost:
$0.018832
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
99.98%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.9920%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Outside the glittery bounce house, the afternoon sun glinted off a stack of presents, and Molly and Bill hunched over a folding table, whispering a plan that was equal parts genius and absurdity. Molly pulled out a hand-drawn map, and Bill smoothed a crumpled list of party guests. Molly: Alright, here's the scoop: the party starts at three, the kids are all wearing superhero capes, and the presents are piled in a glittery mountain near the cake table. Bill: Got it. We'll need a distraction that outshines a three-year-old's shriek when the clown trips over a balloon. Any ideas? Molly: How about we fake a magical unicorn visit? I can blow a glitter cannon, shout 'Unicorn poop!' and while the kids chase sparkles, we slip in under the cake's shadow. Bill: Brilliant, but what if the parents start snapping photos? We need a cover story, maybe a surprise 'gift-wrapping show' where we claim to be professional party entertainers. Molly: Exactly! We'll hand out shimmering ribbons, claim it's a 'Ribbon Relay,' and while the kids dash around, we slip the presents into our magic bags, labeled 'Super Secret Stash.' Bill: The only snag is the cake. It's a three-tiered chocolate monstrosity, and if I even get a whiff, I might turn into a chocolate-obsessed monster. Can we bribe the baker with a free ticket to a 'Gummy Bear Gala'? Molly: Deal. And I already printed the invites: 'All kids get a surprise glitter bomb after the cake.' We'll toss the glitter when the lights flicker for the birthday song, giving us the perfect cover. Bill: Alright, we need a signal. How about I shout 'Pudding Panic!' and you respond with 'Unicorn Surge'? The kids will think it's a game, and we slip out. Molly: Perfect! And I have a tiny remote-controlled drone that can hover over the cake, drop confetti, and double as a distraction. We'll claim it's a 'Magic Cloud' for birthday wishes. Bill: Great, now the escape route: the back gate leads to a bouncy castle, which conveniently collapses after a certain weight. I'll set the timer for 15 minutes after the party starts. Molly: And I’ll have the loot bags ready—sturdy canvas totes with a ‘Thank You for Attending’ tag, so the parents think we’re just cleaning up. We'll be out before the cake is cut. Bill: One last thing: if the clown gets suspicious, I'll flash a fake ‘Birthday Wizard’ badge and claim the presents are cursed, needing a special extraction team—us. Molly tucked the glitter cannon into a bright pink backpack, and Bill slipped a rubber chicken into his coat, muttering about its 'sonic squeak'. They rehearsed their lines in whispered, theatrical tones, each trying not to laugh. As the clock struck two, they split up, with a final nod and a promise to meet at the 'Unicorn Surge' sign, ready to turn a child's celebration into the greatest present heist the town had ever seen. The children laughed, unaware of the chaos about to unfold.
created_at
"2026-03-18T22:13:08.706Z"
3/18/2026, 10:13:08 PM
model
"minimax/minimax-m2.7-20260318"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1935
moderation_latency
(null)
generation_time
244841
tokens_prompt
77
tokens_completion
14728
native_tokens_prompt
101
native_tokens_completion
15668
native_tokens_completion_images
(null)
native_tokens_reasoning
15000
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0188319
router
(null)
provider_responses
0
endpoint_id
"471cf4ea-3548-4dd7-9f86-8c0f9c21d3ca"
id
"060a567480c87a9e0d24496360ffbfda"
is_byok
false
latency
1935
model_permaslug
"minimax/minimax-m2.7-20260318"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773871988-Bc9x8tUJOGASP6CVgfeW"
upstream_id
"060a567480c87a9e0d24496360ffbfda"
total_cost
0.0188319
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Minimax"
Evaluation details
Result
Evaluator
Details
Meta Data
99.98%
Matches word count
n/a
neededClean
false
words
502
0.00%
Dialogue to Total Word Ratio
Ratio: 27.11%, Deviation: 42.89%
neededClean
false
wordsTotal
509
wordsDialogue
138
49.9920%