NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM1FV5GC3HR28WW3YP1A55QT
Run Details
Model:
MiniMax M2.7
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 1s
Cost:
$0.007621
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
2.01%
Matches word count
(details)
36.72%
Dialogue to Total Word Ratio
(details)
19.3664%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
In the dim basement of an old candy shop, Molly and Bill hunched over a map of the upcoming birthday bash. The basement smelled of old chocolate and fresh tape. A dim bulb swung overhead, casting jittery shadows on a hand‑drawn layout of the community center where the party would be held. Molly's eyes glittered like stolen candy, while Bill wiped a grease stain from his elbow, muttering about the last time he tried to juggle. The map was circled in bright pink marker, marking each exit like a trail of frosting. Molly: "Okay, Bill, you ready for the biggest gig of the year? All those presents just waiting for us to swipe 'em." Bill: "Ready? I'm still trying to figure out how to get past the bouncy castle without blowing my cover." Molly: "The castle's a red herring. The real entrance is through the back door, disguised as a delivery driver with a tray of cupcakes." Bill: "Cupcakes? Nice. And we can hide the loot in a giant inflatable dinosaur the kids are obsessed with." Molly: "Exactly! We'll stash the toys inside the dino, then waddle out when everyone's distracted by the clown's balloon pop." Bill: "And if the clown actually pops a real balloon, we lose our chance. Maybe we should bring a backup noise maker." Molly: "We could use a tiny speaker playing a recording of a crying baby—parents will be so worried they'll freeze." Bill: "That's deviously clever. But we also need a quick escape route. The parking lot will be packed with minivans." Molly: "I mapped a side alley that leads straight to the van. We just have to time the exit with the birthday song finale—everyone sings, we slip out." Bill: "And what about the gift list? If we take everything, the kid's grandma will definitely notice." Molly: "We won't take it all. We'll selectively grab the pricey stuff—video games, drones, the giant Lego set." Bill: "And the rest of the loot? We could leave a fake thank‑you note signed by the birthday boy, maybe a candy wrapper as a clue." Molly: "A little humor never hurts. The parents will think the kids are playing a prank, and we get out clean." Bill: "Alright, here's the plan: I go in as the clown's assistant with a bag of balloons, you slip in as the caterer with cupcakes. We'll meet behind the dinosaur." Molly: "And when the music swells, we signal each other with a quick wink and a thumbs‑up. Then we grab the stash and vanish." Bill: "If anything goes sideways, we retreat to the kitchen, pretend to refill the frosting, and use the fire exit." Molly: "Sounds like a plan. Let's rehearse the handoff. You hand me a balloon, I hand you a cupcake. No one's the wiser." Bill: "Alright, the stage is set. Let's get those kids' loot and make this birthday the most unforgettable—for us, anyway." Molly: "Here's to a sweet heist and sweeter profits. On three, we break!" Molly counted down, and the two vanished into the night, leaving only the echo of giggling children and a faint scent of frosting.
created_at
"2026-03-18T22:07:46.707Z"
3/18/2026, 10:07:46 PM
model
"minimax/minimax-m2.7-20260318"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
733
moderation_latency
(null)
generation_time
120803
tokens_prompt
77
tokens_completion
6537
native_tokens_prompt
101
native_tokens_completion
6326
native_tokens_completion_images
(null)
native_tokens_reasoning
5655
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0076215
router
(null)
provider_responses
0
endpoint_id
"471cf4ea-3548-4dd7-9f86-8c0f9c21d3ca"
id
"060a55322bed0a676049ee2b94f1ace1"
is_byok
false
latency
733
model_permaslug
"minimax/minimax-m2.7-20260318"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773871666-WlNXuoCW8iMye9ZsN9tp"
upstream_id
"060a55322bed0a676049ee2b94f1ace1"
total_cost
0.0076215
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Minimax"
Evaluation details
Result
Evaluator
Details
Meta Data
2.01%
Matches word count
n/a
neededClean
false
words
525
36.72%
Dialogue to Total Word Ratio
Ratio: 74.48%, Deviation: 4.48%
neededClean
false
wordsTotal
525
wordsDialogue
391
19.3664%