NC Bench
Run: 2026-03-18-01KM1G1BCRASCW1S3NZTC2THJ9
Run Details
Model: MiniMax M2.7
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 3m 12s
Cost: $0.012821
Provider: Minimax
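The duration above can be cross-checked against the raw timing and token fields listed under Meta further down this page. The sketch below is illustrative only. It assumes that `generation_time` is reported in milliseconds and that `native_tokens_reasoning` is counted inside `native_tokens_completion`; neither is stated on this page, and the variable names are simply copied from the Meta fields.

```python
# Values copied from the Meta fields of this run.
generation_time_ms = 191_638        # assumed to be milliseconds
native_tokens_completion = 10_659
native_tokens_reasoning = 10_015    # assumed to be a subset of completion tokens

# Rounded wall-clock duration, split into minutes and seconds.
minutes, seconds = divmod(round(generation_time_ms / 1000), 60)

# Tokens left over for the visible 505-word scene, under the subset assumption.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Share of the completion spent on reasoning, and overall throughput.
reasoning_share = native_tokens_reasoning / native_tokens_completion
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"duration: {minutes}m {seconds}s")
print(f"visible tokens: {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Under these assumptions the rounded duration agrees with the 3m 12s shown in Run Details, and most of the completion tokens were spent on reasoning rather than on the scene itself.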
Evaluation summary
| Score | Evaluator |
| --- | --- |
| 99.38% | Matches word count |
| 22.70% | Dialogue to Total Word Ratio |
| 61.0397% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The moon hung low over the Hendersons' backyard, a pale slice of cheese in a sky of blueberry jam. Colorful balloons bobbed like buoys in a sea of grass, and the air was thick with the sweet perfume of vanilla frosting and the faint squeak of a squeaky toy. Inside, the birthday party was a riot of shrieks, glittery hats, and sugar‑fueled chaos. A towering cake shaped like a dragon loomed over a table piled high with gift bags, each one bulging with toys that seemed to whisper 'take me' to any passing thief with a taste for mischief. From behind a hedge of overgrown roses, Molly adjusted her oversized sunglasses and whispered to Bill, who was busy untangling a set of lock picks disguised as candy cigarettes. The plan, such as it was, involved a glitter‑filled piñata that would burst on cue, a fake clown who could juggle a present or two, and a silent alarm disguised as a singing birthday card. "Okay, Bill, here's the scoop. The piñata's the main distraction, right? The kids will swing, it bursts, and while they're scooping up candy, we slip under the table and start snatching gift bags." "Got it. And I'll handle the singing card. Punch in 1234, and the alarm goes silent for fifteen seconds. That's our window." "Fifteen seconds is an eternity if you count in heartbeats. We'll be in and out faster than a toddler can yell 'more cake'." "And if someone spots us, we just claim we are the magical entertainers hired for the party. We'll have a backup story about being late for the cake tasting." "Perfect. Also, the glitter bomb? It's optional, but it adds a nice shimmer to the getaway. We can claim we spilled a bag of sparkle while practicing a trick." "Alright, we have the distraction, the code, and a glittery alibi. Let's do this before the cake gets sliced." Molly slipped through the gate, her sneakers crunching on the grass as the music from a tiny speaker pulsed like a heartbeat. The piñata hung like a neon orange lantern, swaying gently in the night breeze. 
She gave Bill a thumbs‑up, and he crept toward the birthday card that sat propped against a stack of napkins. With a swift tap on its hidden keypad, the card emitted a soft chime, and the room's soft alarm fell silent. Bill's fingers danced over the keypad, punching in the code 1‑2‑3‑4. The screen flickered green, signaling success. The two thieves moved like shadows, scooping up gift bags as if they were picking ripe fruit. A toddler tugged at a glittery box, and Molly swooped in, snatching it with a grin that could outshine the birthday candles. Just as the last present disappeared into their sacks, the piñata burst in a spectacular shower of candy, sending the children into a frenzy of giggles. Molly tipped her hat, Bill winked, and they vanished into the night, their pockets jingling with stolen joy and a plan already forming for the next birthday blowout.
created_at: "2026-03-18T22:11:09.346Z" (3/18/2026, 10:11:09 PM)
model: "minimax/minimax-m2.7-20260318"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 799
moderation_latency: (null)
generation_time: 191638
tokens_prompt: 77
tokens_completion: 10656
native_tokens_prompt: 101
native_tokens_completion: 10659
native_tokens_completion_images: (null)
native_tokens_reasoning: 10015
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0128211
router: (null)
provider_responses:
  0:
    endpoint_id: "471cf4ea-3548-4dd7-9f86-8c0f9c21d3ca"
    id: "060a55fd1c25834b02114c0012455405"
    is_byok: false
    latency: 798
    model_permaslug: "minimax/minimax-m2.7-20260318"
    provider_name: "Minimax"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773871869-uBq4NN2nFt4Qc62t3qjB"
upstream_id: "060a55fd1c25834b02114c0012455405"
total_cost: 0.0128211
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Minimax"
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 99.38% | Matches word count | n/a | neededClean: false, words: 505 |
| 22.70% | Dialogue to Total Word Ratio | Ratio: 35.45%, Deviation: 5.45% | neededClean: false, wordsTotal: 505, wordsDialogue: 179 |
| 61.0397% | Overall | | |
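The ratio and deviation reported above follow directly from the two word counts, and the overall figure is consistent with a plain average of the two evaluator scores. The sketch below re-derives them. The helper `dialogue_ratio` is hypothetical: it treats any text between straight double quotes as dialogue, which may differ from how the benchmark counts dialogue words. Averaging the two scores is likewise an assumption that happens to match the reported overall value within rounding. The page does not show how a 5.45% deviation maps to a 22.70% score, so that step is not reproduced.

```python
import re

TARGET_RATIO_PCT = 30.0  # from the scenario: "500 words with 30% dialogue"


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Hypothetical counter: words inside straight double quotes vs. all words."""
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


# Counts reported in the evaluation details.
words_dialogue = 179
words_total = 505

ratio_pct = round(100 * words_dialogue / words_total, 2)
deviation_pct = round(abs(ratio_pct - TARGET_RATIO_PCT), 2)

# Assumption: the overall score is the arithmetic mean of the evaluator scores.
overall_estimate = (99.38 + 22.70) / 2

print(f"ratio: {ratio_pct}%, deviation: {deviation_pct}%")
print(f"overall (estimated): {overall_estimate:.2f}%")
```

For example, `dialogue_ratio('Molly whispered. "Got the code?" Bill nodded.')` counts 3 dialogue words out of 7 in total.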