NC Bench
Run: 2026-06-01-01KT1GJ0QP9R0BQFVACP0KDJCJ
Run Details
Model: MiniMax M3
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 43.3s
Cost: $0.000965
Provider: Minimax
Evaluation summary
Score      Evaluator                      Details Link
99.74%     Matches word count             (details)
0.00%      Dialogue to Total Word Ratio   (details)
49.8722%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the bouncy castle, her pink tutu rustling in the gentle breeze of an overpriced outdoor birthday venue. She adjusted her tiara—essential for blending in with the six-year-olds—and checked her watch for the fourteenth time in three minutes.
"Right, Bill," she whispered. "Target acquired. Twelve kids, two exhausted parents, one piñata shaped like a horse."
Bill emerged from behind a helium balloon arrangement, his Spider-Man costume straining at the seams. He was six foot four and had chosen the most ridiculous muscle-padded costume available, which had seemed funny at the online checkout and less funny now.
"Tell me again why we're doing this?" he muttered, scratching under his mask. "We could rob a bank. Banks have vaults. Vaults don't have children screaming about sugar highs."
"Because, you great lummox, banks have security guards and these parents have a bored teenager DJ who keeps playing Baby Shark on repeat. That's our security guard. He's already losing the will to live."
Bill considered this, watching a small girl attempt to headbutt the piñata with catastrophic results to her frosting-smeared face. "Fair point. But I still feel like a monster."
"Oh, grow a spine. We've done bigger jobs. The Louvre? Easy. The Crown Jewels? A doddle. This?" Molly gestured grandly at the carnage of half-eaten sandwiches and trampled party bags. "This is amateur hour."
"What's our exit strategy?"
Molly grinned beneath her fake mustache, which she had insisted was a "disguise" and not a leftover from a previous job at a magician's convention. "The bouncy castle, obviously. We bounce our way over the fence while everyone's distracted by the clown."
Bill squinted at the clown in question, who was currently trying to fold himself into a Volkswagen Beetle. "Distracted might be an understatement."
"Exactly. Now, the presents are in that blue gift table near the cake. Twenty-three wrapped packages. I count at least five that look like they're hiding gift cards. The rest are probably DVDs and socks, but we'll take our chances."
"What about the parents?"
"Bill. The mother is crying into her prosecco. The father is trying to assemble a trampoline without instructions. We are invisible."
Bill took a deep breath, psyching himself up. "I can't believe I'm saying this, but I'm ready. For the heist of the century."
"It's a children's birthday party."
"The heist of the century relative to our dignity, Molly. Our dignity is in tatters."
"Then let's retrieve it, along with every polly pocket set in that pile." Molly stood, brushed off her tutu, and produced from her bag not a weapon but a comically oversized candy cane. "Stick of distraction. Oldest trick in the book."
"I thought the oldest trick was telling kids there's healthy candy."
"Bill, you monster, that's genius. Deploy that immediately. I'll be on the gift table."
With synchronized nods, two of Britain's most wanted thieves waddled toward a cluster of delighted children, their criminal empire hinging on glitter, lies, and the unshakeable belief that somewhere in this suburban chaos lay their greatest prize.
created_at
"2026-06-01T11:54:26.426Z"
6/1/2026, 11:54:26 AM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2282
moderation_latency
(null)
generation_time
43191
tokens_prompt
77
tokens_completion
954
native_tokens_prompt
236
native_tokens_completion
768
native_tokens_completion_images
(null)
native_tokens_reasoning
132
native_tokens_cached
114
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00096504
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca5f21a63764c679e47748f21e7d8"
is_byok
false
latency
2282
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780314866-zakzXURSLG6ax76AlPJ8"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780314866-xP8lZbW1OWDZGe6a1EeJ"
upstream_id
"066ca5f21a63764c679e47748f21e7d8"
total_cost
0.00096504
cache_discount
0.00005472
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result     Evaluator                      Details                            Meta Data
99.74%     Matches word count             n/a                                neededClean: false, words: 504
0.00%      Dialogue to Total Word Ratio   Ratio: 53.91%, Deviation: 23.91%   neededClean: false, wordsTotal: 512, wordsDialogue: 276
49.8722%   Overall
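The reported ratio and deviation can be reproduced from the metadata with simple arithmetic. The sketch below is not the evaluator's actual code, which this page does not show. It assumes the ratio is wordsDialogue divided by wordsTotal and the deviation is the absolute distance from the 30% target in the scenario. The function names `dialogue_stats` and `count_dialogue_words` are hypothetical, and counting words inside straight double quotes is only one plausible way to measure dialogue.

```python
import re


def dialogue_stats(words_dialogue, words_total, target_ratio):
    """Return (ratio, deviation) under the assumed formula:
    ratio = dialogue words / total words, deviation = |ratio - target|."""
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target_ratio)
    return ratio, deviation


def count_dialogue_words(text):
    """Hypothetical counter: words inside straight double quotes.
    The real evaluator may tokenize or detect dialogue differently."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in quoted_spans)


# Values taken from the Meta Data column: wordsDialogue 276, wordsTotal 512,
# with the 30% target from the scenario.
ratio, deviation = dialogue_stats(276, 512, 0.30)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

With the values from this run, the printed line matches the Details column (53.91% and 23.91%). How the evaluator maps that deviation to a 0.00% score is not stated on the page, so the sketch stops at the deviation.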