NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1GAHYTZ934VQZVDE6NF1GP
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
41.4s
Cost:
$0.000958
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
0.05%
Dialogue to Total Word Ratio
(details)
0.0406%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her oversized novelty sunglasses and stared at the flyer stapled to the telephone pole. "Bill," she said, "this is it. Our retirement plan." Bill squinted at the paper. "A birthday party? For a kid named... Declan?" "Seven years old. Look at that gift registry. Molly pointed at a laminated list. There's a PS5, two iPads, and what I can only assume is a solid gold pony." "A solid gold pony, Molly?" "It's a very generous family, Bill." Bill tugged at the strap of his duffel bag, which clinked ominously. "I don't know. Kids' parties have parents. Parents have eyes. Eyes have 911 buttons." "That's why we'll dress as entertainment." Molly produced two glittering wigs from her satchel. One was neon pink. The other resembled a frightened parrot. "Absolutely not. I'm not wearing the parrot." "You'll wear the parrot or you'll wear the chicken, and trust me, Bill, the chicken is humiliating. I wore it to my nephew's baptism and his godmother still won't make eye contact." Bill reluctantly took the parrot. "Fine. What's my cover story?" "You're Declan's surprise uncle from Winnipeg. I've been Declan's clown aunt for three months. We've been preparing a very special magic show." "A magic show, Molly? You can't even shuffle a deck without dropping half the cards." "Which is why you'll be doing the actual magic. You're a retired accountant. You can misdirect. You do it every April." Bill sighed, pulling the parrot wig over his head. "Okay. Entry. Then what?" "Balloons," Molly said firmly. "We bring in a giant box of balloons. The parents are distracted. We mingle. We identify the gift table. We create a diversion." "Which is?" "A piƱata emergency. I'll pretend it's filled with bees." "It won't be filled with bees, Molly." "It will be filled with wasps, Bill, and I will scream with such conviction that every adult in that yard will form a protective human shield around the children, leaving the gift table completely unsupervised for approximately ninety seconds." "And then I swoop in." "With the duffel." "With the duffel." Bill paused. "What if there's a dog?" "There's always a dog. That's why I bought sixty hot dogs." "Sixty?" "The dog will be so distracted by the cornucopia of processed meat that it won't notice me waddling out with a sack of seven-year-old's birthday loot." Molly snapped the wig into place. "We'll split it seventy-thirty." "Why seventy-thirty?" "Because I did the recon. I tailed the birthday mom at Trader Joe's. I know her coffee order. I know her dog's name. I am emotionally invested in this heist, Bill." "Fine. Seventy-thirty." Bill adjusted his beak. "But if we get arrested, you're explaining the parrot to the officer." "Deal." Molly grinned. "Now help me into the clown car. I mean the clown van. I mean the clown situation."
created_at
"2026-06-01T11:50:21.919Z"
6/1/2026, 11:50:21 AM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1500
moderation_latency
(null)
generation_time
41363
tokens_prompt
77
tokens_completion
894
native_tokens_prompt
236
native_tokens_completion
762
native_tokens_completion_images
(null)
native_tokens_reasoning
145
native_tokens_cached
114
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00095784
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca4fd316712f82ac3cfe3adf4f4c5"
is_byok
false
latency
1500
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780314621-W1NKsJ66i0pQJw6wN9nq"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780314621-Yn3WR5qYBrlbSuuCfNDl"
upstream_id
"066ca4fd316712f82ac3cfe3adf4f4c5"
total_cost
0.00095784
cache_discount
0.00005472
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
470
0.05%
Dialogue to Total Word Ratio
Ratio: 82.32%, Deviation: 12.32%
neededClean
false
wordsTotal
475
wordsDialogue
391
0.0406%