NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1GC9ZP9MKDTMWC50CECTJ5
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
38.2s
Cost:
$0.000837
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
0.08%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0424%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his novelty Groucho Marx glasses for the fifteenth time, clearly not comfortable with the disguise. He stood in the middle of their cluttered apartment, a half-eaten sandwich balanced precariously on a stack of stolen encyclopedias. Across from him, Molly paced back and forth, her fingers drumming an anxious rhythm against the blueprints spread across the coffee table. Bill, I'm having second thoughts about this one, he said, his voice cracking slightly. Oh, come off it, Bill. We've pulled off bigger jobs than this. Remember the retirement home bingo hall? That was different. They had security. This is, Bill gestured vaguely, a room full of six-year-olds hopped up on sugar and Pixy Stix. That's basically a feral army. Molly rolled her eyes and stabbed a finger at the blueprints. Focus, Bill. Eighteen kids. Two parents. One disgruntled clown who, according to my source, hates children and will look the other way for a twenty. Twenty bucks? The clown takes bribes? The clown takes a lot of things, Bill. Probably why he's so bitter. Now, the presents will be piled near the piñata. Classic misdirection. Everyone watches the stick-wielding maniac swinging at the papier-mâché donkey, and nobody notices two professionals quietly making off with fifteen years worth of educational toys. Bill scratched his chin, looking unconvinced. And our exit strategy? Simple. I'll dress as a birthday mom, oversized sunglasses, a tote bag, and one of those ridiculous headbands with the cat ears. You, my friend, will be the entertainment. Me? Bill's eyes went wide. I'm not dressing as a clown, Molly. I have a reputation in the underworld. You're dressing as a magician, Bill. Slightly less tragic. You go in through the back with a collapsible rabbit cage and a bag of tricks. By the time you pull the quarter out from behind some kid's ear, I'll be out the front with the loot. Bill, you don't understand. Children terrify me. Last Christmas, my nephew looked at me for two hours straight and I lost the ability to form words. Molly walked over and patted him on the shoulder. That's exactly the energy we need, Bill. Mild, unthreatening, mildly confused middle-aged man. You're practically already in character. Bill considered this for a moment, then pulled out his phone and began scrolling. Fine. But I have conditions. First, we don't make eye contact with any of the kids. Second, if a child asks if I'm a real magician, I am required by my personal code of ethics to tell the truth. Which is no. Third, no glitter. Under any circumstances. Molly grinned and raised an imaginary glass. To the easiest fifty grand of our careers. Bill didn't look convinced but raised his sandwich in solidarity. To the feral army, he muttered. And to hopefully not getting tackled by a roomful of kindergarteners.
created_at
"2026-06-01T11:51:19.296Z"
6/1/2026, 11:51:19 AM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1200
moderation_latency
(null)
generation_time
38158
tokens_prompt
77
tokens_completion
831
native_tokens_prompt
236
native_tokens_completion
683
native_tokens_completion_images
(null)
native_tokens_reasoning
97
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00083712
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca537931bd8ad70286bd6f8c9cb37"
is_byok
false
latency
1200
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780314679-jnfoQXwDE1hZ0GDKHu38"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780314679-scbP43YayBSXyZIxY1oU"
upstream_id
"066ca537931bd8ad70286bd6f8c9cb37"
total_cost
0.00083712
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.08%
Matches word count
n/a
neededClean
false
words
471
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
477
wordsDialogue
0
0.0424%