NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1H7AJQV8Z4DACYDFTW5137
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
12m 48s
Cost:
$0.049580
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
99.81%
Dialogue to Total Word Ratio
(details)
99.9032%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his novelty necktie, a hideous garment covered in dancing bananas, and squinted at the so-called blueprints spread across the kitchen table. The blueprints were a child's crayon drawing of the Henderson residence on a napkin, drawn by the birthday girl herself, since Bill had not graduated to requiring actual blueprints. "Okay, Moll, walk me through it. The Henderson kid's sixth birthday party. How hard can five-year-olds be?" Molly stroked her fake mustache, a real mustache she had glued to her face that morning. "Harder than you'd think. Tiny humans are unpredictable. One tackled me at a supermarket." "You were disguised as a clown." "That's not the point. The point is, they bite." "Fair point. So what's the play?" Bill pulled a crayon map from his pocket and pinned it to the fridge with a magnet. "I've surveilled. Backyard bouncy castle, piƱata zone, and a present table guarded by a suspicious golden retriever." "The dog's named Mr. Biscuit. I've already befriended him with organic bacon treats." Molly nodded approvingly, her mustache wobbling dangerously. "Okay, entry strategy?" "You go in as the entertainment. I've got us a clown costume, a unicycle, and twenty-two helium balloons." "Twenty-two?" "One's for you. Twenty-one is a smoke screen. The unicycle is mostly symbolic." Bill scratched his head, his banana tie producing an odd sound neither commented on. "Moll, what if parents recognize you? You did get arrested at that puppet show in '09." She brushed this off with the dignity of a professional who had definitely been arrested at a puppet show. "That was entrapment. The puppets were holding drugs. I'm clean now. Mostly. What about the getaway?" Bill produced a deflated bouncy castle from behind the curtain with the solemnity of a magician. "We bounce out. I inflate it, we jump the fence, and land in the neighbor's koi pond." Molly paused, her mustache listing dramatically. "Bill, I have so many questions, but mostly: do you know how to inflate a bouncy castle?" He grinned the sheepish grin of a man who had watched tutorials but had not, in fact, inflated anything. "I watched a YouTube tutorial. Three times. The thumbnail was very encouraging." She sighed the weary sigh of a veteran criminal who had seen some things. "Fine. What about the loot?" Bill produced a list on the back of a stolen birthday invitation. "Xbox, American Girl doll, a pony, a Furby, and a very aggressive harmonica. We fence the pony, keep the Furby, donate it." Molly stared at the list, then at Bill, then back at the list. "Why is 'crying permitted' written in the margins?" Bill shrugged with the easy confidence of a man who had never once read a heist manual. "It's a heuristic, Moll. Tiny humans cry. You cry, I distract with balloons, we grab the loot, we bounce. Simple." They clinked coffee mugs, because mugs were all they had, and began packing the clown costume, twenty-two balloons, and a single squeaky shoe into a panda-shaped duffel bag.
created_at
"2026-06-01T12:06:04.638Z"
6/1/2026, 12:06:04 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1278
moderation_latency
(null)
generation_time
767967
tokens_prompt
77
tokens_completion
36710
native_tokens_prompt
236
native_tokens_completion
41302
native_tokens_completion_images
(null)
native_tokens_reasoning
35928
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.04957992
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca8ac2bf3074958c9a7323daf0ece"
is_byok
false
latency
1278
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780315564-LDyuOu1yC2fjX0pW8bpv"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780315564-3YydTFA4mXsABfl1LmIx"
upstream_id
"066ca8ac2bf3074958c9a7323daf0ece"
total_cost
0.04957992
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
99.81%
Dialogue to Total Word Ratio
Ratio: 50.20%, Deviation: 0.20%
neededClean
false
wordsTotal
508
wordsDialogue
255
99.9032%