NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1GCPFVZCQSNHNBM6SHENRR
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
9m 10s
Cost:
$0.036847
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
95.24%
Dialogue to Total Word Ratio
(details)
97.6176%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted through the garden fence, watching the inflatable castle sag slightly in the breeze. Beside her, Bill adjusted his fake mustache for the third time, muttering about adhesive. The suburban garden was a festival of pastel bunting and suspicious helium balloons, every available surface claimed by the sugar-fuelled chaos. "Bill, this is either your worst idea or your best," Molly said, eyeing the bouncy castle. "It'll be brilliant! Have I ever steered you wrong, partner?" "You got us arrested stealing a reindeer from a petting zoo." "Minor setback. Birthday party at 3 PM sharp. Thirty kids. Thirty goody bags. Chocolate, novelty pencils, the works." Bill produced a crumpled notebook from his inside pocket, flipping to a page covered in stick-figure diagrams. He tapped it gleefully with the enthusiasm of a man who had absolutely not thought this through. "Bill, we do banks, jewelry, art—" "Nostalgia's the real gold, Molly. Those juice boxes? Absolute collector's items in twenty years." "You're ridiculous." Molly pinched the bridge of her nose and muttered something unrepeatable. Through the fence, she could see a woman wrestling a sausage roll out of a toddler's grasping hand. This was, without question, far beneath them. "I've drawn blueprints." "You drew blueprints for a six-year-old's party?" "Entry through the piñata station, distraction at face-painting. Plus the clown looks like your Uncle Gerald." Molly felt a twitch in her eye. The clown in question was indeed hauntingly familiar, his oversized shoes flopping as he wandered straight into a rosebush. "Oh God. Please, not the clown." "Worse. He's on helium. Completely useless." "So we just waltz in?" A woman screamed in the garden nearby. A small child had birthday cake smashed across his forehead. Bill didn't even flinch. "Never waltz. Always tango. You're 'Belinda, the storytelling aunt.' I'm 'Wilbur, the dad who brought too much sangria.'" "What about security?" "One confused man in a rabbit suit. He high-fived me at the gate last year. Not exactly James Bond." "And the parents?" "Drunk by 3:30. Open bar. Unresolved marriage issues. I did my recon." Molly watched a dad attempt to negotiate a peace treaty over a deflated football. The operational environment was, she had to admit, beautifully chaotic. "Bill, if we get caught stealing from children—" "We're not stealing FROM them. We're relieving parents of poor decisions. It's a public service." "You told everyone at the pub this would be our greatest job." Bill straightened his mustache with tremendous, almost defiant dignity. Somewhere in the garden, a child began to wail in the key of E minor. "A heist so bold they'd make a feature documentary." "About us being chased by a man in a rabbit suit?" "Exactly. Theatrical." "Fine. But I get first pick of the presents." Molly sighed, the weary, defeated sigh of a woman who had committed to many bad ideas over the years but none quite this spectacularly bad. The bouncy castle deflated another inch, as if in sympathy. "Bring a party hat. We commit to the bit."
created_at
"2026-06-01T11:51:32.095Z"
6/1/2026, 11:51:32 AM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1561
moderation_latency
(null)
generation_time
549939
tokens_prompt
77
tokens_completion
26772
native_tokens_prompt
236
native_tokens_completion
30691
native_tokens_completion_images
(null)
native_tokens_reasoning
25943
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.03684672
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca5443c3e7c1734b4d2a1ffdfc49e"
is_byok
false
latency
1561
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780314692-crGVhRMlHLI0DU9dtgna"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780314692-R90idvXHibEBUbBEpRjQ"
upstream_id
"066ca5443c3e7c1734b4d2a1ffdfc49e"
total_cost
0.03684672
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
95.24%
Dialogue to Total Word Ratio
Ratio: 49.01%, Deviation: 0.99%
neededClean
false
wordsTotal
506
wordsDialogue
248
97.6176%