NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1GN94KNJ6YSQQ485E2X4MJ
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4m 53s
Cost:
$0.015283
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
35.00%
Matches word count
(details)
92.20%
Dialogue to Total Word Ratio
(details)
63.6029%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled of stale crisps and regret, which Molly had come to consider the official perfume of a life of crime. Outside, the rain tapped politely against the windscreen like a butler who knew he was unwelcome. She turned to her partner and immediately reconsidered every choice that had led her to this moment. "Bill, are you wearing a clown suit?" "What? No! It's a tactical infiltration outfit." "It has a red nose on it." "That's for distraction purposes." Molly reached for coffee that had gone cold three hours ago, a habit that perfectly mirrored her enthusiasm for the job. "You're wearing a clown suit to rob a children's party. Of course you are. Why wouldn't you be?" "Technically it's a jester suit. Got it off the internet. The reviews were excellent." "What did the reviews say?" "One star. Man named Gerald. Very upset. But I'm sure he was being unreasonable." She closed her eyes, briefly considering a career in accounting. "Walk me through the plan, Bill. Slowly. As if explaining to a particularly tired golden retriever." He puffed out his chest, which in a jester suit looked deeply undignified. "Phase one. I arrive as hired entertainment." "You're the entertainment?" "I am wildly entertaining." "You once made a security guard weep by reading him his own employee handbook." "He was misinterpreting section four! It's a hostile work environment when you call Sandra 'a menace to paperwork.' Anyway, phase two: I herd the children into the bouncy castle." "And then?" "Then I distract them. With joy. Possibly juggling." "The parents, Bill." "Right. Parents. Hm." He scratched the jester hat. "What about them?" "They'll be filming the cake. Six minutes minimum. That's our window." Molly produced a napkin, on which she had drawn something that, in a very generous light, could be called a floor plan. "Living room. Presents behind the sofa. Wrapped in shiny paper. Names on the tags." "Looks like a dog drew it." "Architect dog. Rules. We do not touch the cake." "Why?" "Because last time you ate the evidence and we had to flee in the bouncy castle." "It was a very nice bouncy castle." "It was a rental. They want eight hundred pounds. They've described it as 'emotionally haunted,' Bill. I don't know what that means and I don't want to." "I didn't scream at it. I shouted near it." "You screamed at it while dressed as a wizard." "Emotionally!" She folded the napkin with the cold efficiency of a woman who had once orchestrated a diamond heist in Prague and now found herself negotiating with a man in tights. "One more rule. We do not befriend the birthday child." "What if she's nice?" "Especially then. Last time a child gave you a drawing and you cried for an hour." "She drew me as a wizard. It was moving." "You don't have a beard, Bill." "I could." "No you couldn't. Get in the van." He straightened his hat, dignity returning. "Best partners." "Bill." "Best, best partners." "Get in the van before I leave you in the bouncy castle." "Romantically or—" "In the van. Now."
created_at
"2026-06-01T11:56:13.34Z"
6/1/2026, 11:56:13 AM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1089
moderation_latency
(null)
generation_time
292767
tokens_prompt
77
tokens_completion
12567
native_tokens_prompt
236
native_tokens_completion
12721
native_tokens_completion_images
(null)
native_tokens_reasoning
11739
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.01528272
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca65deb079167eb0cb0f7eca6081a"
is_byok
false
latency
1083
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780314973-LrgzzmQHsuZ3xNnihazw"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780314973-w1megZXD4B5QaPxBLu5B"
upstream_id
"066ca65deb079167eb0cb0f7eca6081a"
total_cost
0.01528272
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
35.00%
Matches word count
n/a
neededClean
false
words
518
92.20%
Dialogue to Total Word Ratio
Ratio: 68.73%, Deviation: 1.27%
neededClean
false
wordsTotal
518
wordsDialogue
356
63.6029%