NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1HSK4GVRM801HH4BCVFYYF
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
1m 10s
Cost:
$0.002350
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
31.24%
Dialogue to Total Word Ratio
(details)
15.6192%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled a blueprint across the hood of their getaway car. Bill adjusted his balaclava nervously, the elastic snapping against his pudgy cheeks. "Alright, Bill, listen up. This is the biggest score of our careers." "It's a kid's birthday party, Molly." "It's the Henderson residence, Bill. A six-year-old's birthday with seventy-three registered guests. The manifest shows a minimum of forty wrapped presents, plus the piñata contents, which my intel suggests includes gift cards." Bill gulped. "Why would a six-year-old have gift cards?" "Because her grandmother loves her, Bill. Stay focused. We hit at 3:47 PM, when the cake is served and the adults are doing that thing where they pretend to enjoy sparkling wine." "What's my role?" "You are the distraction. You arrive dressed as a clown." "A clown?" "An entertainer, Bill. The parents are expecting one. You're wearing the suit, it's non-negotiable." Bill looked down at his oversized polka-dot ensemble. "I look ridiculous." "You look like a man with a dream and a squeaky nose. Now, your job is to herd the children into the bounce castle. Once they're contained—" "—which won't be easy, Molly, because kids bounce everywhere—" "—you activate the emergency horn. That's my signal. I slip in through the garage, neutralise the birthday father, and sweep the gift table." "Neutralise? You said neutralise?" "I meant distract. I distract the birthday father with a coupon for chimney cleaning. I've been practicing my chimney-cleaning pitch for three weeks, Bill." Molly pulled a tiny earpiece from her pocket and handed it to Bill. He stared at it like it was a jewel. "This is comms. Top of the line. Tiny Tim Industries." "I thought Tiny Tim made novelty birthday hats." "They diversified. Now they make surveillance equipment and novelty birthday hats. It's a strange company." Bill slipped the earpiece in. "Test, test. Am I coming through?" "You're coming through, Bill. Now, when you blow that horn, the chaos will be immense. Children screaming, parents panicking, the birthday mother reaching for her phone to call the police, which is why—" "—we unplug the landline." "Correct. The landline is unplugged. I will be in and out in ninety seconds." "What about the dog?" Molly paused. "What dog?" "The Hendersons have a dog, Molly. A terrier. I saw it on Instagram." "Did you read the room, Bill? The Instagram post said they got a fish." "It also said the fish died, Molly. They replaced it with a terrier named Captain Snuggles." Molly's eye twitched. "Captain Snuggles. Of course. A canine operative. This changes everything." "Should we abort?" "Abort? Bill, we didn't spend six hours sewing clown shoes for you to abort. We adapt. You distract Captain Snuggles with a treat." "What kind of treat?" "A steak treat. I have one in my pocket." Molly produced a small, suspicious-looking brown lump. Bill squinted at it. "Molly, is that a real steak?" "It's a steak treat, Bill. For the dog." "It looks like a real steak." "Everything looks like a steak when you're scared, Bill. Are you scared?" Bill looked at the clown suit. He looked at the brown lump. He looked at the blueprint of a six-year-old's birthday party. "Terrified," he said. "Good," said Molly, grinning. "Fear keeps you sharp. And the clown shoes fit you. Now let's go steal some presents from a bunch of toddlers."
created_at
"2026-06-01T12:16:03.221Z"
6/1/2026, 12:16:03 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
954
moderation_latency
(null)
generation_time
69860
tokens_prompt
77
tokens_completion
2328
native_tokens_prompt
236
native_tokens_completion
1944
native_tokens_completion_images
(null)
native_tokens_reasoning
1413
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00235032
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066cab0330fdac703a601c5a874cd3bb"
is_byok
false
latency
954
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316163-TuJVhRYg42A4ti0mT7zj"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316163-xkHCbLPAfSDP4h9d9BvQ"
upstream_id
"066cab0330fdac703a601c5a874cd3bb"
total_cost
0.00235032
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
557
31.24%
Dialogue to Total Word Ratio
Ratio: 74.82%, Deviation: 4.82%
neededClean
false
wordsTotal
568
wordsDialogue
425
15.6192%