NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1HJ1JFB5YWAP1ZMXA991SV
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
6m 31s
Cost:
$0.000985
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the overturned sandbox, her eyes wide with the kind of gleeful terror usually reserved for people about to do something extremely unwise. Bill waddled up beside her, a comically oversized poncho billowing around his lanky frame. "Tell me again why we're doing this," Bill whispered, adjusting the fake mustache that was already listing dangerously to the left. "Because," Molly hissed, "those children have so much loot, Bill. I've seen it. Mountains of presents. Entire economies wrapped in glittery paper and topped with aggressive ribbons." "Could we not do a bank? Banks are normal. Banks don't have jugglers." "Bill, focus. We need a distraction, a way in, and a very fast car." "Right. And the distraction?" Molly reached into her bag and produced a small cardboard box with great ceremony. "What's that?" "Twenty-four live frogs." "You brought frogs to a heist?" "I brought frogs to a children's party, Bill. There's a difference." Bill stared at the box. One of the frogs croaked mournfully. "So your plan is to release frogs at a group of screaming five-year-olds?" "Correct." "And in the chaos?" "We waltz in as the entertainment. I've brought stilts, a unicycle, and a very aggressive accordion." "You can play the accordion?" "I can play 'Twinkle Twinkle Little Star' and 'Happy Birthday' with aggressive vigor. That's all we need." Bill pinched the bridge of his nose. "What if a parent recognizes us?" "We're wearing wigs, Bill. I am currently a woman named Brenda. You are currently a man named— actually, I didn't pick a name for you." "Then I'm also Brenda." "Two Brendas?" "Chaos cousins." Molly considered this. "I like it. Two Brendas, here to perform at little Tyler's birthday party, here to absolutely destroy his presents, and here to vanish in a cloud of frog-induced pandemonium." "And the car?" "Parked two blocks away. I've taped a license plate to a Honda Civic that says HONK IF U LIKE TACOS." "You committed multiple vehicle crimes for this job?" "Bill, I am a professional." Bill sighed the sigh of a man who had made many poor life choices but was committed to this one. "Fine. What about the parents?" "Tyler is a Goldfish, Bill. He's a child. His parents are named Dave and Carol. Dave runs a Tupperware consultancy. Carol sells candles on the internet. They will be distracted by free entertainment and the promise of a photo." "A photo of us?" "A photo of their child with the frogs." Bill opened his mouth. Closed it. Opened it again. "I want a bigger cut." "You always want a bigger cut." "I brought the mustache." "The mustache is melting off your face, Bill." "Sixty percent." "Fifty-two." "Fifty-eight and I don't tell anyone about the time you cried at a puppet show." Molly's eyes narrowed. "That was a manipulative puppet. He knew too much." "Fifty-eight, and we never speak of the puppet again." "Deal. Now go put on your tutu." "My what?" "You're a clown, Bill. Clowns wear tutus." "I thought I was a Brenda." "Brendas can be clowns. Go." Bill trudged off toward the party, tutu flapping. Molly watched him go, clutched the frog box to her chest, and whispered, "Forgive me, little children. But those presents aren't going to steal themselves."
created_at
"2026-06-01T12:17:43.828Z"
6/1/2026, 12:17:43 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1028
moderation_latency
(null)
generation_time
43441
tokens_prompt
77
tokens_completion
967
native_tokens_prompt
236
native_tokens_completion
806
native_tokens_completion_images
(null)
native_tokens_reasoning
82
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00098472
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066cab677803b8121beefab40311ad1a"
is_byok
false
latency
1028
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316263-WSOSUDmo4U8pvc0M3T1k"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316263-MkLR0NmbVn20jrwCw1JG"
upstream_id
"066cab677803b8121beefab40311ad1a"
total_cost
0.00098472
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
542
0.00%
Dialogue to Total Word Ratio
Ratio: 73.95%, Deviation: 23.95%
neededClean
false
wordsTotal
549
wordsDialogue
406
0.0000%