NC Bench
Run: 2026-06-01-01KT1GNC86287JTKRHBKJV13SJ
Run Details
Model: MiniMax M3
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 6m 23s
Cost: $0.022806
Provider: Minimax
Evaluation summary
Matches word count: 99.74%
Dialogue to Total Word Ratio: 77.16%
Overall: 88.4522%
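The Overall figure appears consistent with an unweighted mean of the two evaluator scores. The page does not state how Overall is computed, so the averaging rule below is an assumption. It is a minimal sketch that only checks the displayed numbers against each other:

```python
# Evaluator scores as displayed on the page (rounded to two decimals).
scores = {
    "Matches word count": 99.74,
    "Dialogue to Total Word Ratio": 77.16,
}

# Assumption: Overall is the plain arithmetic mean of the evaluator scores.
overall = sum(scores.values()) / len(scores)

reported_overall = 88.4522  # value shown on the page
print(f"mean of displayed scores: {overall:.2f}%")
print(f"difference from reported: {abs(overall - reported_overall):.4f}")
```

The small remaining difference would be expected if the page averages unrounded scores and only rounds them for display.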
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly adjusted her cat-eye mask in the rearview mirror, tongue pressed against her cheek as she studied the glossy flyer. The car smelled like stale coffee and bad decisions, which was, frankly, their signature scent. "Okay, recap time," she said, tapping the paper. "Six-year-old birthday brat, bouncy castle, thirty kids, three parents, one clown who looks like he's on something interesting. What did I miss?" Bill squinted at the map, mustache twitching. "The clown's called Bumbles. He does balloon animals and also, allegedly, taxes." "He does what?" "Don't ask. We go in at cake time. Every adult's eyes on the candles, not on us." Molly grinned, drumming the steering wheel. "And the haul?" "Forty-two presents, minimum. Mom went feral with the credit card. I saw the statement." "You saw her credit card statement. You absolute peach." Bill shrugged, butter wouldn't melt. "I have a gift." "Right. So what's the angle? Clown costume? Caterer? Distraught aunt?" Bill held up a finger, expression grave as a heart attack. "Bouncy castle repairman." "Bill. There is no bouncy castle damage." "There will be." Molly stared at him for a long, beautiful moment. Then she started laughing, the kind of laugh that makes your ribs ache and your mascara reconsider its entire career. "You're going to stab the bouncy castle?" "Strategically. A small, dignified puncture. Children evacuate in screaming waves. We sweep in, grab the gift table, and vanish before anyone blows out the candles. Clean. Surgical. Beautiful." "You've put an alarming amount of thought into this." "I had a long lunch. And a longer afternoon." She wiped her eyes, still giggling like a fool. "Okay. I'm in. But I have conditions." "Name them." "First: we leave the clown alone. He's had a rough life. I can tell." "Agreed. Solidarity with Bumbles." "Second: no kids. We are not traumatizing six-year-olds for a Furby and some Legos." "Scout's honor." "You were never a scout." "I could have been. Anyway, agreed. No kids." 
She leaned back, considering. "Third. If there's a pony at this party, we do not take the pony. I want that absolutely clear." "You take the pony out of the heist, I take the pony out of the heist. Pony's safe." "Good. Good." She paused, turning toward him. "Bill." "Molly." "Why do we do this?" He looked at her, eyes soft beneath the ridiculous black mask. "Because some people have too many gifts, Moll. And some people have too few. Also because it's hilarious." "Also because it's hilarious," she agreed quietly. They bumped fists over the center console, a sacred ritual. "Now," she said, cracking her knuckles. "Tell me more about Bumbles and the taxes." "Honestly, I think he's running a side hustle. Very hush-hush. The IRS doesn't know which parties to audit." "Oh my God." "Get ready, Moll. Cake time waits for no thief." She started the engine, grinning like a wolf eyeing a field of very small, very rich sheep. "Cake time, baby. Let's go rob a six-year-old."
Meta
created_at: "2026-06-01T11:56:16.526Z" (6/1/2026, 11:56:16 AM)
model: "minimax/minimax-m3-20260531"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 977
moderation_latency: (null)
generation_time: 382756
tokens_prompt: 77
tokens_completion: 16818
native_tokens_prompt: 236
native_tokens_completion: 18969
native_tokens_completion_images: (null)
native_tokens_reasoning: 16009
native_tokens_cached: 114
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.02280624
router: (null)
provider_responses:
  0:
    endpoint_id: "6851b352-a7e2-45ad-a69e-67ed1208a416"
    id: "066ca660d9580aa16fddfedbc2ca58f8"
    is_byok: false
    latency: 977
    model_permaslug: "minimax/minimax-m3-20260531"
    provider_name: "Minimax"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer: (null)
request_id: "req-1780314976-NplRk7b0FcJicIPS2jtI"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1780314976-GpCVoIIsqBHf1BBpiNLi"
upstream_id: "066ca660d9580aa16fddfedbc2ca58f8"
total_cost: 0.02280624
cache_discount: 0.00005472
upstream_inference_cost: 0
provider_name: "Minimax"
response_cache_source_id: (null)
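The timing fields can be cross-checked against the Duration shown under Run Details. The page does not state units, so the sketch below assumes generation_time is in milliseconds. Under that assumption it converts the value to minutes and seconds and derives a rough completion throughput:

```python
# Values copied from the metadata above.
generation_time_ms = 382756       # assumption: milliseconds
native_tokens_completion = 18969  # provider-native completion token count

seconds = generation_time_ms / 1000
minutes, secs = divmod(round(seconds), 60)

# Rough throughput over the whole generation, including any reasoning tokens
# that the provider counts toward the completion total.
tokens_per_second = native_tokens_completion / seconds

print(f"{minutes}m {secs}s")  # 6m 23s
print(f"{tokens_per_second:.1f} native completion tokens/s")
```

The converted value agrees with the reported Duration of 6m 23s, which supports the millisecond assumption but does not confirm it.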
Evaluation details
Matches word count: 99.74%
Details: n/a
Meta Data: neededClean = false, words = 496
Dialogue to Total Word Ratio: 77.16%
Details: Ratio: 67.72%, Deviation: 2.28%
Meta Data: neededClean = false, wordsTotal = 505, wordsDialogue = 342
Overall: 88.4522%
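The reported Ratio and Deviation for the dialogue evaluator can be reproduced from wordsDialogue and wordsTotal, taking the 70% target from the scenario. The counting helpers below are hypothetical: they assume dialogue means text inside straight double quotes and that words are whitespace-separated, which may differ from what the NC Bench evaluator actually does. The mapping from deviation to the 77.16% score is not shown on the page and is not reproduced here.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated words (hypothetical tokenization)."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed dialogue rule)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target_pct: float = 70.0) -> tuple[float, float]:
    """Return the dialogue percentage and its absolute distance from target."""
    ratio = 100.0 * words_dialogue / words_total
    return ratio, abs(target_pct - ratio)


# Counts as reported in the evaluator's Meta Data.
ratio, deviation = ratio_and_deviation(342, 505)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# Ratio: 67.72%, Deviation: 2.28%

# The helpers applied to a tiny made-up sample, not to the model response.
sample = 'She said, "Go now." He nodded.'
print(count_dialogue_words(sample), count_words(sample))  # 2 6
```

The two evaluators report different totals (496 versus 505 words), which suggests they do not share the same word-splitting rule, so a simple whitespace split should not be expected to match either exactly.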