NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1HCDWFJKZKFA0G0QFDXT1J
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
48.5s
Cost:
$0.001036
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
1.70%
Dialogue to Total Word Ratio
(details)
0.8496%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the garden hedge, binoculars pressed to her face, and surveyed the enemy territory with the grim determination of a woman who hadn't slept since Tuesday. Pink bunting flapped in the breeze. A bouncy castle threatened structural collapse. Somewhere, a clown honked. "Right," she whispered, tapping her clipboard. "Entry point, exit strategy, alibi, and most importantly—dress code. We cannot be underdressed for a six-year-old's birthday." Bill emerged from the bushes wearing a bright purple waistcoat and a Groucho Marx disguise that he'd clearly slept in. A bee circled his head. "I look fantastic," he said, adjusting the nose. "Nobody suspects the funny uncle." "Nobody suspects the funny uncle because he doesn't have crumbs in his beard and a parking ticket from 2019." Bill brushed himself off with wounded dignity. "That ticket was a misunderstanding." "It was three hundred pounds and you parked on a roundabout." Molly snatched the glasses away from her eyes. "Focus, Bill. We have a window of nine minutes between the piñata and the gift opening. That's our entire window." "Nine minutes to loot a child's party." Bill let out a low, reverent whistle. "This is the big one, isn't it?" "This is the medium one. The big one is the bat mitzvah in November. Now listen. I'm going in as the magician." Bill blinked. "You're not a magician." "I'm not a *good* magician. I'm a magician who's very bad at magic. Parents love that. Builds character in the children." She pulled a wand from her sleeve and it snapped immediately. "See? Authentic." "And me?" "You're the piñata man. You break the piñata, and while the children are distracted by the screaming sugar rush, I sweep the gift table. Then we vanish into the chaos." Bill frowned beneath his plastic nose. "What if there's a guard dog?" "There's no guard dog, Bill. It's a party for a child called Poppy who's turning six and her parents are vegans." "Vegans are unpredictable." "Vegans are exhausted. Trust me, I've done the research. The mother cried during a John Lewis advert. She won't fight us." Bill considered this, scratching his chin through the disguise. "What's our cut?" "Sixty-forty." "Forty?" "Forty *and* you get the giant teddy bear. I've seen it, Bill. It's the size of a small child. It'll change your life." He stroked the fake mustache thoughtfully. "What about the clown?" "What about the clown?" "Real clowns are usually ex-military. I've seen the files." Molly looked up from her clipboard. Bill was, against all odds, occasionally right about these things. "Fine. If the clown approaches, we deploy the emergency protocol." "Which is?" "You do a very bad magic trick and I scream." Bill considered this. "I can do a very bad magic trick." "You can do an excellent one. That's the problem. If the clown sees actual magic, he'll know we're professionals." A long silence settled between them as a distant child shrieked with laughter. A balloon escaped into the sky. "Right," Molly said, checking her watch. "Positions in five. And Bill?" "Yes?" "Leave the bee alone. He's seen too much." Bill nodded solemnly. The bee, sensing it had been promoted to witness, flew away to inform its hive of the heist of the century. The pink bunting fluttered. Somewhere, a parent yelled for more hummus. The clown honked again, mournful and low, and Molly thought, not for the first time, that crime was simply poetry with better snacks.
created_at
"2026-06-01T12:08:51.86Z"
6/1/2026, 12:08:51 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1030
moderation_latency
(null)
generation_time
48524
tokens_prompt
77
tokens_completion
1025
native_tokens_prompt
236
native_tokens_completion
849
native_tokens_completion_images
(null)
native_tokens_reasoning
106
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00103632
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca9531a61e518be50b550fd3b4f01"
is_byok
false
latency
1030
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780315731-8Y26m1zvPhnTqhxGDLeH"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780315731-WU0juO1IBjNixjXtgcRj"
upstream_id
"066ca9531a61e518be50b550fd3b4f01"
total_cost
0.00103632
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
573
1.70%
Dialogue to Total Word Ratio
Ratio: 59.03%, Deviation: 9.03%
neededClean
false
wordsTotal
576
wordsDialogue
340
0.8496%