NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1HRB33FYCBQGVG9M013BTR
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
10m 2s
Cost:
$0.033079
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
99.98%
Matches word count
(details)
64.89%
Dialogue to Total Word Ratio
(details)
82.4354%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the municipal planter box, binoculars pressed to her eyes, surveying enemy territory. Across the street, a bouncy castle the size of a small apartment billowed in the breeze, its colors practically screaming "steal me." A banner above the front door read "Happy 6th Birthday Bartholomew III" in cursive that had probably cost more than her rent. Bill materialized beside her, fully committed to his disguise as a man enjoying a peaceful Tuesday morning. He looked like a retired dentist, which was impressive given his actual résumé of convenience stores, racetracks, and one unfortunate incident involving a llama. "You see that bouncy castle?" Molly asked. "Hard to miss," Bill said. "I also see a clown." "The clown is irrelevant. Clowns are underpaid and over-caffeinated." "That's not the intel you think it is." Molly tucked the binoculars into her suspiciously clinking handbag and produced a crumpled napkin covered in what could generously be called notes. The handwriting suggested she had not slept in three days, which tracked with her general approach to heist planning. She smoothed the napkin against her knee, ignoring the smudge that might have been chocolate or might have been a previous victim's blood. There was no way to know. "Here's the play," Molly said. "I enter as the replacement entertainer. I do a magic trick, create a diversion, you take the van around back." "Twelve minutes," Bill said. "That's the window?" "That's the window." "And we steal what?" "A pony." Molly watched the words land on Bill like tiny anvil-shaped disappointments. His left eye twitched, which she had learned to recognize as his "I have concerns but I'm going to say yes anyway" tell. It was, historically, how they had ended up stealing a giraffe from a safari-themed wedding reception in 2019. They had not spoken of the giraffe since, by mutual agreement, several cease-and-desist letters, and one very awkward run-in at a farmer's market. "A miniature pony, Bill. A horse the size of a Labrador, currently grazing on thousand-dollar grass." "We're stealing a pony." "We're stealing a pony and approximately forty-seven wrapped boxes of undisclosed value." Bill pinched the bridge of his nose in a way that suggested he was calculating the exact cost of his conscience. The number, as always, came up short. "And the children?" "What about them?" "Twenty of them, all hopped up on sugar and cake?" "And that," Molly said, "is why we work fast." Molly straightened her scarf, tucked a spare silk handkerchief into Bill's breast pocket, and adopted the expression of a woman who had never stolen anything in her life. "Twelve minutes," she said. "Showtime in twenty. Try not to look so guilty." "I'm regretful. There's a difference." "Not to the parole officer there isn't. Now come on." Together they strolled toward the most lucrative toddler birthday party in the tristate area, arm in arm, looking for all the world like concerned relatives and absolutely not like two people about to commit the most ridiculous heist of their careers.
created_at
"2026-06-01T12:15:22.218Z"
6/1/2026, 12:15:22 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1357
moderation_latency
(null)
generation_time
601785
tokens_prompt
77
tokens_completion
23813
native_tokens_prompt
236
native_tokens_completion
27530
native_tokens_completion_images
(null)
native_tokens_reasoning
23014
native_tokens_cached
114
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.03307944
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066caada2ebbc598537cfd2d80ec6f15"
is_byok
false
latency
1357
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316122-kt4adG3HbsQsCYCsr7hT"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316122-05lJ8z4YeHlMgW6o4Abd"
upstream_id
"066caada2ebbc598537cfd2d80ec6f15"
total_cost
0.03307944
cache_discount
0.00005472
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.98%
Matches word count
n/a
neededClean
false
words
502
64.89%
Dialogue to Total Word Ratio
Ratio: 32.94%, Deviation: 2.94%
neededClean
false
wordsTotal
510
wordsDialogue
168
82.4354%