NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1H6SAJJ05C20YHD3E4KZMQ
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
16m 43s
Cost:
$0.061522
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
6.09%
Matches word count
(details)
99.14%
Dialogue to Total Word Ratio
(details)
52.6132%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind a mossy garden gnome, binoculars trained on the inflatable bouncy castle billowing in the back garden of forty-two Willow Lane. A flag bearing the words "Happy 7th Birthday, Poppy!" flapped in the gentle breeze. Bill crouched beside her, nervously eating a sandwich that was definitely not part of their operational cover story, and was absolutely not making crinkly noises that could be heard three houses down by the Hendersons' cat, who had begun to regard them with deep suspicion. "Target acquired," Molly whispered into her earpiece. "Birthday party. Fourteen sugar-fueled children. One exhausted mother, currently wielding a spatula like a weapon. Bill, are you even listening?" "Multitasking," Bill whispered back through a mouthful of bread. "You said the rendezvous point had snacks." "I said it had cover." Molly snatched the remaining crust and bit into it with the slow fury of a woman betrayed, repeatedly and exclusively, by dairy products. "Look at that mountain of presents." "So what is the play?" Molly unfolded a crumpled blueprint, smoothing it across the lawn with the reverence normally reserved for sacred texts, and outlined the operation in rapid, hissing whispers. "You approach the birthday girl and ask about her favorite dinosaur. She will, statistically, talk for at least forty-five minutes." Bill considered this, mayonnaise glistening at the corner of his mouth. "And then?" "I crawl through the dog flap, load the wheelbarrow in the garage, and slip out behind the recycling bins." "Why a wheelbarrow?" "Because our getaway car is at the detailers." "Today?" "Today is when the presents are, Bill. Present, if you will." She pinched the bridge of her nose, a small sigh escaping her lips. "Please ignore that pun. It is beneath me." A shriek of laughter erupted from the party as a small child in a Superman cape faceplanted directly into a bowl of trifle, sending sponge, custard, and one rogue raspberry cascading spectacularly across the patio. Molly used the resulting chaos to scale the fence, her boots silent as a librarian's sigh upon the wooden slats, her silhouette slipping over the top like a particularly determined shadow. Bill remained behind, clutching the dregs of his sandwich like a beloved, mildly sticky toy, his eyes fixed on the trifle disaster with the quiet admiration of a man who genuinely appreciated chaos—who had once been a wedding photographer, and was only now discovering his true calling in the noble art of heisting alongside the most exasperating woman he had ever met. "Molly," he hissed into his misaligned sleeve microphone. "What if the piñata fights back?" "What if it does?" Molly murmured, one leg already hooked over the top, the other dangling into the enemy garden. She paused, considering whether retirement, or perhaps a quiet life opening a teashop in the Cotswolds, might be a more reasonable career move than another decade of Bill.
created_at
"2026-06-01T12:05:46.967Z"
6/1/2026, 12:05:46 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
900
moderation_latency
(null)
generation_time
1002580
tokens_prompt
77
tokens_completion
42157
native_tokens_prompt
236
native_tokens_completion
51254
native_tokens_completion_images
(null)
native_tokens_reasoning
41395
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.06152232
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca89acb43c85708ac6618bec3aa9b"
is_byok
false
latency
900
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780315546-I83PTk5WmQvH7WP3E8x8"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780315546-KCqRdV9DXJo7x2yYwNf3"
upstream_id
"066ca89acb43c85708ac6618bec3aa9b"
total_cost
0.06152232
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
6.09%
Matches word count
n/a
neededClean
false
words
477
99.14%
Dialogue to Total Word Ratio
Ratio: 30.42%, Deviation: 0.42%
neededClean
false
wordsTotal
480
wordsDialogue
146
52.6132%