NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1HV3Z25PDPSCX9HJWNW92V
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
38.9s
Cost:
$0.000825
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his novelty Groucho Marx glasses in the rearview mirror and turned to his partner in crime. "So, are we really doing this? A children's birthday party? That's a new low, even for us." Molly snorted, pulling her hair into a tight bun beneath a baseball cap. "Bill, we stole a live turkey from a police station on Thanksgiving. There's no 'low' left for us. We're already in the basement of low." "That was actually pretty impressive," Bill admitted, stroking his mustache thoughtfully. "The turkey union still sends me Christmas cards." "Focus." Molly slapped a blueprint onto the dashboard. It was clearly drawn on the back of a Happy Meal bag. "The Henderson kid is turning seven. There will be at least forty wrapped presents, all in one location, all unattended during the piñata chaos." "Piñata chaos?" "It's a thing, Bill. I've done the research. Adults lose all rational thought when candy starts falling from the sky. It's like a feeding frenzy at SeaWorld, but with more screaming children and diabetes." Bill squinted at the McDonald's logo on the corner of the blueprint. "Did you draw this on a McDonald's bag?" "Steaksauce extra value meal. We don't have the budget for proper plotting paper, Bill. The man who buys our disguises out of the back of his van doesn't take American Express." "What about the costumes?" Molly grinned, revealing a set of plastic vampire fangs. "I'm going as a clown who got lost on the way to a different party. You're going as..." She handed him a frilly pink dress. "No." "Bill. You have the legs for it." "I most certainly do not." "Every man has the legs for it if he believes in himself." She patted his knee. "You'll be Mrs. Butterworth. I'll be the clown. We'll walk right in, compliment the cake, and wait for the piñata to start spinning. Once the parents dive for the floor like they're dodging mortar fire, we grab the presents and leave through the bounce house." "The bounce house?" "Emergency exit. I've tested it. You just sort of... roll." Bill sighed and held up the dress. "If anyone from my book club sees me in this, I'm blaming you." "Bill, you don't have a book club." "I'm starting one. And the first book we're reading is about dignity and self-respect." Molly started the engine. "We'll have plenty of time for self-respect after we're living large in a condo paid for by Hot Wheels and gift cards to Build-A-Bear." "Build-a-Bear," Bill whispered reverently. "Do you know how much those gift cards go for on the black market?" "Enough to buy a lot of dignity, Bill. Enough to buy all the dignity."
created_at
"2026-06-01T12:16:53.222Z"
6/1/2026, 12:16:53 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
916
moderation_latency
(null)
generation_time
38908
tokens_prompt
77
tokens_completion
774
native_tokens_prompt
236
native_tokens_completion
651
native_tokens_completion_images
(null)
native_tokens_reasoning
87
native_tokens_cached
114
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00082464
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066cab3571a74e5d1695d90ffe21e7cb"
is_byok
false
latency
916
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316213-rQvVmanTsq4tjNMEMUhX"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316213-mRWJ5ootCDbBefE7T72s"
upstream_id
"066cab3571a74e5d1695d90ffe21e7cb"
total_cost
0.00082464
cache_discount
0.00005472
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
449
0.00%
Dialogue to Total Word Ratio
Ratio: 77.58%, Deviation: 27.58%
neededClean
false
wordsTotal
455
wordsDialogue
353
0.0000%