NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1HTFBH9HSG0J5R7HJ3ZC9P
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
43.0s
Cost:
$0.000949
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
6.09%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
3.0453%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his fake mustache in the rearview mirror, frowning at his reflection. The mustache was pink, because Molly had insisted on "authentic clown aesthetic," and he looked like a disgraced accountant who'd lost a bet. Across the van, Molly was struggling into a pair of giant inflatable shoes that made her look like a baby dinosaur trying to enter law school. "So let me get this straight," Bill said, tapping the steering wheel. "We dress as clowns. We crash a six-year-old's birthday party. We make balloon animals until the parents are sufficiently distracted, and then we make off with approximately thirty-seven thousand dollars' worth of loot?" "Thirty-eight, if you count the American Girl doll," Molly corrected, hopping on one foot. "I have a buyer in Hoboken. Guy named Sal. He's a wigmaker, very specific clientele." "And Sal doesn't know the doll was obtained from a screaming child named Lily?" "Her name is Esmeralda, and she won't even remember. I saw the YouTube vlog. She cries at everything. Once she cried because a butterfly landed on her hand and it was "too much beauty for one day." Bill pinched the bridge of his nose beneath the ridiculous nose. "I have so many questions about that vlog." "Focus, Bill." Molly finally mastered the shoes and did a little shuffle. "How do I look? Festive? Harmless? Like someone who definitely won't be making off with a haul of Legos and a karaoke machine?" "You look like a liability in a tutu." "It's a clown ruffles, not a tutu. There's a difference, and I will die on this hill." Bill sighed and pulled a crumpled list from his pocket. "Okay. Phase one. We arrive as the entertainment the mom hired from Craigslist, because apparently people do that." "Desperate people," Molly agreed. "Phase two. We dazzle them with our unparalleled skills." "I did practice the poodle. I cannot do a giraffe. The neck is structurally impossible." "Phase three. Mom goes inside to heat up the hot dogs. Dad is passed out in the garage. Phase four. We grab the gift table and skedaddle." "What about Esmeralda?" "What about her?" "She's the crown jewel, Bill. The crème de la crème. She has a removable dress and everything." Bill stared at his partner. "You have a buyer in Hoboken for a doll with a removable dress." "Don't kink-shame, Bill. Sal is a professional." They sat in silence for a moment, two grown adults in a van, wearing clown noses, planning to ruin a child's birthday. "Molly," Bill said quietly. "Bill." "After this, we need to talk about our career choices." "After this, we're going to be rich. You can buy a better mustache." "I want a real mustache. A normal one. I'm tired of the disguise life." She reached over and squeezed his shoulder with a giant foam hand. "When this is over, I'm going to get you a nice brown one. Maybe even handlebar." "With wax?" "With the good wax." Bill nodded, satisfied. He turned the key in the ignition. Somewhere, a small girl named Esmeralda was about to have the worst birthday of her short, over-documented life.
created_at
"2026-06-01T12:16:32.12Z"
6/1/2026, 12:16:32 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
982
moderation_latency
(null)
generation_time
42957
tokens_prompt
77
tokens_completion
922
native_tokens_prompt
236
native_tokens_completion
776
native_tokens_completion_images
(null)
native_tokens_reasoning
114
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00094872
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066cab20883ae30146bb1677c7da6eba"
is_byok
false
latency
981
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316192-W15K2A7UpM6RPdHmkmDk"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316192-oylKkzCA6fvQbDyDubwQ"
upstream_id
"066cab20883ae30146bb1677c7da6eba"
total_cost
0.00094872
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
6.09%
Matches word count
n/a
neededClean
false
words
523
0.00%
Dialogue to Total Word Ratio
Ratio: 45.18%, Deviation: 24.82%
neededClean
false
wordsTotal
529
wordsDialogue
239
3.0453%