NC Bench
Run: 2026-06-01-01KT1J1RD28RBMB3X31BG9NMDD
Run Details
Model: MiniMax M3
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 1m 41s
Cost: $0.004316
Provider: Minimax
Evaluation summary
Score     Evaluator
0.00%     Matches word count
0.16%     Dialogue to Total Word Ratio
0.0821%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the kitchen table, anchoring the corners with half-eaten pepperoni slices. Bill stood behind her, chewing thoughtfully. "So," Molly said, tapping the paper, "the target is a seven-year-old's birthday party. Saturday. Three p.m." "Bit beneath us, isn't it?" Bill asked. "We robbed a casino last month." "And got chased by a man with a gun. This is safer." "It's a children's party, Moll." "Exactly. No guns. No lasers. No tigers. Possibly a piñata, but I'm hoping for a blindfold situation." Bill considered this. "Fair point. What's the venue?" "Henderson residence. Three-story Victorian. Bouncy castle in the backyard. Clown already booked." "A real clown?" "Balloon-twisting clown. Distraction one. You show up as the clown's assistant." "Again with the clown, Moll." "You hate clowns, Bill. It's a perfect cover. You'll be vibrating with rage, and the parents will think it's method acting." Bill sighed deeply. "And you?" "I am the Fairy Godmother." "From Cinderella." "From the off-brand rental company on Fourth Street. Thirty-eight dollars an hour, includes wand and a broken tiara." Molly slid a photograph across the table. A mountain of wrapped boxes towered beneath a banner reading HAPPY 7TH JORDAN! "Presents," she whispered reverently. "Estimated value: twelve hundred dollars in gift cards alone." "Twelve hundred—" "And a bicycle. A Razor scooter. A drum set." "Drum sets are cursed, Moll. You know this. My uncle lost a toe to a drum set." "That's why you're taking it. You throw the drum set in the bouncy castle. Chaos. Parents distracted. Children screaming. I grab the presents, swap them for a decoy bag filled with wrapping paper and a single sock." Bill nodded slowly. "And the getaway?" "Jeremy." "Jeremy's eight, Moll." "Exactly. Too young to testify. I told him he was helping the Fairy Godmother, and he'd get to keep the gift bags. He cried with joy, Bill. It was beautiful." "Who is Jeremy?" "The birthday boy's best friend. 
I met him at the park. I told him I was a researcher studying the emotional weight of presents. He believes everything. Eight is gullible." Bill raised his pizza slice like a champagne glass. "And if it goes wrong?" "Plan B: you set the bouncy castle on fire." "With what?" "Rage, Bill. Pure clown rage." Molly leaned back, eyes glittering. "Saturday, we retire. We take the loot, buy a condo in Arizona, and we never, ever speak of the clown incident again." "To Jordan," Bill toasted. "To Jordan," Molly agreed. "And his very, very confused parents."
created_at
"2026-06-01T12:20:30.758Z"
6/1/2026, 12:20:30 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1736
moderation_latency
(null)
generation_time
101192
tokens_prompt
77
tokens_completion
4005
native_tokens_prompt
236
native_tokens_completion
3582
native_tokens_completion_images
(null)
native_tokens_reasoning
3308
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00431592
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066cac0eb2c6ea81170eeb32f5f0852d"
is_byok
false
latency
1736
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316430-VeC47cLnJ2iL0U2WE6KR"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316430-GVrbx8vOc2Q1aJx0vCSf"
upstream_id
"066cac0eb2c6ea81170eeb32f5f0852d"
total_cost
0.00431592
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
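The timing and token fields above can be cross-checked against the Duration shown in Run Details. The sketch below assumes that generation_time is reported in milliseconds (an assumption, though 101192 ms does match the 1m 41s duration) and that native_tokens_reasoning is counted as part of native_tokens_completion. The function names are illustrative and not part of NC Bench.

```python
def format_duration(ms: int) -> str:
    """Format a millisecond count as 'Xm Ys', dropping fractional seconds."""
    minutes, seconds = divmod(ms // 1000, 60)
    return f"{minutes}m {seconds}s"


def tokens_per_second(tokens: int, ms: int) -> float:
    """Average generation throughput over the whole request."""
    return tokens / (ms / 1000)


# Values copied from the metadata above.
generation_time_ms = 101192          # assumed to be milliseconds
native_tokens_completion = 3582
native_tokens_reasoning = 3308       # assumed to be a subset of completion tokens

duration = format_duration(generation_time_ms)
throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
reasoning_share = native_tokens_reasoning / native_tokens_completion

print(duration)
print(f"{throughput:.1f} tokens/s")
print(f"{reasoning_share:.1%} of completion tokens were reasoning")
```

Under those assumptions, most of the completion budget went to reasoning tokens rather than to the visible scene text.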
Evaluation details
Result    Evaluator                      Details                            Meta Data
0.00%     Matches word count             n/a                                neededClean: false, words: 416
0.16%     Dialogue to Total Word Ratio   Ratio: 81.32%, Deviation: 11.32%   neededClean: false, wordsTotal: 423, wordsDialogue: 344
0.0821%   Overall
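The Ratio and Deviation figures reported for the Dialogue to Total Word Ratio evaluator follow from its word counts and the 70% target in the scenario. A minimal sketch of that arithmetic is given here. The function name and the default target are assumptions for illustration. The page does not show how the evaluator counts dialogue words or how it maps the deviation to the 0.16% score, so neither step is reproduced.

```python
def dialogue_ratio(words_dialogue: int, words_total: int, target_pct: float = 70.0):
    """Return (ratio, deviation) in percent.

    ratio     = share of all words that are dialogue
    deviation = absolute distance of that share from the target
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio_pct = 100.0 * words_dialogue / words_total
    deviation_pct = abs(ratio_pct - target_pct)
    return ratio_pct, deviation_pct


# Word counts copied from the evaluator metadata above.
ratio_pct, deviation_pct = dialogue_ratio(words_dialogue=344, words_total=423)
print(f"Ratio: {ratio_pct:.2f}%, Deviation: {deviation_pct:.2f}%")
# prints "Ratio: 81.32%, Deviation: 11.32%"
```

The two evaluators report different totals (416 and 423 words), so the counts presumably come from different tokenization rules; the sketch simply takes the counts as given.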