NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNF1ZKCQ1FJTSQ3DR24BYVD
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
40.0s
Cost:
$0.000924
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
15.67%
Dialogue to Total Word Ratio
(details)
57.7056%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly checked the address on her phone again and grinned. "This is it. Little Timmy's sixth birthday party. Three hundred dollars in gift cards, a brand new tablet, and fifteen envelopes from relatives who don't know what kids actually want." Bill adjusted his fake mustache, which kept sliding toward his left ear. "I still think we should go as clowns. Everyone trusts a clown." "Nobody trusts a clown, Bill. That's literally the plot of every horror movie." "Exactly! They don't expect the clown to rob them. It's perfect misdirection." Molly pulled out a crumpled diagram she'd drawn on a napkin. "Listen, I've been studying the layout. The presents go in the living room. Parents hover around the cake in the kitchen. The kids are in the backyard doing some kind of... what is that, piƱata time?" Bill nodded seriously. "Crucial intelligence." "The window by the powder room is unlocked. I checked the Zillow photos. We slip in, grab the loot, slip out. Fifteen minutes, tops." "What about the clown nose?" Bill asked, holding up a bright red nose. "I feel like it completes the look." Molly pinched the bridge of her nose. "We are not going as clowns. We're going as... concerned relatives. You can be a weird uncle. I'll be the aunt who smells like peppermint and asks inappropriate questions." "I don't know how to ask inappropriate questions." "Just ask why they don't have a real job yet. Ask about their ex. Works every time." Bill scribbled notes on his arm. "Weapons?" "Charm and desperation. And this." Molly held up a small device. "Noise maker. If anyone asks, it's a whoopee cushion. Classic icebreaker." "I thought we weren't doing the clown thing." "We're not. Whoopee cushions are universal. Everyone laughs. Then we laugh. Then we leave with everything." A car pulled into the driveway. A woman emerged with two toddlers and a massive wrapped box. "That's our cue," Molly whispered. "Act normal." Bill straightened his tie, which was visibly inside out. "Do I look like a guy with a mortgage?" "You look like a guy who steals children's presents, which, to be fair, you are." They walked toward the house with the confidence of people who absolutely belonged there. The woman smiled at them. "Oh, you must be Timmy's Uncle What's-His-Name from Minnesota!" "That's me," Bill said, shaking her hand too enthusiastically. "I'm great-uncle something. From Minnesota." Molly patted the woman's shoulder. "And I'm his wife. We drove eighteen hours for this." "Eighteen hours," Bill repeated. "The car smells like beef jerky now." The woman laughed and ushered them inside, completely unaware that she had just invited professional thieves into her home to case the joint. Or at least, she would be unaware until about forty-five minutes later when she discovered that every present was gone and the only thing left was a business card that read "Molly & Bill: If You Can Read This, You Should Have Bought a Safe."
created_at
"2026-02-17T09:32:24.838858+00:00"
2/17/2026, 9:32:24 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1989
moderation_latency
(null)
generation_time
39944
tokens_prompt
77
tokens_completion
893
native_tokens_prompt
101
native_tokens_completion
745
native_tokens_completion_images
(null)
native_tokens_reasoning
106
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0009243
router
(null)
provider_responses
0
id
"05e3688024a2e568189fa7b8436c1fd0"
status
200
is_byok
false
latency
1989
endpoint_id
"3afb00c4-6135-4c55-96e9-6befa9679b06"
provider_name
"Minimax"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771320704-Q8l30Ag0YYZjTJ9J5Klg"
upstream_id
"05e3688024a2e568189fa7b8436c1fd0"
total_cost
0.0009243
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Minimax"
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
496
15.67%
Dialogue to Total Word Ratio
Ratio: 63.91%, Deviation: 6.09%
neededClean
false
wordsTotal
496
wordsDialogue
317
57.7056%