NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNEVYHP3RN0E9PTV73EFQ2V
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
7m 48s
Cost:
$0.034293
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
50.0002%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
A dim moon hung low over the cul‑de‑sac, its pale light spilling onto the pastel façade of the Hendersons’ house. Banners fluttered in the night breeze, announcing a child’s sixth birthday, and the street was lined with balloons that bobbed like nervous spies. Inside, a mountain of brightly wrapped presents sat beneath a frosting‑topped cake, and the hum of parents’ chatter drifted through the cracked open window. Molly crouched behind the trash cans, her eyes flicking to the back gate, while Bill adjusted his mask and whispered, “Ready when you are.” Bill checked his watch. Molly hissed, 'First, we need to disable the garden sprinkler timer. It's set to spray at nine, and the water will soak our rope.' Bill replied, 'Got it. I'll clip the timer with the USB‑C cable we stole from the office. That'll freeze the spray and keep the rope dry.' The night was cool, and a distant car horn sounded. Molly whispered, 'Next, we masquerade as the balloon clown. I’ve got a fake red nose and a pair of oversized shoes from the costume box in the attic.' Bill murmured, 'Great. The clown gets the kids to gather around the piñata, while we slip the gift bags into the van parked around the corner.' A dog barked, but it was only the neighbor’s poodle. Molly giggled, 'When the piñata breaks, the candy will explode outward. We’ll catch the loot in a net I nicked from the gymnastics class.' Bill nodded, already visualizing the net unfurling. Bill said, 'After the net, we’ll grab the biggest box. It's labeled "For Emma" but I bet it's full of video games and a drone.' Bill frowned. Molly whispered, 'We'll need a distraction, so I'll set off the smoke detector in the kitchen with a incense stick from the birthday party loot.' Bill muttered, 'We'll disable the alarm by shorting the hallway panel. I brought a cheap multimeter and a roll of duct tape.' Molly said, 'Once the lights go out, we slip through the back door. I’ll have the rope ladder ready, and you’ll climb the fence to the van.' Bill whispered, 'If anything goes wrong, we’ll improvise. I’ve got a bag of fake moustaches and a pack of glitter bombs for a quick escape.' Molly rolled her eyes. Molly hissed, 'We should also grab the glitter cannon from the party favor table, because nothing says subtle getaway like a burst of rainbow glitter in the security guard's eyes, and we can use it to blind the cameras later.' Bill laughed, 'If the glitter fails, I’ve hidden a tiny EMP device in the birthday card, which will fry the DJ’s speaker system and give us a five‑second window to slip the loot into the van unnoticed for us today.' Molly whispered, 'Let’s also snag the extra juice boxes, because nothing fuels a high‑speed chase like a sugar rush, and we can use the boxes to power our portable lights while we haul the presents out in the van quickly.'
created_at
"2026-02-17T09:36:15.884258+00:00"
2/17/2026, 9:36:15 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4225
moderation_latency
(null)
generation_time
468248
tokens_prompt
77
tokens_completion
28648
native_tokens_prompt
101
native_tokens_completion
28552
native_tokens_completion_images
(null)
native_tokens_reasoning
27914
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0342927
router
(null)
provider_responses
0
id
"05e367ba1ae7a93695920032906e50ab"
status
200
is_byok
false
latency
4225
endpoint_id
"3afb00c4-6135-4c55-96e9-6befa9679b06"
provider_name
"Minimax"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771320506-e7JXlCYxTFM6lX5CZWVJ"
upstream_id
"05e367ba1ae7a93695920032906e50ab"
total_cost
0.0342927
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Minimax"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
0.00%
Dialogue to Total Word Ratio
Ratio: 54.20%, Deviation: 15.80%
neededClean
false
wordsTotal
500
wordsDialogue
271
50.0002%