NC Bench
Run: 2026-02-17-01KHNFBEA1H8D8ND3CTJKC0MBA
Run Details
Model: Minimax M2.5
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3m 50s
Cost: $0.016440
Provider: Minimax
Evaluation summary
Matches word count: 99.98%
Dialogue to Total Word Ratio: 80.02%
Overall: 90.0004%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
A dim streetlamp flickered over the suburban cul-de-sac, casting long shadows across the pastel decorations of the Hendersons' backyard. The inflatable castle sagged in the breeze, and the smell of birthday cake drifted from the open kitchen window. Molly: "Bill, here's the plan. The party starts at three, and the parents will be busy with the clown and the magician until five. We slip in through the back gate, hide behind the bouncy castle, and wait for the kids to tear into the presents." Bill: "The back gate is padlocked, but I've got a pair of bolt cutters in my bag. Also, the neighbor's dog barks at strangers, so we need to bring a treat to keep it quiet. We'll move fast once the cake is cut and run quickly." Molly: "We can use the piñata as a distraction. When the kids start hitting it, the parents will gather around, giving us a clear path to the present table. I'll wear the clown wig and hand out extra candy to keep them focused and laugh loudly." Bill: "Once the cake is sliced, the presents will be stacked near the stage. I'll grab the heavy ones while you sweep the small gifts into a duffel bag. We'll need to be out of sight before the birthday song ends and dash to the van." Molly: "The side alley leads to the parking lot where our van is hidden behind the trash bins. We'll slip through the fence, toss the bags in the back, and drive off before anyone realizes the presents are missing and speed away without turning back quickly." Bill: "If any child spots us, we'll claim we're the entertainment's assistants. I'll flash a fake ID and you will hand out stickers. The ruse should buy us enough time to load the last of the loot and then disappear before the parents start asking questions." Molly: "After we leave, we'll meet at the abandoned warehouse on Maple Street. We'll sort the toys, keep the ones that fetch a high price, and donate the rest to a charity box to cover our tracks and pretend we were just generous donors for now." 
Bill: "Check your gadgets: the bolt cutters, the candy stash, the fake moustache, and the van keys. We'll synchronize our watches to the second the clown finishes his act, then move in sync like a well‑oiled machine and make a clean getaway without leaving clues behind." Molly slipped the bolt cutters into her belt, while Bill adjusted the moustache in the mirror of a parked car. They glanced at the house, noting balloons bobbing in the wind and laughter spilling onto the lawn. The plan was set; timing precise. As the clown's final trick ended, they moved like shadows, slipping through the gate and blending with the party's chaos. The streetlight flickered, casting a glow over the back fence as they vanished into the night, the weight of the loot still humming in their pockets. They plan the next heist.
created_at
"2026-02-17T09:40:44.460491+00:00"
2/17/2026, 9:40:44 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1852
moderation_latency
(null)
generation_time
229506
tokens_prompt
77
tokens_completion
14278
native_tokens_prompt
101
native_tokens_completion
13675
native_tokens_completion_images
(null)
native_tokens_reasoning
13067
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0164403
router
(null)
provider_responses
0
id
"05e369b65a168978dee77d3ae96cdf25"
status
200
is_byok
false
latency
1852
endpoint_id
"3afb00c4-6135-4c55-96e9-6befa9679b06"
provider_name
"Minimax"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771321014-P1nHGWoiboYrgPs9Lusd"
upstream_id
"05e369b65a168978dee77d3ae96cdf25"
total_cost
0.0164403
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Minimax"
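The timing and token fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that generation_time is in milliseconds and that the displayed Duration is generation_time rounded to whole seconds; the page does not state either, though the result matches the reported 3m 50s. The helper name format_duration is hypothetical.

```python
# Values copied from the metadata fields above.
generation_time_ms = 229506          # assumed to be milliseconds
native_tokens_completion = 13675
native_tokens_reasoning = 13067


def format_duration(ms: int) -> str:
    """Render a millisecond count as 'Xm Ys', rounding to whole seconds."""
    total_seconds = round(ms / 1000)
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


# Completion tokens that are not reasoning tokens (the visible scene text).
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Share of the completion spent on reasoning, and overall throughput.
reasoning_share = native_tokens_reasoning / native_tokens_completion
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(format_duration(generation_time_ms))
print(f"visible tokens: {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Under these assumptions, nearly all of the completion tokens are reasoning tokens, which would account for a run of almost four minutes producing a scene of about 500 words.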
Evaluation details
Matches word count: 99.98%
Details: n/a
Meta Data: neededClean: false, words: 502
Dialogue to Total Word Ratio: 80.02%
Details: Ratio: 72.11%, Deviation: 2.11%
Meta Data: neededClean: false, wordsTotal: 502, wordsDialogue: 362
Overall: 90.0004%
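The reported ratio and deviation follow directly from the word counts in the metadata. The sketch below reproduces them, taking the 70% target from the scenario description. The page does not document how a deviation is turned into a score, so that step is left out. Treating the overall figure as the plain mean of the two evaluator scores is an assumption; it agrees with the reported 90.0004% only up to rounding of the displayed scores. The function name dialogue_ratio is hypothetical.

```python
TARGET_RATIO = 70.0  # percent, from the scenario "70% dialogue"


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return dialogue words as a percentage of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


# Counts copied from the evaluator metadata above.
ratio = dialogue_ratio(words_dialogue=362, words_total=502)
deviation = abs(ratio - TARGET_RATIO)

# Assumption: the overall score is the unweighted mean of the evaluator scores.
mean_score = (99.98 + 80.02) / 2

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Mean of evaluator scores: {mean_score:.2f}%")
```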