NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNEVYPGCPM9WBAB35P53NGG
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
2m 42s
Cost:
$0.011196
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The hall blazed with streamers, balloons, and a cake shaped like a superhero. Children shrieked and chased each other around a bouncy castle, while parents huddled near the drink station. A tower of wrapped gifts sat unguarded beside the DJ booth, tempting anyone daring enough to make a move. Two shadows, Molly and Bill, lingered near the snack table, pretending to admire the clown's juggling act. They exchanged a nervous glance, then edged toward the prize table, careful not to draw attention. The music thumped, the lights flickered, and the chaos provided the perfect cover for a daring theft today. "Molly: We’ll infiltrate during the cake ceremony, grab the pile, and dash out the back door." "Bill: I’ll set off the glitter bomb right as the singer hits the high note, creating a diversion." "Molly: While everyone’s eyes are on the explosion, I’ll swipe the gifts and you cover our exit." "Bill: Got it. Remember, we need to keep the noise level just right so the parents don’t panic." "Molly: I’ve already timed the fuse to three seconds. We’ll be in and out before the clown can even laugh." "Bill: Perfect. Let’s move before the kids start the next game."
created_at
"2026-02-17T09:31:09.949166+00:00"
2/17/2026, 9:31:09 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
5431
moderation_latency
(null)
generation_time
162016
tokens_prompt
77
tokens_completion
8757
native_tokens_prompt
101
native_tokens_completion
9305
native_tokens_completion_images
(null)
native_tokens_reasoning
9046
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0111963
router
(null)
provider_responses
0
id
"05e367bbdc8a4a9142041f644975262d"
status
200
is_byok
false
latency
5431
endpoint_id
"3afb00c4-6135-4c55-96e9-6befa9679b06"
provider_name
"Minimax"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771320507-MH2Hx0VGSZs7N1iS3jca"
upstream_id
"05e367bbdc8a4a9142041f644975262d"
total_cost
0.0111963
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Minimax"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
100.00%
Dialogue to Total Word Ratio
Ratio: 50.00%, Deviation: 0.00%
neededClean
false
wordsTotal
200
wordsDialogue
100
100.0000%