NC Bench
Run: 2026-02-17-01KHNFRY9RYVMVNEBJG6QWBD4Q
Run Details
Model: Minimax M2.5
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Duration: 3m 12s
Cost: $0.010683
Provider: Minimax
Evaluation summary
Score | Evaluator
100.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
49.9995% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched behind the bouncy castle, whispering. The party was in full swing, the music thumped outside now. Molly: I still think the clown is a red herring. Bill: The clown's got a key to the gift vault, I'm sure. Molly: We slip in with the piñata crew, grab the loot when the kids go for cake. Bill: What about the parents? They’ll be watching like hawks. Molly: We’ll distract them with a karaoke machine. Bill: Perfect, I’ll hijack the playlist and blast a solo. Molly: While they’re cringing, we swipe the presents. Bill: And the hidden safe behind the balloon arch? Molly: I’ve got the laser cutter. Bill: Okay, meet at the bouncy castle at three. Molly: And remember, no confetti. Bill: Agreed, confetti is a crime scene. Molly: I'm bringing a collapsible ladder, the back fence is six feet. Bill: And I’ve got a dog to sniff out the hidden treats. Molly: When the cake is cut, we’ll slip the presents into the goody bags. Bill: If anyone asks, we’re the entertainment, here for the magic show. Molly: And if the clown tries to intervene, we’ll use the glitter bomb. Bill: Glitter’s the perfect alibi—it’s everywhere.
created_at
"2026-02-17T09:47:29.378185+00:00"
2/17/2026, 9:47:29 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6094
moderation_latency
(null)
generation_time
191945
tokens_prompt
77
tokens_completion
7667
native_tokens_prompt
101
native_tokens_completion
8877
native_tokens_completion_images
(null)
native_tokens_reasoning
8586
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0106827
router
(null)
provider_responses
0
id
"05e36b701ab6e9778df2d153b9070636"
status
200
is_byok
false
latency
6094
endpoint_id
"3afb00c4-6135-4c55-96e9-6befa9679b06"
provider_name
"Minimax"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771321456-avj5R8V7ObeMSuo3ue3T"
upstream_id
"05e36b701ab6e9778df2d153b9070636"
total_cost
0.0106827
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Minimax"
Evaluation details
Result | Evaluator | Details | Meta Data
100.00% | Matches word count | n/a | neededClean: false; words: 201
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 90.00% | neededClean: false; wordsTotal: 201; wordsDialogue: 0
49.9995% | Overall
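The 0 dialogue words reported above are consistent with a counter that treats only quoted text as dialogue: the response uses script-style speaker labels ("Molly: ...") and contains no double quotation marks. The sketch below illustrates that reading. It is an assumption about how such a counter might behave, not the benchmark's actual evaluator; the function `dialogue_ratio`, the quote-matching rule, and the sample sentences are hypothetical.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio).

    Assumption: only text inside straight or curly double quotes
    counts as dialogue. The real evaluator may use a different rule.
    """
    total = len(text.split())
    # Each match is a (straight_quoted, curly_quoted) tuple; one side is empty.
    spans = re.findall(r'"([^"]*)"|“([^”]*)”', text)
    dialogue = sum(len((straight or curly).split()) for straight, curly in spans)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


# Script-style line, as in the model response: no quotation marks.
script_style = "Molly: I still think the clown is a red herring."
# The same line written as quoted prose dialogue (hypothetical rewrite).
quoted_style = '"I still think the clown is a red herring," Molly said.'

target = 0.90  # the scenario asks for 90% dialogue
for sample in (script_style, quoted_style):
    words_dialogue, words_total, ratio = dialogue_ratio(sample)
    deviation = abs(target - ratio)
    print(words_dialogue, words_total, f"ratio={ratio:.2%}", f"deviation={deviation:.2%}")
```

Under this assumption, the script-style line yields no dialogue words and a 90% deviation from the target, matching the pattern in the evaluation details, while the quoted rewrite is counted as mostly dialogue.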