NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNEVX53VC1556JP5YRXCTBH
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
1m 43s
Cost:
$0.009215
Provider:
Fireworks
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
50.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The street was quiet, but the house blazed with streamers. The cake was a towering castle of frosting, gleaming brightly. Molly: Okay, Bill, we’ve got thirty minutes before the piñata drops, and the presents are stacked like a fortress. Bill: Perfect. I’ll masquerade as the magician’s assistant, fake wand loaded with glitter bombs. Molly: The clown’s balloon animals will be our cover. We inflate a few, hide loot inside, and stroll out like proud parents. Bill: When the cake smash starts, we’ll slip behind the bounce house, grab the gift bags, and vanish before anyone yells surprise. Molly: Parents have smartphones, so we keep faces hidden or we’ll go viral as the ‘thieves at a toddler party.’ Bill: I swapped the juice boxes for medicine‑flavored ones. That’ll keep the kids away from the table for five minutes. Molly: I’ll hide empty gift bags in the sandbox, then we exit through the hedge maze. The only thing missing will be the loot. Bill: If anyone asks, we’re the new clowns hired for the after‑party. I’ve got a rubber chicken. Molly: Let’s move. The first song is ending, and the kids are eyeing the piñata. We don’t want to miss the candy.
created_at
"2026-02-17T09:30:09.12052+00:00"
2/17/2026, 9:30:09 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
5945
moderation_latency
(null)
generation_time
102831
tokens_prompt
77
tokens_completion
7072
native_tokens_prompt
97
native_tokens_completion
7669
native_tokens_completion_images
(null)
native_tokens_reasoning
6761
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.009214556
router
(null)
provider_responses
0
id
"d1b52d72-13d2-4704-9192-d9ccf287ecd5"
status
200
is_byok
false
latency
5945
endpoint_id
"c61d37ec-745c-4efd-8129-a68371e056b2"
provider_name
"Fireworks"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771320505-zr7gm1VcgR10Eio1DjNM"
upstream_id
"d1b52d72-13d2-4704-9192-d9ccf287ecd5"
total_cost
0.009214556
cache_discount
0.000017344
upstream_inference_cost
0
provider_name
"Fireworks"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
0.00%
Dialogue to Total Word Ratio
Ratio: 2.50%, Deviation: 87.50%
neededClean
false
wordsTotal
200
wordsDialogue
5
50.0000%