NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNF7RVW4TNCTD3EYTHEXC59
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
3m 49s
Cost:
$0.050619
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
99.99%
Dialogue to Total Word Ratio
(details)
99.9950%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leans against the rust‑spotted van, tapping a crumpled flyer. "Okay, we sneak in through the bounce house vent." The bounce house rattles in the wind, quietly hinting a secret entry. "You realize that vent is shaped like a giraffe, right?" Bill snorts, eyeing the giraffe‑shaped vent. "That's why I brought the laser cutter, stealth like a giraffe doorway." Molly brandishes a sleek laser cutter, its red light humming. "We won't cut cake, we'll swap it for dummy and pocket prize." Bill chuckles, imagining a fake cake that bursts with confetti. "What about clown? He has a screaming horn when we get close." Molly points at the noisy entertainer, planning to mute him. "I'll jam a whoopee cushion, his blow won't be the only pop." Bill laughs, picturing the clown's horn merely wheeze. "Ten piñatas, six goody bags, and a giant unicorn need a strategy." Molly sketches a quick plan, marking piñatas and a giant unicorn. "We blend as the magic show, you handle bunny, I grab treasure." Bill nods, adjusting his fake magician hat while Molly checks the timer. "Fine, if we get caught, we blame the hamster." They grin, engine rumbling, as the van rolls toward kids' party.
created_at
"2026-02-17T09:38:44.324807+00:00"
2/17/2026, 9:38:44 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1046
moderation_latency
(null)
generation_time
229308
tokens_prompt
77
tokens_completion
20854
native_tokens_prompt
101
native_tokens_completion
21066
native_tokens_completion_images
(null)
native_tokens_reasoning
20786
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.050619
router
(null)
provider_responses
0
id
"05e3693e2f2af799cdf17dc30f267dd2"
status
200
is_byok
false
latency
1046
endpoint_id
"489502e5-7555-4113-a3f3-18b8f71e8bb2"
provider_name
"Minimax"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771320894-gyghl6TKSQjQo3YIuITP"
upstream_id
"05e3693e2f2af799cdf17dc30f267dd2"
total_cost
0.050619
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Minimax"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
99.99%
Dialogue to Total Word Ratio
Ratio: 51.00%, Deviation: 1.00%
neededClean
false
wordsTotal
200
wordsDialogue
102
99.9950%