NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNEWZ3EAAKP78X47KHB2GZB
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
1m 18s
Cost:
$0.011198
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The moon hung low over the suburban cul‑de‑sac, casting a milky glow on the inflated bouncy castle. Molly wiped a smear of lipstick from her cheek and consulted the crumpled blueprint Bill had drawn on the back of a pizza receipt. The plan was absurdly simple: infiltrate the annual birthday blowout for little Timmy, swipe every shiny wrapped box, and vanish before the piñata even cracked. Bill adjusted his fake moustache and whispered his final adjustments, already picturing the children’s bewildered faces. The kitchen smelled of frosting and cheap pop, and a dozen parents hovered near the snack table, oblivious to the two shadows circling the loot. Molly crouched behind a balloon arch, her heart hammering like a tiny drum. Bill signaled with a nod, and they slipped through the back door, their shoes squeaking on the linoleum. The plan hinged on speed, coordination, and a believable alibi: a clown‑themed entertainer and his unsuspecting assistant. They rehearsed their roles with the seriousness of veteran actors, whispering jokes about sugar crashes and the inevitable tantrum when the cake disappears in silence. "We pose as clowns, steal gifts, exit before the cake." "I'll juggle, distract parents, you grab the loot quickly now."
created_at
"2026-02-17T09:30:18.862077+00:00"
2/17/2026, 9:30:18 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1420
moderation_latency
(null)
generation_time
77775
tokens_prompt
77
tokens_completion
10195
native_tokens_prompt
98
native_tokens_completion
9307
native_tokens_completion_images
(null)
native_tokens_reasoning
9052
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0111978
router
(null)
provider_responses
0
id
"019c6aee806ee3165a3f91dae3e5cb9c"
status
200
is_byok
false
latency
1420
endpoint_id
"e89b11ef-ca0e-4cc1-9333-69343ec25819"
provider_name
"SiliconFlow"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771320540-WSLVHAsEAEWpHG6Ocbfa"
upstream_id
"019c6aee806ee3165a3f91dae3e5cb9c"
total_cost
0.0111978
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
100.00%
Dialogue to Total Word Ratio
Ratio: 10.00%, Deviation: 0.00%
neededClean
false
wordsTotal
200
wordsDialogue
20
100.0000%