NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFVAT8BH8RTHMYGZP6XZM7
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 49s
Cost:
$0.015221
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
87.99%
Dialogue to Total Word Ratio
(details)
93.9927%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the oversized clown nose, eyeballing the bouncy castle that had been transformed into a glittering present stack. The neighborhood's annual kids' carnival was in full swing, with streamers cutting the sky and a troupe of magician assistants juggling glitter bombs. In the background, a piped in soundtrack of cartoon theme songs threatened to drown out any suspicious rustle. She whispered to Bill, who was pretending to be a parent chaperone with a clipboard and a suspicious amount of glitter on his shirt. Molly: "First, we need to figure out where the present table is hidden. The kids are currently occupied with a puppet show, so we have maybe fifteen minutes." Bill: "I've already scouted. It's behind the giant inflatable giraffe, next to the cotton candy stand. I also spotted a security guard who looks like he's more interested in his phone than the candy." Molly: "Perfect. We'll pose as the entertainment. I'll be the clown who can inflate balloons into shapes, and you can be the magician with the hat." Bill: "I brought a bag of 'trick' confetti that actually emits a harmless smoke screen. We'll need a signal to coordinate the switch." Molly: "How about we use the phrase 'pie for the birthday boy' as our cue. When the DJ announces it, we make our move." Bill: "Deal. And if any kid asks for a balloon animal, just hand them a regular balloon and walk away." Molly: "Exactly. Let's practice our innocent laugh. We want to sound like parents, not criminals." Bill: "I can do a giggle that sounds like a squeaky toy. Ready?" Molly: "Ready. And remember, no matter how cute the presents look, we stick to the plan. No sticky fingers." Bill: "No sticky fingers. Except maybe a sample chocolate." Bill: "After we grab the loot, we slip out through the exit behind the cotton candy stand. And clean." Molly: "And if anyone asks why we're carrying a sack of gifts, we say it's for a drive. Easy peasy." The plan was absurd enough to sound like a cartoon caper, but Molly had a habit of turning childish chaos into cash. She imagined the children’s faces when the presents vanished, their eyes wide as saucers, while she and Bill vanished into the crowd with sacks full of toys. Bill Practiced his innocent smile in the reflection of a soda can, adjusting his fake mustache so it wouldn't slip. The DJ’s voice crackled over the loudspeaker, announcing the birthday boy’s name and promising a cake that was taller than most toddlers. The air smelled of sugar, popcorn, and cheap perfume, a mixture that made Bill’s stomach rumble, but his mind stayed focused on the prize. With the clock ticking, they shuffled toward the gift table, the balloons bobbing like eager spectators. They exchanged a quick nod, the kind that only partners who've stolen candy from a kindergarten can understand, and stepped toward the glittering table, ready to execute the most whimsical heist of the season.
created_at
"2026-02-17T09:47:25.151956+00:00"
2/17/2026, 9:47:25 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1460
moderation_latency
(null)
generation_time
109116
tokens_prompt
77
tokens_completion
13228
native_tokens_prompt
98
native_tokens_completion
12660
native_tokens_completion_images
(null)
native_tokens_reasoning
12033
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0152214
router
(null)
provider_responses
0
id
"019c6afdaf6a80701e0552d6894a28c2"
status
200
is_byok
false
latency
1460
endpoint_id
"e89b11ef-ca0e-4cc1-9333-69343ec25819"
provider_name
"SiliconFlow"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771321535-M1jlNJDpiAYKikAjpUFn"
upstream_id
"019c6afdaf6a80701e0552d6894a28c2"
total_cost
0.0152214
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
87.99%
Dialogue to Total Word Ratio
Ratio: 51.60%, Deviation: 1.60%
neededClean
false
wordsTotal
500
wordsDialogue
258
93.9927%