NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNETNP73E55SMKSK0RRVXTM
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 18s
Cost:
$0.020271
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
93.65%
Matches word count
(details)
98.98%
Dialogue to Total Word Ratio
(details)
96.3169%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The basement smelled of stale pizza and ambition, the air thick with the faint whiff of cheap cologne and the distant echo of a birthday march. A folding table held a hand‑drawn map of the community center, its walls papered with crayon suns, while a stack of invites lay scattered like fallen confetti. Molly perched on a wobbly stool, twirling a plastic spoon, and Bill squinted at a faded flyer, his glasses smudged with ketchup. She whispered, “We need a distraction that makes the kids scream with joy, not terror.” Bill tapped the flyer, pointing at a cartoon clown and a magician, and muttered, “Both are overbooked, but we could steal an act.” Molly grinned, “Let’s pose as the catering crew, roll a cart past the guard, and lift the presents while the crowd sings ‘Happy Birthday.’” Bill rubbed his chin, “What about the security guard? He’s got a clipboard and a badge that could double as a weapon.” Molly waved a hand, “We’ll slip him a fake VIP invite. He’ll wave us through, and we’ll be inside before the first balloon pops.” Bill chuckled, “And the loot? The presents are stacked in a corner, each wrapped in glittery paper that could set off an alarm if we’re not careful.” Molly nodded, “We’ll use a glitter bomb. One flash and the room sparkles while we grab the stash. The kids will think it’s part of the show.” Bill grinned, “Alright, we’ve got a catering cover, a glitter distraction, and a fake VIP. We’ll rehearse at dawn and claim the prize.” The next morning, the two thieves met in a cramped kitchen, the smell of fresh coffee mingling with the lingering aroma of glue from a hastily assembled fake mustache. Molly laid out a checklist: a borrowed catering van, a bright red apron with a stitched logo, and a stack of faux VIP invitations printed on cheap cardstock. Bill examined the list, cross‑checking each item against a mental diagram of the community center’s floor plan, noting the location of the security desk, the cake table, and the cluster of wrapped gifts near the stage. He muttered about the probability of a stray balloon popping during the heist, while Molly assured him that the glitter bomb would mask any accidental noise. They practiced their roles in front of a cracked mirror, Bill perfecting a nervous smile that looked more like a wince, and Molly rehearsing a cheerful “Welcome, welcome!” that could charm a mouse. By noon, they had transformed their shabby hideout into a makeshift rehearsal space, complete with a cardboard cutout of a bouncy castle and a wobbly table representing the gift pile. They high‑fived, the sound echoing off the concrete, and set the final time on a battered alarm clock: 3:15 PM, just as the birthday song began. With a final glance at the map, they slipped out, their hearts pounding in time with the imagined beats of a children’s drum, confident that by nightfall they would be the richest pair of petty thieves the town had ever seen.
created_at
"2026-02-17T09:30:04.080232+00:00"
2/17/2026, 9:30:04 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
540
moderation_latency
(null)
generation_time
137494
tokens_prompt
77
tokens_completion
16332
native_tokens_prompt
98
native_tokens_completion
16868
native_tokens_completion_images
(null)
native_tokens_reasoning
16225
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.020271
router
(null)
provider_responses
0
id
"019c6aed5772e23f43e406517ae12f42"
status
200
is_byok
false
latency
540
endpoint_id
"e89b11ef-ca0e-4cc1-9333-69343ec25819"
provider_name
"SiliconFlow"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771320465-qYu4cVoSsOX3RcTBYbOV"
upstream_id
"019c6aed5772e23f43e406517ae12f42"
total_cost
0.020271
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
Evaluation details
Result
Evaluator
Details
Meta Data
93.65%
Matches word count
n/a
neededClean
false
words
509
98.98%
Dialogue to Total Word Ratio
Ratio: 30.45%, Deviation: 0.45%
neededClean
false
wordsTotal
509
wordsDialogue
155
96.3169%