NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFMD7TCRWDST4PQVE1RFTS
Run Details
Model:
Minimax M2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 56s
Cost:
$0.018248
Provider:
AtlasCloud
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
95.2419%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain drummed against the cracked window of Molly's cramped hideout, a converted storage closet that smelled of stale popcorn and ambition. Bill lounged on a busted armchair, his feet propped on a folding table covered in neon sticky notes shaped like cartoon rockets. In front of them, a hand-drawn map of the Maplewood Community Center was pinned to the wall, its crayon blue walls and glittery streamers a sharp contrast to the thieves' grimy scheme. The annual Sunshine Squad birthday bash was slated for Saturday afternoon, a sugar-fueled extravaganza where dozens of kids would converge for games, cake, and a mountain of wrapped gifts. Molly traced a finger along the perimeter, whispering the details she had gathered from a sneaky glance at the parent's flyer. She had scouted the location the day before, noting the positions of the bounce house, the face‑painting station, and the looming inflatable castle that blocked the back entrance. Bill, ever the optimist, claimed that a disguised clown costume would grant them unrestricted access, but Molly rolled her eyes, pointing out that the party’s schedule was tighter than a jar of pickles. Molly: "We can't just stroll in wearing a clown nose. The parents will be scanning the guest list like hungry hawks." Bill: "That's why I brought these invitations. Look, they have the kid's name, a cartoon dinosaur, and an RSVP stamp." Molly: "Nice, but problem is gift heap. It's stacked near the stage, next to the DJ booth. DJ will blast 'Happy Birthday' on repeat." Bill: "We time it with cake cutting. When lights dim for candles, we slip behind curtain, swipe the bags, and vanish into night right now." Molly: "And if a kid spots us? We'll claim we're the magical present elves, here to deliver the gifts to the 'special vault' in the back." Bill: "Perfect. We'll wear those little elf hats. Kids love elves, they'll think it's a surprise." Molly: "Just remember, no one gets hurt. We steal presents, not birthdays." Bill: "Agreed. Now let's test the zip‑line we borrowed from the elementary school gym." Molly nodded, slipping the invitations into a hidden pocket of her jacket. Bill pulled out a battered toolbox and began calibrating the zip‑line, whispering the numbers like a prayer. The rain had stopped, and the streetlights flickered, casting long shadows across the cracked pavement. They rehearsed their moves in slow motion, counting steps, timing the music cue, and testing the weight of a bulging gift bag. As the hour approached, they slipped out of the hideout, the night air cool against their skin. The community center loomed ahead, its colorful banners fluttering like promise. With a final glance at each other, they merged into the crowd of parents and balloons, ready to turn a child's celebration into their own personal jackpot. Bill whispered a countdown while Molly adjusted the tiny earpiece in her hair, a relic from a past heist that still crackled with static. The street was silent except for distant car hums and occasional laughter from nearby houses in the night.
created_at
"2026-02-17T09:44:45.222476+00:00"
2/17/2026, 9:44:45 AM
model
"minimax/minimax-m2.5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2648
moderation_latency
(null)
generation_time
175873
tokens_prompt
77
tokens_completion
16604
native_tokens_prompt
98
native_tokens_completion
15182
native_tokens_completion_images
(null)
native_tokens_reasoning
15825
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0182478
router
(null)
provider_responses
0
id
"54837407de294811b8dd8fcce84be53d"
status
200
is_byok
false
latency
2648
endpoint_id
"acf82778-f331-40d9-ad3f-8c57494d4c4b"
provider_name
"AtlasCloud"
model_permaslug
"minimax/minimax-m2.5-20260211"
api_type
"completions"
id
"gen-1771321308-UYrBKVVejhuzRf6POc8f"
upstream_id
"54837407de294811b8dd8fcce84be53d"
total_cost
0.0182478
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
510
100.00%
Dialogue to Total Word Ratio
Ratio: 30.00%, Deviation: 0.00%
neededClean
false
wordsTotal
510
wordsDialogue
153
95.2419%