NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1GTYHERZWVRDB3A0RXR8G8
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
9m 8s
Cost:
$0.031153
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
70.03%
Dialogue to Total Word Ratio
(details)
85.0144%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the inflatable bouncy castle with the focused intensity of a woman who had trained for this moment her entire adult life, give or take a few weekends spent watching paint dry. The backyard sprawled before her like a battlefield decorated by someone with a serious sugar addiction. Streamers hung from trees. A bouncy castle deflated slowly in the corner like a sad metaphor for childhood itself. Children ran in circles, screaming with the joy only achievable through cake and a complete lack of adult supervision. This was the wild wilderness of suburbia, and Molly was its most glamorous predator. Bill slid in next to her, a fistful of jelly beans in one hand and a heat map of the party drawn in crayon on the other. He looked like a man who had been hit by optimism and was still recovering. His socks did not match. This was not unusual. Bill had marked the cake table with a small explosion of yellow. He had shaded the gift pile in aggressive red. He had even drawn the piñata wearing a tiny frown. "Talk to me," Molly whispered. "What are we dealing with?" "Twenty-seven kids. Fourteen adults. Two clowns. One traumatized piñata." "Elmo stays. I like him." She squinted at the layout. "Parents?" "Hovering around the grill like seagulls at a chip shop. Dangerous, but predictable." "Distractions. What's in the bag?" "Three whoopee cushions, a kazoo, a tube of vanishing cream, and a rubber chicken I found in a ditch." "The chicken stays too. We're building a family, Bill." He did not appear moved by this. Molly consulted the crayon map, tapping the present table with one finger. The gifts sat in a pile like sleeping toddlers—vulnerable, unaware, not to be touched. But Molly was going to touch them. That was the job. That was the dream. "You'll be the clown," she said. "I don't look like a clown." "You will. There's a wig in the car. Red nose. Oversized shoes. The full package." "Why me?" "Because you cry beautifully under pressure. And I've got an allergy to face paint." "That's a lie." "What do I say to the kids?" Bill asked. "Tell them you're Bongo the Magnificent. They won't question it. Children believe anything." "Clowns speak in nonsense, Bill. Honk. Cry. Juggle the chicken. We rendezvous at the bouncy castle in seven minutes. We leave covered in glitter and shame. Got it?" "Possums," Bill muttered. "Possums," Molly agreed, and that was the whole plan. He waddled off toward the car, shoulders squared with the dignity of a man about to commit career-ending foolishness. Molly watched him go, then turned back. A child had just shoved an entire cupcake into the piñata's eye socket, and the piñata had not made a sound. Even Elmo knew when to keep his mouth shut. The afternoon smelled like sugar and minor crime. Somewhere in the trunk of an unmarked car, a wig waited. Somewhere in Bill's heart, the chicken was watching.
created_at
"2026-06-01T11:59:19.09Z"
6/1/2026, 11:59:19 AM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
904
moderation_latency
(null)
generation_time
547951
tokens_prompt
77
tokens_completion
24801
native_tokens_prompt
236
native_tokens_completion
25925
native_tokens_completion_images
(null)
native_tokens_reasoning
24040
native_tokens_cached
114
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.03115344
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca7173b90f70a3982d2e9d3d33845"
is_byok
false
latency
904
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780315159-45GCdwB80vLdJofemT0S"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780315159-8ZqDJ38YYh8MteA1cXYl"
upstream_id
"066ca7173b90f70a3982d2e9d3d33845"
total_cost
0.03115344
cache_discount
0.00005472
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
70.03%
Dialogue to Total Word Ratio
Ratio: 32.67%, Deviation: 2.67%
neededClean
false
wordsTotal
502
wordsDialogue
164
85.0144%