NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1GZVWSEA535EW58VHMCFG6
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
21.1s
Cost:
$0.000470
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
14.30%
Matches word count
(details)
99.94%
Dialogue to Total Word Ratio
(details)
57.1215%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her oversized sunglasses and studied the leaflet with the intensity of a surgeon examining an x-ray. It was a perfect target: sugar-fueled chaos, distracted parents, and a mountain of wrapped loot just waiting to be liberated from the tyranny of grateful children. "Children's parties are low-hanging fruit," Bill muttered, stroking his terrible mustache thoughtfully. "Literally, with all those juice boxes around." Molly nodded, her mind already racing through logistics. The venue was a community hall with exactly one security camera and a fire exit that groaned louder than her Uncle Gerald after Thanksgiving dinner. She estimated there would be roughly forty kids, twelve adults, one magician with shaky hands, and a piƱata containing enough confetti to cause a small avalanche. "Honk if you love crime," Bill said. "Honk," Molly replied without hesitation. The plan was airtight, or at least air-adjacent. Bill would disguise himself as a clown, because nothing lowers suspicion like face paint and shoes that honk. Molly would pose as a health inspector, clipboard in hand, authority in voice. They would move fast, grab the gift table, and vanish into the night like the ghosts of Christmas past, except meaner and with better shoes. By the time the magician pulled the rabbit from his hat, they'd be long gone, already arguing about who got the Easy-Bake Oven.
created_at
"2026-06-01T12:02:00.222Z"
6/1/2026, 12:02:00 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1284
moderation_latency
(null)
generation_time
21093
tokens_prompt
77
tokens_completion
468
native_tokens_prompt
236
native_tokens_completion
377
native_tokens_completion_images
(null)
native_tokens_reasoning
110
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00046992
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066ca7b8aa678ee8e95c911106a592f9"
is_byok
false
latency
1284
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780315320-QDqd3FQWSEd3p1UlZica"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780315320-KDAQXReiWGLFO6hPTjpx"
upstream_id
"066ca7b8aa678ee8e95c911106a592f9"
total_cost
0.00046992
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
14.30%
Matches word count
n/a
neededClean
false
words
221
99.94%
Dialogue to Total Word Ratio
Ratio: 8.44%, Deviation: 1.56%
neededClean
false
wordsTotal
225
wordsDialogue
19
57.1215%