NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1HNBF9BTT76N1QQYV0WKG2
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
3m 25s
Cost:
$0.010533
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
98.09%
Dialogue to Total Word Ratio
(details)
99.0460%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprints spread across the hood of their getaway car. Bill tapped his finger on the bounce house diagram, sighing. "You're telling me," Bill said, "we dress as a six-foot rabbit, sneak into a five-year-old's party, and steal sixty presents?" "Sixty-three," Molly corrected. "The parents have an unopened pony ride coupon." "Molly, I'm a professional. We don't steal coupons." "Professionals adapt. Also, that coupon's worth eight dollars. Now hand me the face paint." "What about the clown? These events always have clowns." "The clown's getting fired tomorrow. Two Yelp stars. Terrible balloon animals." "So we replace the clown?" "And the rabbit. I need the whole costume. Are you in or out?" "What about the loot? Where do we fence finger paintings?" "Finger paintings? Please, Bill. I checked the registry. They want a bike." "A bike? We're stealing a bike now?" "Two bikes. Twins. Matching helmets included." Bill sighed. "What about security? Birthday parties are basically fortresses." "Terrified mothers. One dad with a camcorder. I'll handle mom. You distract dad with dad jokes." "Absolutely not. My material is sacred." "Knock knock." "Molly, no." "Who's there?" "Honeydew you want me to drive the getaway van?" Bill paused. "Pink whiskers or black, partner?"
created_at
"2026-06-01T12:13:44.305Z"
6/1/2026, 12:13:44 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1404
moderation_latency
(null)
generation_time
204699
tokens_prompt
77
tokens_completion
7876
native_tokens_prompt
236
native_tokens_completion
8763
native_tokens_completion_images
(null)
native_tokens_reasoning
7522
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.01053312
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066caa78f7e8deff909d322fb9e96310"
is_byok
false
latency
1404
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316024-jN6vwBf07v24jaVvfX6H"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316024-fYntwyiYwzbg3tf1Vydt"
upstream_id
"066caa78f7e8deff909d322fb9e96310"
total_cost
0.01053312
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
98.09%
Dialogue to Total Word Ratio
Ratio: 86.27%, Deviation: 3.73%
neededClean
false
wordsTotal
204
wordsDialogue
176
99.0460%