NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1HVYQYZVKYHHMNNPD824DX
Run Details
Model:
MiniMax M3
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
4m 30s
Cost:
$0.016008
Provider:
Minimax
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her cat-eye mask and tapped the stolen blueprints with a manicured nail. The Henderson kid's seventh birthday party would be a circus of sugar-hyped witnesses, Hawaiian-shirted fathers, and one deeply concerning golden retriever named Biscuit, who had once bitten a FedEx driver on the shin. Bill studied the schematic. "What's the plan, Moll?" "Cake entrance. Balloon lady." "And me?" "Sad clown. Parents ignore clowns." Bill nodded slowly. Their disguises would get them inside, no question. The real challenge was the sheer volume of the haul. Every wrapped present on a sixty-foot table, the piñata, goody bags everywhere. Molly had run the numbers twice. Seventeen thousand in gift cards alone, conservatively, plus a Hot Wheels collection worth killing for. "Exit?" "Window. Fireman's slide. Fast." Biscuit growled in the distance. A child shrieked with joy. The piñata quivered ominously above a circle of sticky-fingered predators. Molly checked her watch. The party started in ninety minutes. Plenty of time, she figured, to pull off the biggest heist of their careers. Time enough to perfect her balloon animals. Time enough to rehearse the saddest clown face. Time enough to acquire a squeaky decoy for that horrible, horrible dog. The heist was on.
created_at
"2026-06-01T12:17:20.642Z"
6/1/2026, 12:17:20 PM
model
"minimax/minimax-m3-20260531"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1662
moderation_latency
(null)
generation_time
269520
tokens_prompt
77
tokens_completion
11256
native_tokens_prompt
236
native_tokens_completion
13325
native_tokens_completion_images
(null)
native_tokens_reasoning
10918
native_tokens_cached
222
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.01600752
router
(null)
provider_responses
0
endpoint_id
"6851b352-a7e2-45ad-a69e-67ed1208a416"
id
"066cab5028cb8c78c46f4613d50dd371"
is_byok
false
latency
1659
model_permaslug
"minimax/minimax-m3-20260531"
provider_name
"Minimax"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316240-i5ASr7Mv2ku7xLB6a8IS"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316240-XhysrTMINmF7CX2dwU0n"
upstream_id
"066cab5028cb8c78c46f4613d50dd371"
total_cost
0.01600752
cache_discount
0.00010656
upstream_inference_cost
0
provider_name
"Minimax"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
100.00%
Dialogue to Total Word Ratio
Ratio: 9.76%, Deviation: 0.24%
neededClean
false
wordsTotal
205
wordsDialogue
20
100.0000%