NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1HAN3CFZPS96J9QTYC2W5B
Run Details
Model:
Inception Mercury 2
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
6.6s
Cost:
$0.004646
Provider:
Inception
Evaluation summary
Score
Evaluator
Details Link
2.01%
Matches word count
(details)
16.53%
Dialogue to Total Word Ratio
(details)
9.2707%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The night before the birthday bash, Molly and Bill crouched behind a stack of cardboard boxes in the dimly lit hallway of the community center. The smell of pizza and cheap perfume lingered, and a stray balloon bobbed in the draft. Molly twirled a lockpick like a baton, while Bill adjusted his oversized sunglasses. They reviewed their plan on a crumpled napkin, scribbling doodles of candy and toys. The stakes were high: a mountain of presents, a room full of squealing children, and a security guard who took his job too seriously. Molly: "Alright, Bill, the party starts at 6, and the kids will be glued to the piñata until midnight." Bill: "Exactly, which means we have a ten‑minute window after the cake is cut." Molly: "The front door is guarded by Mr. Jenkins, but his cat is allergic to pepper spray." Bill: "Good, we’ll just sprinkle a little on the hallway and the cat will sprint." Molly: "Then we slip through the side window—see that one with the clown stickers?" Bill: "The one that squeaks when you touch it—perfect for a silent alarm." Molly: "Inside, the presents are stacked in a pyramid of wrapping paper." Bill: "I’ll grab the biggest gift box first, it’s probably the newest gaming console." Molly: "Don’t forget the chocolate cake—my sweet tooth is a liability." Bill: "I’ll take the cake, you take the toys, we split the loot 50‑50." Molly: "What about the piñata? It’s filled with candy, and kids love it." Bill: "We’ll smash it, collect the candy, and toss it out the trash." Molly: "The kids will think it’s a surprise, and the parents will blame the clown." Bill: "Speaking of clowns, the hired clown will be busy juggling the balloons." Molly: "Perfect, he’ll be distracted, and we can swap the balloons for confetti." Bill: "I’ll rig the confetti cannon to go off when the lights dim." Molly: "That will cause a blackout, and the security cameras will reboot." Bill: "Then we’ll slip in, grab the presents, and slip out before anyone notices." Molly: "We need a getaway car—my old van with the squeaky brakes." Bill: "I’ll park it behind the dumpster, and we’ll load the loot in the trunk." Molly: "What about the guard’s badge? He loves his badge more than his coffee." Bill: "We’ll pinch it, and later we’ll sell it to a collector of police memorabilia." Molly: "And the kids’ drawings on the fridge? We’ll take them as souvenirs." Bill: "I’ll keep the ones with unicorns; they fetch a good price on the dark web." Molly: "Do we have a timeline?" Bill: "Six minutes to get in, three minutes to grab, two minutes to get out." Molly: "What if the kids start crying?" Bill: "We’ll hand them a balloon and say it’s a magic trick." Molly: "And if the clown notices us?" Bill: "We’ll blame the clown’s bad breath, he’ll be too embarrassed to speak." Molly: "Alright, let’s rehearse the entry." Bill: "On three—one, two, three—now!" Molly: "Sneak, grab, dash—repeat until the lights go out." Bill: "And then we’ll be the most successful party crashers in town."
created_at
"2026-03-06T12:18:00.948Z"
3/6/2026, 12:18:00 PM
model
"inception/mercury-2-20260304"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6143
moderation_latency
(null)
generation_time
6566
tokens_prompt
77
tokens_completion
2590
native_tokens_prompt
60
native_tokens_completion
6190
native_tokens_completion_images
(null)
native_tokens_reasoning
5439
native_tokens_cached
50
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00464625
router
(null)
provider_responses
0
endpoint_id
"c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
id
"chatcmpl-8566e71c-fd91-4c33-8db1-4aeca34a5772"
is_byok
false
latency
52
model_permaslug
"inception/mercury-2-20260304"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799480-t7oVwZ3U79omj5WkPd7w"
upstream_id
"chatcmpl-8566e71c-fd91-4c33-8db1-4aeca34a5772"
total_cost
0.00464625
cache_discount
0.00001125
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result
Evaluator
Details
Meta Data
2.01%
Matches word count
n/a
neededClean
false
words
525
16.53%
Dialogue to Total Word Ratio
Ratio: 76.00%, Deviation: 6.00%
neededClean
false
wordsTotal
525
wordsDialogue
399
9.2707%