NC Bench
Run: 2026-03-06-01KK1HZY3F8R83DZ5VZ9KG6XDB
Run Details
Model: Inception Mercury 2
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 11.8s
Cost: $0.008218
Provider: Inception
Evaluation summary
Score     Evaluator
100.00%   Matches word count
0.00%     Dialogue to Total Word Ratio
49.9995%  Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The night was unusually quiet in the suburban cul‑de‑sac, the kind of silence that made the streetlights look like stage spotlights. Molly leaned against the rusted fence, a half‑eaten donut in one hand and a clipboard of crayon schematics in the other. Bill perched on the low wall opposite, fidgeting with a plastic party hat he had stolen from a previous job. The target was a birthday bash at the Johnsons’ house, a room filled with balloons, a dinosaur cake, and a mountain of wrapped gifts glimmering under flickering fairy lights. They had scoped the place earlier, noting two adult chaperones, a neighbor’s dog that barked at every footstep, and the kids’ endless energy. Their plan was simple: slip in through the back door, grab the presents, and disappear before the “Happy Birthday” song started. But as thieves, they liked to add a little flair to the execution. Molly: Okay, Bill, the kids will be distracted by the clown, giving us a window before the cake is cut. Bill: Ten minutes? That's a lifetime for a thief who can't even tie his shoes without tripping. Molly: Focus. The back door is unlocked—Mrs. Johnson left the key on kitchen table when she went to the bathroom. Bill: The dog? That mutt barks at the slightest rustle. Molly: I've got a squeaky toy that will keep him busy for three minutes. Then we slip in. Bill: Three minutes? I was hoping for at least five. What about the chaperones? Molly: One is in the kitchen, the other in the living room, glued to their phones scrolling through memes. Bill: Memes? Kids these days are more into cat videos than actual party fun. Molly: Which is why the cake is shaped like a dinosaur—expects a T‑Rex to be edible. Bill: Do we have a bag big enough for all the wrapped gifts? Molly: I stole a reusable tote last week; it folds compactly and can hold a mountain of presents. Bill: And the security camera above the hallway? Molly: Disabled it with a cheap EMP gadget; it’ll be blind for fifteen minutes. Bill: Fifteen minutes? 
That’s generous. What if the kids start a conga line early? Molly: Then we join them, dance to the gift table, and grab boxes while they’re distracted. Bill: I can picture it—Molly in a glittery hat, Bill doing the robot. Classic. Molly: Keep your eyes on the biggest present under the red balloon. It’s the perfect cover. Bill: The kids will think it’s a magic trick. Molly: If they ask, we’ll claim the presents vanished because we’re the “Great Gift Ghosts.” Bill: Do we have an alibi? Molly: We’ll say we were at the karaoke bar belting “Happy Birthday” at the top of our lungs. Bill: Got it. I’ll bring extra gloves in case the gifts are sticky. Molly: Bring the tote, squeaky toy, and EMP. Meet me at the back gate at 7:45. Bill: Perfect. This will be the most ridiculous heist ever.
created_at
"2026-03-06T12:29:38.317Z"
3/6/2026, 12:29:38 PM
model
"inception/mercury-2-20260304"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
11291
moderation_latency
(null)
generation_time
11681
tokens_prompt
77
tokens_completion
5785
native_tokens_prompt
60
native_tokens_completion
10952
native_tokens_completion_images
(null)
native_tokens_reasoning
10260
native_tokens_cached
50
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00821775
router
(null)
provider_responses
0
endpoint_id
"c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
id
"chatcmpl-bb96acd0-7ce5-48dc-8195-3bfff57dd877"
is_byok
false
latency
4
model_permaslug
"inception/mercury-2-20260304"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772800178-T4VsldSVkaVlRSwteDtW"
upstream_id
"chatcmpl-bb96acd0-7ce5-48dc-8195-3bfff57dd877"
total_cost
0.00821775
cache_discount
0.00001125
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result: 100.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 499

Result: 0.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 6.21%, Deviation: 63.79%
Meta Data: neededClean: false, wordsTotal: 499, wordsDialogue: 31

Overall: 49.9995%
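
The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator can be reproduced from the wordsDialogue and wordsTotal counts. The sketch below is only an illustration of that arithmetic, not the evaluator's actual code: the function name `dialogue_ratio_report` is hypothetical, and the 70% target is taken from the scenario text. How the evaluator decides which words count as dialogue, and how it maps the deviation to a 0.00% score, is not shown in this report and is not modeled here.

```python
def dialogue_ratio_report(words_dialogue: int, words_total: int,
                          target_pct: float = 70.0) -> tuple[float, float]:
    """Return (ratio %, absolute deviation from target %) rounded to 2 places.

    Hypothetical helper: it only mirrors the arithmetic implied by the
    reported "Ratio" and "Deviation" figures.
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    # Share of all words that were counted as dialogue, in percent.
    ratio_pct = 100.0 * words_dialogue / words_total
    # Distance from the requested share (70% in this scenario, an assumption
    # read off the scenario description).
    deviation_pct = abs(target_pct - ratio_pct)
    return round(ratio_pct, 2), round(deviation_pct, 2)


# Counts taken from the Meta Data of this run.
ratio, deviation = dialogue_ratio_report(31, 499)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 6.21%, Deviation: 63.79%"
```

With 31 dialogue words out of 499, the dialogue share comes to about 6.21%, which is 63.79 percentage points below the 70% requested in the prompt.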