NC Bench
Run: 2026-03-06-01KK1GBV4TRTSR9ASNQ1RY6GAF
Run Details
Model: Inception Mercury 2
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 7.4s
Cost: $0.005113
Provider: Inception
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The night was unusually quiet in the suburban cul‑de‑sac, the streetlights flickering like nervous fireflies. A brightly colored banner flapped lazily over the driveway, announcing a 'Superhero Celebration' for six‑year‑olds. Inside, a pile of gift‑wrapped boxes waited, each wrapped in glittering paper and tied with ribbons that shone like tiny rainbows. Molly and Bill crouched behind a hedge, their backpacks bulging with tools. The faint sound of a children's laugh drifted from the open back door, a reminder that the party was already in full swing. Molly checked her watch, then whispered, 'We have exactly ten minutes before the parents start cleaning up.' Bill nodded, his eyes scanning the yard for security cameras. The plan was simple: slip in through the side gate, grab every present, and disappear before the birthday cake was even sliced. But the real challenge was not the alarms—it was the inevitable mess of plastic and confetti that would give them away. Molly: Okay, Bill, the side gate is unlocked; we just need a hairpin to slip the latch. Bill: Got it. I have a hairpin in my pocket, plus a set of lock picks for backup. Molly: The kids' parents will be distracted by the clown; his balloon animals are a perfect diversion. Bill: The clown will be juggling for at least fifteen minutes, giving us a window. Molly: We should aim for the gift table near the cake; that's where the biggest boxes sit. Bill: And the candy bowls—those are low hanging fruit, literally. Molly: I’ll disable the motion sensor on the porch with this little EMP gadget. Bill: Make sure the EMP is set to low power; we don’t want to fry the house lights. Molly: Once inside, I’ll grab the largest present first—looks like a bike. Bill: Then we split the loot: you take the big stuff, I’ll handle the smaller toys. Molly: We need to be quick; the birthday song starts in five minutes. Bill: I’ll keep an eye on the timer on my watch; if it hits zero, we bail. Molly: What about the confetti cannons? 
They could give us away if they fire. Bill: I’ll throw a wet towel over the trigger; that should dampen the pop. Molly: And the balloons—if they float away, we’ll be spotted. Bill: I’ll cut the strings with scissors; they’ll drop to the ground quietly. Molly: The kids will be busy opening presents; they won’t notice a couple of adults slipping in. Bill: I’ll stay near the back door, ready to slip out with the sack. Molly: The sack is waterproof, so the presents stay dry if we get caught in the rain. Bill: Speaking of rain, the forecast says a drizzle; that might help mask our footprints. Molly: Good point. Wet grass means fewer prints. Bill: I’ll wear the rubber boots I stole from the janitor’s closet. Molly: And I’ll wear a mask that looks like a superhero—so if anyone sees me, they’ll think I’m part of the party. Bill: Perfect. Let’s synchronize our watches to the beat of the music. Molly: One, two, three—now! Bill: The gate is open; we’re in. Molly: I’m heading for the table—there’s a stack of presents under the streamers. Bill: I’m covering the hallway, listening for any adult footsteps. Molly: Got a box—looks like a remote‑control car. Bill: Nice. I see a plush unicorn; I’ll grab that. Molly: The bike is huge; I’ll need both hands. Bill: I’ll help you lift it; we can carry it together. Molly: The timer on my watch says three minutes left. Bill: We’re almost done. Keep moving. Molly: I hear a parent calling a child—maybe they’re noticing something. Bill: Stay calm. I’ll distract them with a joke about the clown. Molly: The child is looking at the balloons; I’ll slip the sack behind the cake. Bill: The sack is full; let’s head to the side gate. Molly: The rain is starting; the ground is slick. Bill: Perfect for a quick exit. Molly: We’re out—mission accomplished. Bill: Let’s split the loot at the safe house. Molly: And maybe next time we target an adult party—less glitter. Bill: Let’s go now.
created_at: "2026-03-06T12:01:11.331Z" (3/6/2026, 12:01:11 PM)
model: "inception/mercury-2-20260304"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 6845
moderation_latency: (null)
generation_time: 7394
tokens_prompt: 77
tokens_completion: 3565
native_tokens_prompt: 60
native_tokens_completion: 6799
native_tokens_completion_images: (null)
native_tokens_reasoning: 5856
native_tokens_cached: 6
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0051129
router: (null)
provider_responses:
  0:
    endpoint_id: "c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
    id: "chatcmpl-f89b7665-4402-4dc3-bf27-01c87596c971"
    is_byok: false
    latency: 54
    model_permaslug: "inception/mercury-2-20260304"
    provider_name: "Inception"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1772798471-M3dtFhITaIFMQdbEnq5l"
upstream_id: "chatcmpl-f89b7665-4402-4dc3-bf27-01c87596c971"
total_cost: 0.0051129
cache_discount: 0.00000135
upstream_inference_cost: 0
provider_name: "Inception"
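The token and timing fields above can be combined into a few derived figures. The following is a minimal sketch, not part of the benchmark itself: the helper name `generation_stats` is hypothetical, `generation_time` is assumed to be in milliseconds (7394 matches the reported 7.4s duration), and `native_tokens_reasoning` is assumed to be included in `native_tokens_completion`.

```python
def generation_stats(native_tokens_completion, native_tokens_reasoning, generation_time_ms):
    """Derive throughput and reasoning share from the run metadata.

    Assumptions (not confirmed by the source): generation_time is in
    milliseconds, and reasoning tokens are a subset of completion tokens.
    """
    seconds = generation_time_ms / 1000.0
    return {
        # Native completion tokens produced per second of generation time.
        "tokens_per_second": native_tokens_completion / seconds,
        # Fraction of native completion tokens spent on reasoning.
        "reasoning_share": native_tokens_reasoning / native_tokens_completion,
        # Native completion tokens left over for the visible response.
        "non_reasoning_tokens": native_tokens_completion - native_tokens_reasoning,
    }


# Values copied from the metadata of this run.
stats = generation_stats(
    native_tokens_completion=6799,
    native_tokens_reasoning=5856,
    generation_time_ms=7394,
)
print(f"{stats['tokens_per_second']:.0f} tokens/s")
print(f"{stats['reasoning_share']:.1%} reasoning")
print(f"{stats['non_reasoning_tokens']} non-reasoning tokens")
```

Under these assumptions, most of the native completion tokens in this run went to reasoning rather than to the visible response.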
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false; words: 697
0.00% | Dialogue to Total Word Ratio | Ratio: 5.74%, Deviation: 64.26% | neededClean: false; wordsTotal: 697; wordsDialogue: 40
0.0000% | Overall
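The reported ratio and deviation can be reproduced from the word counts in the Meta Data column. This is a minimal sketch of that arithmetic only: the function names are hypothetical, the 70% and 500-word targets come from the scenario, and the way the evaluator counts dialogue words or converts a deviation into a score is not shown in the source and is not modeled here.

```python
def dialogue_ratio_report(words_dialogue, words_total, target_pct=70.0):
    """Return (ratio %, absolute deviation from target %) rounded to 2 places."""
    ratio_pct = 100.0 * words_dialogue / words_total
    deviation_pct = abs(target_pct - ratio_pct)
    return round(ratio_pct, 2), round(deviation_pct, 2)


def word_count_overshoot(words, target_words=500):
    """Return how far the word count is from the target, as a percentage of the target."""
    return round(100.0 * (words - target_words) / target_words, 2)


# Counts copied from the evaluation details of this run.
ratio, deviation = dialogue_ratio_report(words_dialogue=40, words_total=697)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")  # Ratio: 5.74%, Deviation: 64.26%

overshoot = word_count_overshoot(words=697)
print(f"Word count overshoot: {overshoot:.1f}%")  # Word count overshoot: 39.4%
```

The response was 697 words against a 500-word target, and only 40 of those words were counted as dialogue, which is consistent with both evaluators reporting 0.00%.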