NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1JANKKAC90GGR01G29B9W9
Run Details
Model:
Inception Mercury 2
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
6.6s
Cost:
$0.003797
Provider:
Inception
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
33.33%
Dialogue to Total Word Ratio
(details)
16.6663%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The old community center smelled like stale pizza and crayons. A crooked banner of rainbow balloons hung over the entrance, and a squeaky clown horn drifted from inside. Molly leaned against the cracked brick wall, hair in a messy bun, grinning like a crescent moon. Bill shuffled his oversized coat, flapping like a sail, and adjusted night‑vision goggles on his nose. They had been hired by a mysterious client called ‘The Elf’. The job: infiltrate a children’s birthday party at 2 a.m., slip past parents, cake, and overexcited kids, and walk away with every wrapped present. The reward was a stack of gold bars enough to make a candy store jealous. Molly tapped the map of the center, tracing the route from the loading dock to the party room. “The back door is guarded by a security guard who thinks he’s a superhero,” she whispered, trying not to laugh. Bill raised an eyebrow. “A superhero, huh? Does he wear a cape or just a badge?” They both knew the stakes: if caught, the police would treat them like mischievous toddlers. The plan required precision, timing, and fake frosting. Molly pulled a tiny bag of glitter, glittering like a thousand fireflies, and tossed it into the air. “We’ll need a distraction,” she said, eyes sparkling. Bill nodded, already racing through possible alibis. All set. Molly: "First, we need a distraction. I will bring a box of glitter that explodes into confetti when we open it." Bill: "Great, and I’ll rig the punch bowl to fizz like a soda fountain. The kids will think it’s a magic potion." Molly: "We also need a fake cake. I’ve already baked a sponge that looks like chocolate but is actually just foam." Bill: "Perfect. I’ll pose as the clown and juggle rubber chickens while the kids chase the glitter." Molly: "Meanwhile, I’ll slip through the side door with a sack of my own. The guard will be too busy watching the clown’s pratfalls." Bill: "What about the parents? They’ll be on the phone, scrolling through memes." Molly: "Exactly. I’ll set up a fake Wi‑Fi hotspot called ‘Free Candy’ to keep them glued to their screens." Bill: "And the presents? I’ll stack them in a cart and push it out the back while the kids are distracted by the glitter storm." Molly: "We must move fast. The party ends at 3 a.m., and the cleaning crew arrives at 3:15." Bill: "I’ve got a timer on the glitter bomb. Ten seconds after the blast, we make our exit." Molly: "If anything goes wrong, we blame the clown’s bad jokes." Bill: "Deal. Let’s meet at the loading dock at 1:45 and grab the loot before sunrise."
created_at
"2026-03-06T12:35:30.043Z"
3/6/2026, 12:35:30 PM
model
"inception/mercury-2-20260304"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6201
moderation_latency
(null)
generation_time
6519
tokens_prompt
77
tokens_completion
3894
native_tokens_prompt
60
native_tokens_completion
5058
native_tokens_completion_images
(null)
native_tokens_reasoning
4461
native_tokens_cached
50
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00379725
router
(null)
provider_responses
0
endpoint_id
"c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
id
"chatcmpl-c773cc85-5c1f-424d-b5d7-164c9d6c0f02"
is_byok
false
latency
45
model_permaslug
"inception/mercury-2-20260304"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772800530-OIUlGffjAExZK27QgVrP"
upstream_id
"chatcmpl-c773cc85-5c1f-424d-b5d7-164c9d6c0f02"
total_cost
0.00379725
cache_discount
0.00001125
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
448
33.33%
Dialogue to Total Word Ratio
Ratio: 54.69%, Deviation: 4.69%
neededClean
false
wordsTotal
448
wordsDialogue
245
16.6663%