NC Bench
Run: 2026-03-06-01KK1HKTQGTQE53FQ8VGPXYW4C
Run Details
Model: Inception Mercury 2
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 5.2s
Cost: $0.003629
Provider: Inception
Evaluation summary
Matches word count: 99.38%
Dialogue to Total Word Ratio: 43.87%
Overall: 71.6235%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The stale air of the attic smelled of pizza crust and old laundry detergent, a fitting backdrop for a duo whose specialty was stealing joy from unsuspecting kids. Molly, with a hairnet that looked like a bird’s nest, was hunched over a hand‑drawn map of a suburban cul‑de‑sac, while Bill, whose eyebrows could have been used as a ruler, was polishing a glittery plastic crowbar that had once been a party favor. Molly tapped the map with a pen and said, “The party’s at 3 p.m., the kids will be distracted by cake, and the parents will be busy pretending they can’t see the mess.” Bill chuckled, “And the presents are all in one room, stacked like a mountain of candy. We just have to roll in, grab the loot, and roll out before anyone notices the missing balloons.” The plan was simple enough: slip through the back gate, pose as a balloon‑artist, and use the crowd’s chaos as cover. Molly imagined herself inflating a dozen squeaky dogs, while Bill rehearsed a grin that could convince a toddler that a glittery hammer was a magic wand. Molly whispered, “If we get caught, we can blame the clown. He’s always the scapegoat.” Bill replied, “And if the clown runs away, we’ll have the whole party to ourselves. No one will suspect a pair of adults in oversized party hats.” They practiced the entrance in the attic, stepping over a stray cat that seemed to judge their every move. The cat, named Sir Whiskers, flicked its tail as if to say, “You’re not welcome.” Bill tossed a fake balloon animal into the air, and it landed with a soft thud on the wooden floor, making a sound that was oddly similar to a sigh of relief. Molly giggled, “We’ll be the only ones who get to open the gifts. 
Imagine the kids’ faces when they find their toys missing!” Bill smirked, “And the parents will spend the rest of the night Googling ‘how to replace lost presents.’ We’ll be legends.” The night before the party, they loaded the glittery crowbar, a sack of black‑glove gloves, and a pair of night‑vision goggles that looked like oversized sunglasses. The attic windows were cracked open, letting in a moonlit breeze that carried the distant sound of a children’s lullaby from a neighbor’s baby monitor. Molly said, “Remember, we only take the presents, not the cake. The cake is a liability; it’ll crumble if we touch it.” Bill answered, “Got it. We’ll be like ghosts—silent, swift, and leaving only a trail of confetti.” As the first light of dawn painted the sky pink, the two thieves slipped out, their footsteps muffled by the carpet of discarded party hats. They arrived at the cul‑de‑sac, where a rainbow of balloons bobbed in the wind like a lazy parade. Molly adjusted her hairnet, Bill checked his crowbar, and together they whispered a promise to return home with pockets full of toys and a story that would make even the most jaded detective laugh.
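The dialogue share of a scene like the one above can be estimated by counting the words that fall inside quotation marks. The sketch below is one plausible approach, not NC Bench's actual evaluator, which this run page does not show: the function name `dialogue_ratio`, the whitespace-based word split, and the quote-matching pattern are all assumptions made for illustration.

```python
import re

# Hypothetical helper: the real NC Bench evaluator is not shown on this page.
# Assumes dialogue is delimited by curly (“ ”) or straight (" ") double quotes.
QUOTED = re.compile(r'“([^”]*)”|"([^"]*)"')


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) using whitespace word splits."""
    total = len(text.split())
    dialogue = 0
    for match in QUOTED.finditer(text):
        # Exactly one of the two groups is populated, depending on quote style.
        spoken = match.group(1) if match.group(1) is not None else match.group(2)
        dialogue += len(spoken.split())
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


sample = 'Molly said, “Remember, we only take the presents.” Bill nodded.'
print(dialogue_ratio(sample))
```

A different tokenizer or quote convention would shift the counts, so this sketch is not guaranteed to reproduce the word totals reported for this run.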
created_at
"2026-03-06T12:23:01.624Z"
3/6/2026, 12:23:01 PM
model
"inception/mercury-2-20260304"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4804
moderation_latency
(null)
generation_time
5143
tokens_prompt
77
tokens_completion
3986
native_tokens_prompt
60
native_tokens_completion
4833
native_tokens_completion_images
(null)
native_tokens_reasoning
4164
native_tokens_cached
50
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0036285
router
(null)
provider_responses
0
endpoint_id
"c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
id
"chatcmpl-bc60f848-22fc-444d-aa6a-25de134e36c2"
is_byok
false
latency
56
model_permaslug
"inception/mercury-2-20260304"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799781-crPzmNMMgGUQq8D35CRG"
upstream_id
"chatcmpl-bc60f848-22fc-444d-aa6a-25de134e36c2"
total_cost
0.0036285
cache_discount
0.00001125
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result
Evaluator
Details
Meta Data
99.38%
Matches word count
n/a
neededClean
false
words
505
43.87%
Dialogue to Total Word Ratio
Ratio: 34.06%, Deviation: 4.06%
neededClean
false
wordsTotal
505
wordsDialogue
172
71.6235%
Overall
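The reported ratio and deviation can be checked directly from the metadata counts. The sketch below recomputes them and then tests one guess about the overall score: that it is the plain mean of the two evaluator scores. That averaging rule is an assumption, not something the page states, and the formulas that turn a deviation into the 99.38% and 43.87% scores are not shown here, so they are not reproduced.

```python
# Values copied from the evaluation details above.
words_total = 505
words_dialogue = 172
target_ratio = 0.30  # from the scenario: "Write 500 words with 30% dialogue"

ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 34.06%, Deviation: 4.06%

# Assumption: the overall score is the unweighted mean of the evaluator scores.
scores = [99.38, 43.87]  # rounded values as displayed on the page
overall = sum(scores) / len(scores)
print(round(overall, 3))  # 71.625
```

The mean of the displayed scores, 71.625, sits within 0.002 of the reported 71.6235%. That is consistent with an average taken over unrounded scores, but it does not confirm the rule.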