NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1H1D0G51QDRQW1CEKZQPSK
Run Details
Model:
Inception Mercury
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
19.4s
Cost:
$0.000418
Provider:
Inception
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The moon hung low over the suburban cul-de-sac, a silver lantern that cast shadows on the hedged lawns. Inside the house, the air was thick with the scent of frosting and fresh balloons, the faint hum of a children's playlist drifting through the open windows. Molly slipped through the back gate, her sneakers silent on the gravel, and paused by the kitchen window, where she could see the family gathered around the table, laughing and swapping gifts. Bill waited a few feet away, perched on a low fence, his eyes darting between the house and the streetlights. He was a lanky fellow with a crooked grin, always ready to turn a simple heist into a comedy of errors. He called out, his voice low and conspiratorial, but the words were muffled by the rustling leaves. Molly: 'You sure this is a good idea? We could get caught by the kids, the parents, or the police.' Bill: 'Relax, Molly. I've got a plan. We'll slip in through the back door, use the garden hose as cover, and then grab the presents while the parents are distracted by the cake.' Molly: 'That sounds risky. What about the alarm system? The house has a motion detector that triggers a loud siren.' Bill: 'I've disabled it with EMP device. It will fry the sensors for twenty minutes, giving us a window.' Molly: 'We need a distraction. Maybe we can set off the fireworks in the backyard to draw the kids away.' Bill: 'Good idea. I have a stash of sparklers. We'll light them up, and while the kids run toward the glow, we can sneak in.' The plan was set in motion as the clock ticked toward five-thirty. Molly slipped through the back door, the garden hose spraying a fine mist that masked her footsteps. Bill followed, his EMP device humming softly. They split up at the hall, each heading toward a different side of the house. Molly headed for the living room, where the presents were stacked under the Christmas tree, while Bill made his way to the kitchen, where the cake and the glittery confetti lay. The children, oblivious to the thieves, chased each other around the toys, their laughter echoing through the rooms. The parents, distracted by the cake cutting, barely noticed the sudden rustle of the presents being lifted. As the clock struck six, Molly and Bill slipped out, the loot tucked into their bags, and vanished into the night. Back at the safe house, they celebrated with a stash of chocolate, laughing about the absurdity of their scheme. The night was the city lights twinkling far below, and the thrill of the heist lingered in their hearts, promising adventures to come today.
created_at
"2026-03-06T12:12:57.751Z"
3/6/2026, 12:12:57 PM
model
"inception/mercury"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
18897
moderation_latency
(null)
generation_time
19345
tokens_prompt
77
tokens_completion
648
native_tokens_prompt
72
native_tokens_completion
552
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0004176
router
(null)
provider_responses
0
endpoint_id
"da1082ea-721e-460e-b330-034ae7aefc7f"
id
"chatcmpl-7390772d-27d5-4bc9-974d-6ae39271b811"
is_byok
false
latency
117
model_permaslug
"inception/mercury"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799177-cdzICV5mqqRPBQQJXaOI"
upstream_id
"chatcmpl-7390772d-27d5-4bc9-974d-6ae39271b811"
total_cost
0.0004176
cache_discount
0.0000144
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
453
0.00%
Dialogue to Total Word Ratio
Ratio: 7.68%, Deviation: 22.32%
neededClean
false
wordsTotal
456
wordsDialogue
35
0.0000%