NC Bench
Run: 2026-03-06-01KK1HFY116DCQV95X5JXT51QV
Run Details
Model: Inception Mercury 2
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 5.9s
Cost: $0.003651
Provider: Inception
Evaluation summary
Score     Evaluator
35.00%    Matches word count
79.81%    Dialogue to Total Word Ratio
57.4080%  Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The night was warm, the kind of summer evening that made the cicadas sound like a cheap soundtrack for a cartoon. A white picket fence surrounded a house where a banner read “Happy Birthday, Timmy!” in glittering letters. Balloons bobbed in the gentle breeze, and a table overflowed with cupcakes, candy, and a mountain of wrapped gifts that glimmered under the porch light. Molly crouched behind a garden gnome, her dark hair hidden under a baseball cap, while Bill adjusted his oversized sunglasses, despite the darkness. Molly: “You see that cake? It’s a decoy. The real treasure is under the gift table.” Bill: “Right, but the kids will be screaming if we trip the clown balloon arch.” Molly: “I’ve got a plan. We slip in through the side gate, hide behind the inflatable pool, and grab the presents before anyone notices.” Bill: “What about the parents? They’ll be busy with the piñata and the karaoke machine.” Molly: “Exactly. The noise will mask our footsteps. I’ll use the kids’ own party favors as a distraction.” Bill: “I’ve got a gadget that can silence a squeaky toy for exactly ten seconds. That should be enough to snatch the biggest gift box.” Molly: “Don’t forget the glitter bomb. If the kids start crying, we’ll just throw it in the air and watch the sparkles distract them.” Bill: “And the parents will think it’s a prank. They’ll blame the clown for the missing presents.” Molly: “We need a getaway car. I’ve arranged a bright pink scooter that looks like a kid’s toy. No one will suspect a thief on a scooter.” Bill: “Perfect. I’ll drive it to the back gate, and you’ll hop on with the loot. We’ll meet at the oak tree after the cake is cut.” Molly: “One more thing: the party favors are all wrapped in foil. We can use the foil to make a quick mirror to signal each other.” Bill: “Got it. 
And if anything goes wrong, we’ll blame the magician’s disappearing act.” Molly: “Let’s make this the most unforgettable birthday heist ever.” The clock struck seven, and the backyard erupted in a chorus of squealing children and occasional balloon pops. Molly slipped through the side gate, her sneakers barely making a sound on the grass, while Bill perched on the inflatable pool, his sunglasses reflecting the fairy lights. The glitter bomb, tucked in his pocket, trembled. As the clown began his juggling, Molly slipped a foil mirror into her pocket, catching the moon’s glimmer. She nudged the gift table, and the wrapped boxes tipped like dominoes, spilling candy onto the lawn. Bill seized the biggest box, its ribbon fluttering like a surrender flag. Suddenly, the magician appeared, pulling a rabbit from a hat, and the kids gasped. In the chaos, Molly tossed the glitter bomb; a shower of confetti rained down, obscuring the escape route. They sprinted toward the oak tree, the pink scooter humming behind them, while the parents blamed the clown for the missing presents, never suspecting the two adults who vanished into the night. The moon smiled, as if approving their mischief tonight.
created_at: "2026-03-06T12:20:53.929Z"
model: "inception/mercury-2-20260304"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 5488
moderation_latency: (null)
generation_time: 5869
tokens_prompt: 77
tokens_completion: 3023
native_tokens_prompt: 58
native_tokens_completion: 4864
native_tokens_completion_images: (null)
native_tokens_reasoning: 4198
native_tokens_cached: 49
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.003651475
router: (null)
provider_responses:
  0:
    endpoint_id: "c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
    id: "chatcmpl-1fe774ca-d51f-475b-bde4-bafb5c34fd16"
    is_byok: false
    latency: 49
    model_permaslug: "inception/mercury-2-20260304"
    provider_name: "Inception"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1772799653-jjsn9a5XAouqNyMWfPlC"
upstream_id: "chatcmpl-1fe774ca-d51f-475b-bde4-bafb5c34fd16"
total_cost: 0.003651475
cache_discount: 0.000011025
upstream_inference_cost: 0
provider_name: "Inception"
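The token and timing fields above can be combined into a rough throughput figure. The sketch below assumes that generation_time is in milliseconds (consistent with the 5.9s duration reported in the run details) and that native_tokens_reasoning is counted inside native_tokens_completion; the page does not state either, and the helper name is hypothetical.

```python
# Values copied from the metadata above.
generation_time_ms = 5869        # assumed to be milliseconds
native_tokens_completion = 4864
native_tokens_reasoning = 4198   # assumed to be part of the completion count


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Average generation rate over the whole request."""
    return tokens / (elapsed_ms / 1000)


# Tokens left for the visible answer, if reasoning tokens are a subset of
# native_tokens_completion (an assumption, not something the page states).
visible_tokens = native_tokens_completion - native_tokens_reasoning

rate = tokens_per_second(native_tokens_completion, generation_time_ms)
print(f"{rate:.0f} tokens/s, {visible_tokens} non-reasoning tokens")
```

Under these assumptions most of the completion budget went to reasoning rather than to the scene itself, which is worth keeping in mind when comparing cost across models.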
Evaluation details
Result    Evaluator                     Details                          Meta Data
35.00%    Matches word count            n/a                              neededClean: false; words: 518
79.81%    Dialogue to Total Word Ratio  Ratio: 47.88%, Deviation: 2.12%  neededClean: false; wordsTotal: 518; wordsDialogue: 248
57.4080%  Overall
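The reported ratio and deviation can be reproduced from wordsDialogue and wordsTotal. The following is a minimal sketch, not the benchmark's actual evaluator: the function names are hypothetical, and counting words inside curly double quotes is only one plausible way to measure dialogue (it would also count quoted non-dialogue text such as the banner in the scene). How the 35.00% and 79.81% scores are derived from the raw counts is not shown on the page, so the sketch does not attempt it.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside curly double quotes (an assumed dialogue rule)."""
    quoted_spans = re.findall(r"“([^”]*)”", text)
    return sum(count_words(span) for span in quoted_spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = 0.5) -> tuple[float, float]:
    """Return the dialogue share and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts taken from the Meta Data column above.
ratio, deviation = ratio_and_deviation(248, 518)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 47.88%, Deviation: 2.12%"

# The overall score is close to the plain mean of the two displayed scores.
overall = (35.00 + 79.81) / 2
```

The mean of the two displayed scores is 57.405, slightly below the 57.4080% shown as Overall; the gap is presumably due to rounding in the displayed per-evaluator scores, though the page does not confirm how the overall figure is computed.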