NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1HAZRTQAJWK9H31Z4ZS3K8
Run Details
Model:
Inception Mercury 2
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
7.3s
Cost:
$0.004925
Provider:
Inception
Evaluation summary
Score
Evaluator
Details Link
0.49%
Matches word count
(details)
29.33%
Dialogue to Total Word Ratio
(details)
14.9134%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched behind the hedgerow bordering the community park, the muffled sounds of squealing children and birthday music drifting over the fence. The low sun cast long shadows, turning the garden party into a secret carnival. Molly adjusted her backpack strap, feeling the lock‑picking set and the sack for loot. Bill checked his watch, eyes flickering between the clock and the inflatable castle where kids bounced. He whispered, "We’ve got ten minutes before parents clean up." The air smelled of frosting and fresh grass, a festive‑yet‑illicit mix. They had scouted the gift table, the security camera, and the catering route earlier. Now, with the party in swing, the perfect window opened. Bill: "Alright, Molly, the cake table is right under the blue balloon arch. We slip in through the side gate, grab the presents, and slip out before anyone notices." Molly: "What about the kids? They're all over the place, dancing, crying, chasing the clown." Bill: "Kids are a distraction, not a problem. The clown's juggling pins are perfect for creating a diversion." Molly: "You think the clown will be busy enough? We need a solid alibi." Bill: "I'll pretend to be a parent looking for the restroom. While I chat with the clown, you can slide the sack under the table." Molly: "And the security camera? It's perched on the fence, pointing at the gift table." Bill: "I have a portable jammer. One quick pulse, and the feed goes dark for thirty seconds." Molly: "Thirty seconds? That's a tight window." Bill: "Exactly. That's why we move fast. You grab the boxes, I handle the jam." Molly: "What about the parents? They'll be checking the kids' bags later." Bill: "We plant a fake gift box with a note that says 'Sorry, we ran out of candy.' They'll think it's a mix‑up." Molly: "Clever. And the real loot goes straight into the sack." Bill: "Don't forget the spare key to the kitchen. The catering staff will bring the big cake there, and we can slip in through the service door." Molly: "Got it. I’ll keep an eye on the kids' balloon animals. If they start floating away, we abort." Bill: "Abort? No, we just adjust. The balloon animals are tethered, they won't drift far." Molly: "Alright, let's do a quick run‑through. You distract the clown, I grab the sack, we jam, we loot." Bill: "Sounds good. Ready when you are." Molly: "One... two... three... go!" Bill: "If the kids start screaming, we just shout 'surprise!' and pretend it's part of the act." Molly: "Brilliant, that'll buy us a few seconds." Bill: "And the clown? He'll think we're part of the entertainment crew." Molly: "Exactly. We'll wear the same neon vests they gave the staff." Bill: "Alright, on my count—three, two, one—now!" Molly: "If the security guard shows up, we slip behind the cake table and hide under the streamers." Bill: "The guard will be distracted by the balloon pop contest we set up." Molly: "Perfect. We'll be invisible among the ribbons." Bill: "And after we grab the loot, we exit through the garden gate, where the kids' parents will be too busy taking photos."
created_at
"2026-03-06T12:18:11.874Z"
3/6/2026, 12:18:11 PM
model
"inception/mercury-2-20260304"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6850
moderation_latency
(null)
generation_time
7248
tokens_prompt
77
tokens_completion
3370
native_tokens_prompt
60
native_tokens_completion
6549
native_tokens_completion_images
(null)
native_tokens_reasoning
5820
native_tokens_cached
6
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0049254
router
(null)
provider_responses
0
endpoint_id
"c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
id
"chatcmpl-d629d86f-b6d7-46b5-a312-602264e2ba3c"
is_byok
false
latency
61
model_permaslug
"inception/mercury-2-20260304"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799491-8UPP09fDTLZ5cpRIoK9y"
upstream_id
"chatcmpl-d629d86f-b6d7-46b5-a312-602264e2ba3c"
total_cost
0.0049254
cache_discount
0.00000135
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result
Evaluator
Details
Meta Data
0.49%
Matches word count
n/a
neededClean
false
words
527
29.33%
Dialogue to Total Word Ratio
Ratio: 74.95%, Deviation: 4.95%
neededClean
false
wordsTotal
527
wordsDialogue
395
14.9134%