NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1H778PC9PEQ7WKQWQDDMQ2
Run Details
Model:
Inception Mercury 2
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
5.5s
Cost:
$0.003265
Provider:
Inception
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.9995%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The night before the neighborhood’s annual birthday bash, the two thieves met in the dimly lit garage behind the community center. The air smelled of oil and stale pizza, and a single flickering bulb cast long shadows on the concrete floor. Molly, a wiry woman with a scar across her left cheek, paced back and forth, while Bill, a lanky man with a perpetual grin, tapped his fingers on the steering wheel of an old sedan. They had scoped the party, noting bright balloons, a candy table, and the unsuspecting crowd. Molly: The kids will be distracted by the clown, so we slip in through the side door. Bill: Got it. The side door is unlocked because the staff thinks it's a service entrance. Molly: We need a distraction for the parents. I’ll set off the fire alarm at 6:15. Bill: Perfect. The fire alarm will clear the hall, and the kids will be ushered outside. Molly: Meanwhile, I’ll slip a bag of confetti into the gift table to make a mess. Bill: That mess will keep the staff busy while we grab the presents. Molly: The presents are in three big boxes marked ‘Happy Birthday!’ We just need to lift them. Bill: I’ve got a dolly in the trunk. We can roll the boxes straight to the van. Molly: The van is parked two blocks away, hidden behind a dumpster. No one will suspect a party‑crasher. Bill: We should wear the clown costume they’re using for the entertainment. It’ll give us cover. Molly: I’ll borrow a red nose and oversized shoes from the costume rack. It’s perfect. Bill: And I’ll take the oversized bow tie. It’ll make us look like part of the act. Molly: What about the kids’ eyes? We need to keep them calm. Bill: I’ll bring a bag of balloons. We’ll hand them out as ‘goodie bags’ to keep the kids occupied. Molly: Great. The kids will love balloons, and the parents will think we’re part of the party. Bill: We need a timeline. Fire alarm at 6:15, chaos at 6:20, exit at 6:30. Molly: I’ll set the alarm on my phone. I can trigger it remotely. Bill: And I’ll monitor the security cameras from the van. If anything looks off, we abort. Molly: The gift table is near the cake. We’ll grab the boxes before anyone notices. Bill: The cake will be cut at 6:25, so the crowd will be focused on that. Molly: After we load the boxes, we’ll slip out the same side door. No one will see us. Bill: And the clown costume will help us blend in as the entertainment crew leaves. Molly: We’ll stash the presents in the van’s trunk, then drive to the warehouse. Bill: At the warehouse, we’ll sort the toys, sell the high‑value ones, and keep the rest for later. Bill: Then it’s settled. Let’s get ready, and may the balloons be with us. They exchanged a grin, slipped into the night, and vanished before sunrise.
created_at
"2026-03-06T12:16:08.477Z"
3/6/2026, 12:16:08 PM
model
"inception/mercury-2-20260304"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
5048
moderation_latency
(null)
generation_time
5418
tokens_prompt
77
tokens_completion
3499
native_tokens_prompt
58
native_tokens_completion
4335
native_tokens_completion_images
(null)
native_tokens_reasoning
3699
native_tokens_cached
4
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00326485
router
(null)
provider_responses
0
endpoint_id
"c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
id
"chatcmpl-4c1b9080-3768-42b6-ae6f-cf3b165bf216"
is_byok
false
latency
106
model_permaslug
"inception/mercury-2-20260304"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799368-cwqP1JDJX88oXZV7ATwx"
upstream_id
"chatcmpl-4c1b9080-3768-42b6-ae6f-cf3b165bf216"
total_cost
0.00326485
cache_discount
9e-7
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
499
0.00%
Dialogue to Total Word Ratio
Ratio: 3.41%, Deviation: 66.59%
neededClean
false
wordsTotal
499
wordsDialogue
17
49.9995%