NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1H4W6CH6ME966ZMTGSH4C9
Run Details
Model:
Inception Mercury 2
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
5.6s
Cost:
$0.003460
Provider:
Inception
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
95.98%
Dialogue to Total Word Ratio
(details)
97.9912%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The night before the birthday bash, Molly and Bill hid behind a towering stack of colorful balloons, the scent of cake drifting from the open gate. They checked their tools: a remote to mute music, squeaky shoes to mask footsteps, and a bag for the loot. A mischievous grin spread across Molly’s face as she imagined the chaos of a party turned into a heist. Molly: "We slip through the clown tunnel, silent as mice." Bill: "Squeaky shoes will distract the kids, they’ll quickly chase the sound." Molly: "I’ll mute the music with the remote, no jingle." Bill: "Then we grab the gift pile, stack it in the bag." Molly: "If a child sees us, we claim we’re magicians." Bill: "The birthday girl gets a fake present, we keep the real ones." Molly: "We exit through the cake table, slip into the van." Bill: "Speed is key, no lingering, the party ends at midnight." Bill: "And the kids will think Santa was late, funny enough." They tiptoed out, the bag bulging with stolen toys, while the party erupted in giggles and cake crumbs. The night was a comedy of crimes, and the twins vanished into the street, humming a tune.
created_at
"2026-03-06T12:14:51.603Z"
3/6/2026, 12:14:51 PM
model
"inception/mercury-2-20260304"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
5422
moderation_latency
(null)
generation_time
5515
tokens_prompt
77
tokens_completion
2755
native_tokens_prompt
58
native_tokens_completion
4609
native_tokens_completion_images
(null)
native_tokens_reasoning
4347
native_tokens_cached
49
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.003460225
router
(null)
provider_responses
0
endpoint_id
"c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
id
"chatcmpl-b780c94a-7aec-4e81-adf2-cb32ac73ec88"
is_byok
false
latency
49
model_permaslug
"inception/mercury-2-20260304"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799291-OEcD5qmIiwgy2iyuZloF"
upstream_id
"chatcmpl-b780c94a-7aec-4e81-adf2-cb32ac73ec88"
total_cost
0.003460225
cache_discount
0.000011025
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
95.98%
Dialogue to Total Word Ratio
Ratio: 45.50%, Deviation: 4.50%
neededClean
false
wordsTotal
200
wordsDialogue
91
97.9912%