NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1H03DVB6GTG5EAKK766ETB
Run Details
Model:
Inception Mercury 2
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
13.7s
Cost:
$0.008916
Provider:
Inception
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.9998%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The night was with with frosting and balloon strings, the backyard lit by a garden lamp. Molly and Bill crouched behind a hedge, their faces half-hidden by a pair of sunglasses that looked more like clown glasses. The children's party was a chaos of giggles, a cake shaped like a dinosaur, and a mountain of wrapped gifts piled in a corner like a treasure trove. Molly pulled a crumpled flyer from her pocket, the only clue to the location: 'Maggie's 6th Birthday Bash – 3 PM – 123 Sprinkler Lane.' Bill adjusted his black mask, his voice low but eager. They had scoped the house earlier, noting the open windows and the distracted parents glued to a karaoke machine. Now they whispered their plan, each line a mix of absurdity and meticulous detail. The air smelled of sugar and mischief, and the distant sound of a squeaky toy added a soundtrack. Molly: 'Alright, Bill, the cake is a dinosaur, but the real prize is the mountain of presents.' Bill: 'Got it. We need to get past the piñata guard.' Molly: 'The kids think it's a monster, but it's just a papier‑mâché thing.' Bill: 'So we distract them with a fake balloon pop.' Molly: 'I have a squeaky toy that sounds like an alarm.' Bill: 'Perfect. I’ll set off the karaoke track at full volume.' Molly: 'While they’re dancing, I’ll slip in through the garden gate.' Bill: 'The gate is unlocked; the neighbor’s cat left it ajar.' Molly: 'And the presents are stacked near the clown car.' Bill: 'We need a cart, a wheelbarrow, something to haul them.' Molly: 'I brought a collapsible wagon from the garage.' Bill: 'Great, but we have to avoid the glitter‑covered floor.' Molly: 'I’ll wear shoes with anti‑slip soles.' Bill: 'And I’ll wear gloves to keep the glitter off my hands.' Molly: 'What about the parents? They’re glued to the karaoke.' Bill: 'We’ll hide behind the cake table and pop out when the chorus hits.' Molly: 'The song is “Baby Shark,” right? That’s three minutes of distraction.' Bill: 'Exactly. We’ll have seconds to grab the top box.' Molly: 'The top box is the one with the superhero cape.' Bill: 'That’s the biggest one, worth the most points.' Molly: 'We need to be quick. The kids will notice if the pile shrinks.' Bill: 'I’ll coordinate the timing. On my count, we move quickly.' Bill: 'I’m in. The wagon is rolling.' Molly: 'I’m scooping the cape box, careful not to knock over the cupcakes.' Molly: 'That’s the cue. I’m slipping out the back door.' Bill: 'We have to leave immediately before the birthday song ends.' Bill: 'If we linger, the parents will see us and the party will turn into a police raid.' Molly: 'Let’s make a clean exit. No glitter left behind.' Bill: 'All right. Let’s vanish now.' Molly: 'And remember, no one ever suspects a clown for a heist, seriously.' Bill: 'That’s the secret sauce. We’re the only thieves who can pull off a birthday caper today.'
created_at
"2026-03-06T12:12:15.17Z"
3/6/2026, 12:12:15 PM
model
"inception/mercury-2-20260304"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
13230
moderation_latency
(null)
generation_time
13649
tokens_prompt
77
tokens_completion
5472
native_tokens_prompt
60
native_tokens_completion
11870
native_tokens_completion_images
(null)
native_tokens_reasoning
11161
native_tokens_cached
6
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00891615
router
(null)
provider_responses
0
endpoint_id
"c2a24e34-c306-4b4d-9e2f-b162c2d8bf0e"
id
"chatcmpl-4e7812cb-7955-4b69-9055-3bd9ee8c3bab"
is_byok
false
latency
101
model_permaslug
"inception/mercury-2-20260304"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799135-gn9exPZyY5zo4lSB1eWd"
upstream_id
"chatcmpl-4e7812cb-7955-4b69-9055-3bd9ee8c3bab"
total_cost
0.00891615
cache_discount
0.00000135
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
499
0.00%
Dialogue to Total Word Ratio
Ratio: 54.40%, Deviation: 15.60%
neededClean
false
wordsTotal
500
wordsDialogue
272
49.9998%