NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1HH0FEQ9W867TQX2783MG6
Run Details
Model:
Inception Mercury
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
12.4s
Cost:
$0.000436
Provider:
Inception
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Stand outside the glittering house, Molly and Bill crouched behind a hedgerow, a mischievous grin on each face. The party lights flickered from the windows, and the scent of fresh lemonade drifted through the air. They were ready to execute the most audacious heist of their careers. Molly: Bill, are you sure we can get past the guard dog? I heard it barks at anyone who smells like a clown. Bill: Yeah, I’ve got a plan. We’ll use the old sprinkler system to drown the pup in foam. No one will notice. Molly: And what about the kids? They’re all around, laughing and playing. We need to keep them busy. Bill: I’ve invited a magician to perform a quick trick. He’ll distract them while we slip into the gift room. Molly: That’s brilliant! And the presents are stacked in the back of the house, right? We can grab them all and disappear. Bill: Exactly. We’ll use the inflatable bunny as a cover. When we climb onto it, the kids will think it’s a new game. Molly: I’ll bring a bag of glitter to hide the footprints. The kids will be too busy with the confetti to notice. Bill: And I’ll have a bag of chocolate bars to sweeten the deal. The kids will love them, and we’ll be out before they know what hit them. Molly: I’m nervous, but I trust you. Let’s do this. Bill: Let’s go. They slipped through the back door, the sprinkler system activated, and the foam covered the garden. The guard dog fell asleep, and the magician’s trick worked, drawing the children away. Molly: Did we get everything? Bill: Check the inventory. I see ten boxes, twelve ribbons, and a giant stuffed unicorn. Molly: Perfect. Let’s head to the exit. Bill: I’ll keep the door open while you carry the gifts. Molly: And I’ll hide behind the curtain to avoid the security guard. Bill: Great plan. We’re almost there. They made it out of the house, the gifts tucked under their coats, and the guard dog snored in the yard. The children were none the, and the party continued as if nothing had happened. Molly: We did it! The presents are ours. Bill: And we stayed under the radar. That’s the mark of a true heist. They walked away into the night, the moon shining on their faces, and the promise of another adventure waiting in the shadows. The moon rose higher, casting a pale glow over the city. Molly and Bill laughed softly, the adrenaline still coursing through their veins. They had pulled off the impossible, and the night felt like a victory lap for good today.
created_at
"2026-03-06T12:21:29.207Z"
3/6/2026, 12:21:29 PM
model
"inception/mercury"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
11983
moderation_latency
(null)
generation_time
12353
tokens_prompt
77
tokens_completion
656
native_tokens_prompt
71
native_tokens_completion
577
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0004361
router
(null)
provider_responses
0
endpoint_id
"da1082ea-721e-460e-b330-034ae7aefc7f"
id
"chatcmpl-47cfde90-6b92-4aae-a3c7-b2f47da1a3e6"
is_byok
false
latency
67
model_permaslug
"inception/mercury"
provider_name
"Inception"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799689-s9hQNEQSgc4F4knYT3jD"
upstream_id
"chatcmpl-47cfde90-6b92-4aae-a3c7-b2f47da1a3e6"
total_cost
0.0004361
cache_discount
0.0000144
upstream_inference_cost
0
provider_name
"Inception"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
443
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
443
wordsDialogue
0
0.0000%