NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-08-01KGYZ9BW52EDX1X9CQCJWXT37
Run Details
Model:
Writer: Palmyra X5
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
5.0s
Cost:
$0.002239
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
0.21%
Matches word count
(details)
99.82%
Dialogue to Total Word Ratio
(details)
50.0155%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her ski mask in the rearview mirror. “You sure this is the right house? Looks like a piñata exploded in the yard.” Bill craned his neck at the rainbow streamers and a life-sized foam banana mascot slumped by the mailbox. “Definitely the one. Rich aunt’s kid’s birthday. Clown included. Clown means chaos. Chaos is our entry.” “Clown’s a liability! What if he recognizes us?” “He’s paid in pizza coupons, Molly. He won’t remember his own name.” “Still—kids, Bill! We’re robbing toddlers.” “Toddlers with trust funds. That bouncy castle probably costs more than our getaway go-kart.” “It’s not a go-kart, it’s a borrowed—” “Stolen lawn mower with streamers. Semantics. Now, plan: I distract with the kazoo, you sneak in through the ball pit.” “The ball pit? That thing’s a death trap. Last time I went in one, I lost an earring and my dignity.” “This time you’ll lose dignity *and* twenty-seven wrapped presents.” “What if they have a unicorn? Real or inflatable, I’m not dealing with that.” “Unicorns hate shiny things. Just don’t wear the tiara.” “I’m wearing the tiara. It’s part of my tactical aesthetic.” “Fine. But no singing ‘Happy Birthday’ when you’re stuffing loot into the diaper bag.” “Only if you promise not to join the limbo contest.” “...How low do you think I’ll go for a cover story?” “Real low, Bill. Real, real low.”
created_at
"2026-02-08T15:52:57.464979+00:00"
2/8/2026, 3:52:57 PM
model
"writer/palmyra-x5-20250428"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
269
moderation_latency
185
generation_time
4297
tokens_prompt
61
tokens_completion
340
native_tokens_prompt
301
native_tokens_completion
343
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0022386
router
(null)
provider_responses
0
status
200
is_byok
false
latency
269
endpoint_id
"c684c4e7-1b77-4ca5-9516-7ecc66e2c455"
provider_name
"Amazon Bedrock"
model_permaslug
"writer/palmyra-x5-20250428"
api_type
"completions"
id
"gen-1770565971-rHxGGU4L4ssaJLxNno4G"
upstream_id
(null)
total_cost
0.0022386
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
0.21%
Matches word count
n/a
neededClean
false
words
228
99.82%
Dialogue to Total Word Ratio
Ratio: 87.93%, Deviation: 2.07%
neededClean
false
wordsTotal
232
wordsDialogue
204
50.0155%