NC Bench
Model
Writer: Palmyra X5
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2026-02-08-01KGYZ9BW52EDX1X9CQCJWXT37
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her ski mask in the rearview mirror. “You sure this is the right house? Looks like a piñata exploded in the yard.” Bill craned his neck at the rainbow streamers and a life-sized foam banana mascot slumped by the mailbox. “Definitely the one. Rich aunt’s kid’s birthday. Clown included. Clown means chaos. Chaos is our entry.” “Clown’s a liability! What if he recognizes us?” “He’s paid in pizza coupons, Molly. He won’t remember his own name.” “Still—kids, Bill! We’re robbing toddlers.” “Toddlers with trust funds. That bouncy castle probably costs more than our getaway go-kart.” “It’s not a go-kart, it’s a borrowed—” “Stolen lawn mower with streamers. Semantics. Now, plan: I distract with the kazoo, you sneak in through the ball pit.” “The ball pit? That thing’s a death trap. Last time I went in one, I lost an earring and my dignity.” “This time you’ll lose dignity *and* twenty-seven wrapped presents.” “What if they have a unicorn? Real or inflatable, I’m not dealing with that.” “Unicorns hate shiny things. Just don’t wear the tiara.” “I’m wearing the tiara. It’s part of my tactical aesthetic.” “Fine. But no singing ‘Happy Birthday’ when you’re stuffing loot into the diaper bag.” “Only if you promise not to join the limbo contest.” “...How low do you think I’ll go for a cover story?” “Real low, Bill. Real, real low.”
created_at
"2026-02-08T15:52:57.464979+00:00"
Sunday, February 8th, 2026 at 3:52:57 PM GMT+00:00
model
"writer/palmyra-x5-20250428"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
269
moderation_latency
185
generation_time
4297
tokens_prompt
61
tokens_completion
340
native_tokens_prompt
301
native_tokens_completion
343
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0022386
router
(null)
provider_responses
0
status
200
is_byok
false
latency
269
endpoint_id
"c684c4e7-1b77-4ca5-9516-7ecc66e2c455"
provider_name
"Amazon Bedrock"
model_permaslug
"writer/palmyra-x5-20250428"
api_type
"completions"
id
"gen-1770565971-rHxGGU4L4ssaJLxNno4G"
upstream_id
(null)
total_cost
0.0022386
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0028% | Matches word count | n/a | neededClean: false; words: 232
99.8169% | Dialogue to Total Word Ratio | Ratio: 87.93%, Deviation: 2.07% | neededClean: false; wordsTotal: 232; wordsDialogue: 204
49.9099%
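The reported ratio and deviation can be reproduced from the two word counts in the evaluation details. The sketch below is illustrative only: the function names are hypothetical, the 90% target is taken from the scenario text, and the quote-matching rule used to count dialogue words is an assumption rather than NC Bench's actual evaluator, so it may not yield exactly 204 and 232 on the response above. How the 99.8169% result is derived from the deviation is not stated in the source and is not modelled here.

```python
import re


def ratio_percent(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the passage, as a percentage."""
    return 100.0 * words_dialogue / words_total


def deviation_percent(ratio: float, target: float = 90.0) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(target - ratio)


def count_words(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words) for a passage.

    Assumption: a word is a whitespace-separated token, and dialogue is
    any text enclosed in curly or straight double quotes.
    """
    total = len(text.split())
    spoken = re.findall(r'[“"]([^”"]*)[”"]', text)
    dialogue = sum(len(segment.split()) for segment in spoken)
    return dialogue, total


# Counts reported in the evaluation details: wordsDialogue=204, wordsTotal=232.
ratio = ratio_percent(204, 232)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation_percent(ratio):.2f}%")
```

With the reported counts this prints the same ratio and deviation as the Details column. The unlabelled 49.9099% that closes the table is close to the mean of the two result values, though the source does not say so and that reading is an assumption.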