NC Bench
Model
Writer: Palmyra X5
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2026-02-08-01KGYYGDYKRWYBQX2DHJHRWDAQ
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind a bush, binoculars pressed to her eyes. “You said no clowns, Bill.” “There aren’t any clowns,” Bill whispered, adjusting his fake mustache. “It’s just a theme. Look—balloons shaped like squirrels. No threat.” “Last time you said ‘no threat,’ we got chased by a hamster. A *hamster*, Bill!” “That hamster had issues. This is different. We’re dealing with cupcakes and piñatas, not rodent mafia.” “So you admit the hamster was organized crime?” “Focus, Molly! Twenty-three unopened presents under the ‘Happy Birthday, Kevin’ banner. Minimum value: nine juice boxes and a talking toilet.” “A *talking toilet*? That's just a prank toy. It says ‘flush me’ in a British accent.” “Exactly. Resale gold on eBay. Parents’ll pay anything for novelty plumbing.” “You’re pricing stolen toilet humor? Are we criminals or sad podcast hosts?” “We’re *entrepreneurs with felony tendencies*. Now, what’s the plan?” “Distract them with a fake magic show. You do the disappearing rabbit trick.” “I told you, I don’t do rabbits. They judge me.” “Then use a sock. Socks disappear all the time.” “Fine. But if I’m juggling sweaty socks, I want a bigger cut.” “You get the cut I give you. Now, is the getaway vehicle ready?” “The stroller’s primed. Looks suspiciously effective.” “Perfect. We strike at snack time. When their guard is down... and their hands are sticky.” “Like us, but socially acceptable.”
created_at
"2026-02-08T15:39:19.784028+00:00"
Sunday, February 8th, 2026 at 3:39:19 PM GMT+00:00
model
"writer/palmyra-x5-20250428"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
348
moderation_latency
153
generation_time
4413
tokens_prompt
61
tokens_completion
334
native_tokens_prompt
301
native_tokens_completion
348
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0022686
router
(null)
provider_responses
0
status
200
is_byok
false
latency
348
endpoint_id
"c684c4e7-1b77-4ca5-9516-7ecc66e2c455"
provider_name
"Amazon Bedrock"
model_permaslug
"writer/palmyra-x5-20250428"
api_type
"completions"
id
"gen-1770565154-X8WAkpK6uxzI7MOPRy6J"
upstream_id
(null)
total_cost
0.0022686
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result | Evaluator | Details | Meta Data
0.2141% | Matches word count | n/a | neededClean: false; words: 228
99.2119% | Dialogue to Total Word Ratio | Ratio: 92.98%, Deviation: 2.98% | neededClean: false; wordsTotal: 228; wordsDialogue: 212
49.7130%
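The dialogue figures above can be reproduced by hand: 212 dialogue words out of 228 total gives 92.98%, which is 2.98 percentage points from the 90% target. The sketch below shows one way such a ratio might be computed. It is not NC Bench's evaluator: the function names, the whitespace-based word split, and the rule that dialogue is any text inside double quotes are all assumptions made for illustration, so counts on a real response may differ from the reported ones. The last lines note that the final 49.7130% equals the arithmetic mean of the two evaluator results; that is an observation from the numbers, not a documented formula.

```python
import re

# Assumption: dialogue is whatever sits between curly or straight double quotes.
DIALOGUE_RE = re.compile(r'“([^”]*)”|"([^"]*)"')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (hypothetical word definition)."""
    return len(text.split())


def dialogue_ratio(scene: str) -> float:
    """Fraction of words in the scene that fall inside quoted dialogue."""
    total = count_words(scene)
    spoken = sum(count_words(a or b) for a, b in DIALOGUE_RE.findall(scene))
    return spoken / total if total else 0.0


def deviation_points(ratio: float, target: float = 0.90) -> float:
    """Absolute distance from the target ratio, in percentage points."""
    return abs(ratio - target) * 100


# Figures taken from the evaluation table above.
reported_ratio = 212 / 228
print(f"{reported_ratio:.2%}")                    # 92.98%
print(f"{deviation_points(reported_ratio):.2f}")  # 2.98

# The final percentage matches the mean of the two evaluator results.
print(f"{(0.2141 + 99.2119) / 2:.4f}")            # 49.7130
```

Calling `dialogue_ratio` on a short scene such as `Molly nodded. “Is the stroller ready?” “Yes,” Bill said.` counts 5 of 9 words as dialogue under these assumptions.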