NC Bench
Overview
About
Tests
Models
Model
Claude 3.7 Sonnet
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2025-02-24-01JMX27BC1PW8K1THR3Y92JNC7
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill hunched over a worn kitchen table, spreading out a crude crayon drawing of a suburban house. The late afternoon sun filtered through dusty blinds, casting prison-bar shadows across their conspiratorial faces. "I still think this is low, even for us," Bill muttered, adjusting his thick-framed glasses. Molly rolled her eyes. "Low? It's brilliant. Little Timmy Henderson's fifth birthday party. Twenty-five kids equals at least forty presents. Do the math, Bill." "I can do math just fine," Bill snapped, tapping his pen against the paper. "I'm just saying, stealing from kids feels... I don't know... extra wrong?" "Extra profitable," Molly corrected, twirling a strand of her purple hair. "Besides, these are Oakwood kids. Their parents will just buy them replacement gifts, probably better ones out of guilt." Bill sighed deeply. "So what's the plan? We just waltz in dressed as clowns?" "God no. Clowns get noticed. We go in as caterers." Molly pointed to a side entrance marked on their diagram. "Kitchen door. I scouted it yesterday pretending to deliver flyers. The mom's hiring a taco truck, but they'll need people to serve." "And you know this how?" "Facebook event page. Public. Amateur move." Molly smirked. "We get in, we locate the present table, and we create a diversion." Bill leaned back in his chair. "What kind of diversion? Please don't say fire." "Nothing so dramatic. I'm thinking food poisoning." "You want to poison children?!" Bill's voice cracked. "Not poison-poison. Just... urgent bathroom needs. I've got some stuff that'll work through a punch bowl. Twenty minutes later, chaos erupts." Molly's fingers wiggled to illustrate the panic. "While mommy dearest is handling two dozen kids with tummy troubles, we load the gifts into those catering boxes and exit through the side gate." Bill pinched the bridge of his nose. "This is wrong on so many levels." "You said that about the nursing home job too, and that went fine." "Mrs. Abernathy caught us! She chased me with her walker!" "But we got away," Molly countered. "And those prescription meds sold nicely." Bill studied the crayon map. "What about the dad? The drawing shows a pretty big guy." "Working. Birthday falls on a Tuesday. Capitalism wins again." "Security system?" "Basic. I can disable it when we arrive with the 'extra napkins' from the 'catering company,'" Molly said, making air quotes. Bill stared at his partner for a long moment. "Fine. But I want it noted that I think this job stinks." "Noted," Molly said cheerfully, rolling up their plans. "And if it makes you feel better, we'll leave the card with the twenty bucks from his grandma. Nobody needs that kind of guilt." "How thoughtful of you," Bill muttered. "What time does this heist of moral bankruptcy begin?" "Party starts at three. We arrive at two-thirty as the advance team." Molly stood and stretched. "Cheer up, partner. After this, we can afford that security system bypass kit you've been drooling over." Bill allowed himself a small smile. "The Phantom 5000?" "The very one," Molly confirmed. "Well," Bill said, straightening the plans, "at least Timmy won't remember his fifth birthday party."
id
"gen-1740437171-4OFgP0dxpD4trs65iF0G"
upstream_id
"msg_01Sd1wiTNMUQJ6ZZL4fL1Cz6"
total_cost
0.012765
cache_discount
(null)
provider_name
"Anthropic"
created_at
"2025-02-24T22:46:29.117787+00:00"
Monday, February 24th, 2025 at 10:46:29 PM GMT+00:00
model
"anthropic/claude-3-7-sonnet-20250219:beta"
app_id
182717
streamed
true
cancelled
false
latency
469
moderation_latency
(null)
generation_time
16193
tokens_prompt
68
tokens_completion
731
native_tokens_prompt
75
native_tokens_completion
836
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.012765
Evaluation details
Result
Evaluator
Details
Meta Data
2.0116%
Matches word count
n/a
neededClean
false
words
525
0.0000%
Dialogue to Total Word Ratio
Ratio: 71.81%, Deviation: 21.81%
neededClean
false
wordsTotal
525
wordsDialogue
377
1.0058%