Model

Claude 3.7 Sonnet

Test

Dialogue tags

Scenario

Write 200 words with 50% dialogue

Run ID

2025-02-24-01JMX0H52W6AE4C78W51MAGDCY

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly crouched behind the bush, binoculars trained on the rainbow-colored bounce house. Bill slid beside her, unfolding a crude crayon drawing of the backyard. "Sixteen presents in the gazebo," Molly whispered. "Pink table. Unattended." Bill scratched his stubble. "Guards?" "One clown, three moms, and a photographer." "The clown worries me." Molly lowered her binoculars. "He's making balloon animals. Hardly Ocean's Eleven." "Remember the magician last time? My eyebrows still haven't grown back." "That was different. This is just a five-year-old's birthday." Bill pointed to a spot on his drawing. "We enter through the side gate during the cake cutting. Maximum distraction." "Then we stuff the presents into these princess backpacks," Molly added, patting her disguise. "Isn't this... morally questionable?" "Says the man who robbed a nun." "She wasn't a real nun!" Molly rolled her eyes. "Look, we grab, we go. Twenty seconds max." "What about the pinata?" "Leave it." "But it's shaped like Spider-Man." "Focus, Bill." He sighed, zipping up his stolen Chuck E. Cheese employee vest. "Let's ruin some childhood memories."
id: "gen-1740435396-3JNwMP4HJs11cUAm9Wlc"
upstream_id: "msg_01Ke7CN1Dr3ajTBt8r6Gv1kT"
total_cost: 0.005235
cache_discount: (null)
provider_name: "Anthropic"
created_at: "2025-02-24T22:16:42.694924+00:00" (Monday, February 24th, 2025 at 10:16:42 PM GMT+00:00)
model: "anthropic/claude-3-7-sonnet-20250219:beta"
app_id: 182717
streamed: true
cancelled: false
latency: 394
moderation_latency: (null)
generation_time: 5404
tokens_prompt: 68
tokens_completion: 262
native_tokens_prompt: 75
native_tokens_completion: 334
native_tokens_reasoning: 0
num_media_prompt: (null)
num_media_completion: (null)
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.005235
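
The reported cost can be checked against the native token counts. The sketch below assumes list prices of 3 USD per million input tokens and 15 USD per million output tokens; these rates are not stated in the record and are an assumption, as is the helper name `estimate_cost`.

```python
# Assumed prices in USD per million tokens (not part of the record above).
INPUT_USD_PER_MTOK = 3.00
OUTPUT_USD_PER_MTOK = 15.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return the estimated request cost in USD for the given token counts."""
    micro_usd = (prompt_tokens * INPUT_USD_PER_MTOK
                 + completion_tokens * OUTPUT_USD_PER_MTOK)
    return micro_usd / 1_000_000


# native_tokens_prompt and native_tokens_completion from the record
cost = estimate_cost(75, 334)
print(f"{cost:.6f}")  # prints 0.005235
```

Under those assumed rates the result agrees with the `total_cost` and `usage` fields, which suggests billing is based on the native token counts rather than on `tokens_prompt` and `tokens_completion`.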

Evaluation details

Result | Evaluator | Details | Meta Data
3.6234% | Matches word count | n/a | neededClean: false, words: 176
1.7059% | Dialogue to Total Word Ratio | Ratio: 64.20%, Deviation: 14.20% | neededClean: false, wordsTotal: 176, wordsDialogue: 113

2.6646%
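
The ratio and deviation in the details column follow directly from the two word counts in the metadata. The sketch below reproduces that arithmetic and adds a simple quote-based counter. Both `dialogue_stats` and `count_dialogue_words` are hypothetical helpers: the evaluator's real tokenization and cleaning rules are not shown in this record, so the counter is only one plausible way to obtain such counts.

```python
import re


def dialogue_stats(words_dialogue, words_total, target_ratio=0.5):
    """Return (ratio, deviation) of dialogue words against a target share."""
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target_ratio)
    return ratio, deviation


def count_dialogue_words(text):
    """Count whitespace-separated words inside straight double quotes.

    Assumption: dialogue is delimited by plain '"' characters, as in the
    response above. Returns (dialogue_words, total_words).
    """
    total = len(text.split())
    spoken = sum(len(q.split()) for q in re.findall(r'"([^"]*)"', text))
    return spoken, total


# Counts reported in the metadata: wordsDialogue=113, wordsTotal=176
ratio, deviation = dialogue_stats(113, 176)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints Ratio: 64.20%, Deviation: 14.20%

# Toy input to show the counter; not the evaluator's own method.
sample = 'Molly sighed. "Focus, Bill." He nodded.'
print(count_dialogue_words(sample))  # prints (2, 6)
```

The response overshoots the requested 50% dialogue share by about 14 percentage points, and at 176 words it also falls short of the 200-word target.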