NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-08-01KGYZ0Z70440ZJQ5877ZE73Q9
Run Details
Model:
Claude Haiku 4.5
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
4.3s
Cost:
$0.001645
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
45.2419%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
# The Birthday Caper Molly tapped her pen against the clipboard, studying the intel. "Okay, so the target is 847 Maple Street. Twenty-three kids. Estimated forty to sixty wrapped packages." "That's a lot of loot," Bill whispered, adjusting his fake mustache nervously. "What about security?" "The parents will be distracted by cake. Two golden retrievers, but I've got bacon." She pulled out several strips. "We slip in through the side door at 2 PM sharp, during present-opening pandemonium." Bill frowned. "What if someone recognizes us?" "We'll wear those ridiculous clown wigs. Nobody looks twice at party entertainers." Molly circled the living room on the floor plan. "You take the south wall, I'll handle the north. We meet at the kitchen, load the van, and we're gone in seven minutes." "Seven minutes?" Bill's eye twitched. "Eight, if there's a piƱata," Molly conceded. "But Bill, we can't get sentimental. These toys could be worth thousands on the black market." Bill pulled out the getaway route map. "What about the dog walker on Oak Street?" "She's at the dog park until 3 PM. We're golden." He extended his hand. "Let's do this thing."
created_at
"2026-02-08T15:48:21.460299+00:00"
2/8/2026, 3:48:21 PM
model
"anthropic/claude-4.5-haiku-20251001"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
352
moderation_latency
(null)
generation_time
4066
tokens_prompt
61
tokens_completion
270
native_tokens_prompt
75
native_tokens_completion
314
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.001645
router
(null)
provider_responses
0
id
"msg_01QfaP1UvRkmTDtZgvdehek5"
status
200
is_byok
false
latency
352
endpoint_id
"41d2915a-92e6-4993-b537-210b4e10cba8"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.5-haiku-20251001"
api_type
"completions"
id
"gen-1770565696-T4wMLZYsqSjlyDsjwfri"
upstream_id
"msg_01QfaP1UvRkmTDtZgvdehek5"
total_cost
0.001645
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
190
0.00%
Dialogue to Total Word Ratio
Ratio: 72.40%, Deviation: 22.40%
neededClean
false
wordsTotal
192
wordsDialogue
139
45.2419%