NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-31-01KSYX8109YSQ3V6868B5A6C4S
Run Details
Model:
Claude Opus 4.8 (Reasoning, High)
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
11.4s
Cost:
$0.014815
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
82.08%
Dialogue to Total Word Ratio
(details)
89.8515%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the hedge, binoculars pressed to her eyes, surveying the suburban battlefield that was the Henderson backyard. Balloons bobbed menacingly from the fence posts. A bounce castle loomed like an inflatable fortress, and somewhere within, a clown was juggling with the confidence of a man who'd never been arrested. Bill unrolled the blueprints—really just a crayon drawing he'd swiped from the mailbox—and spread them across the grass with the gravity of a general planning D-Day. "The presents are inside," he whispered. Molly nodded, scribbling notes. The gift table, according to her calculations, held approximately forty wrapped boxes, each containing untold treasures: action figures, dolls, possibly cash tucked into greeting cards by lazy aunts. The motherlode. Their plan was elegant in its stupidity. Bill would create a diversion involving a piñata and strategic chaos. Molly, disguised as a face-painting entertainer, would infiltrate the perimeter, load the loot into an oversized duffel bag, and vanish before the candles were even lit. She'd practiced her balloon animals for weeks. They all resembled tumors, but children weren't art critics. Bill checked his watch, sweat beading despite the autumn chill. "It's go time," he breathed. They rose in unison, two grown adults declaring war on a seven-year-old's birthday.
created_at
"2026-05-31T11:38:27.444Z"
5/31/2026, 11:38:27 AM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2873
moderation_latency
(null)
generation_time
11065
tokens_prompt
77
tokens_completion
421
native_tokens_prompt
93
native_tokens_completion
574
native_tokens_completion_images
(null)
native_tokens_reasoning
66
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.014815
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_019HJpCFmLwBVgCC4Kw3NwjH"
is_byok
false
latency
1181
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780227507-rS432ypI1g6LmdDHkMqv"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780227507-bDnrlixQhDN0n26o2G4J"
upstream_id
"msg_019HJpCFmLwBVgCC4Kw3NwjH"
total_cost
0.014815
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
207
82.08%
Dialogue to Total Word Ratio
Ratio: 3.33%, Deviation: 6.67%
neededClean
false
wordsTotal
210
wordsDialogue
7
89.8515%