NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-07-14-01J2SC65HR3A59BJDRBYZJXJB1
Run Details
Model:
Claude 3.5 Sonnet
Test:
N-Length Sentences
Scenario:
Write sentences with 10 words each
Duration:
4.1s
Cost:
$0.002682
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
90.42%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 10 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The old mansion loomed ominously against the stormy night sky. Detective Sarah Mills pulled up, her headlights cutting the darkness. She stepped out, raindrops pelting her trench coat and fedora. A scream pierced the air, sending chills down her spine. Sarah rushed towards the house, hand instinctively reaching for her gun. The front door creaked open, revealing a dimly lit foyer. A trail of blood led upstairs, glistening in the moonlight. She ascended cautiously, her footsteps muffled by the plush carpet. At the top, a body lay motionless in a pool of crimson. Sarah knelt down, examining the corpse with a trained eye.
id
"gen-dYXYOKvP5lLdhxkM2ugVBmDXrLlH"
total_cost
0.002682
upstream_id
"msg_01FQ3DCEvYUrjum3RHHwTmv6"
created_at
"2024-07-14T19:40:52.213776+00:00"
7/14/2024, 7:40:52 PM
model
"anthropic/claude-3.5-sonnet:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
2976
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
42
tokens_completion
140
native_tokens_prompt
44
native_tokens_completion
170
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.002682
Evaluation details
Result
Evaluator
Details
Meta Data
90.42%
Matches word count
n/a
neededClean
false
sentences
10
wordCounts
0
10
1
10
2
10
3
10
4
11
5
10
6
10
7
10
8
12
9
10