NC Bench
Run: 2024-07-14-01J2SC65JAM0MXM7TMBGD90N51
Run Details
Model: WizardLM 2 8x22b
Test: N-Length Sentences
Scenario: Write sentences with 10 words each
Duration: 8.1s
Cost: $0.000179
Provider: Novita
Evaluation summary
Score: 51.75%
Evaluator: Matches word count
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 10 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The moon hung low, casting eerie shadows across the manicured lawn. Detective Lila Grey stepped over the threshold, her boots echoing in the silent hall. A crystal chandelier swayed gently, the only movement in the still mansion. The victim lay sprawled, a crimson stain spreading beneath him. Glass shards glinted like diamonds around the body, reflecting the cold moonlight. Footsteps sounded behind Lila; her partner, Sergeant Dan Brooks, joined the scene. "Time of death?" she inquired, her gaze never leaving the corpse. Dan checked his watch, his face grim. "Around midnight, we suspect," he replied, kneeling beside the body. Lila's eyes narrowed, analyzing every detail, as a detective should. The room felt heavy with secrets, the air thick with the scent of death. "We have a murder to solve," Lila declared, her resolve unwavering.
id: "gen-4zygm66RMEHy2MkEN8t3BtbUnMwj"
total_cost: 0.00017856
upstream_id: "cmpl-a47f3a02de974c1ab5f1e19e6ad26108"
created_at: "2024-07-14T19:40:56.390535+00:00" (7/14/2024, 7:40:56 PM)
model: "microsoft/wizardlm-2-8x22b"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Novita"
latency: 6390
moderation_latency: (null)
generation_time: 0
finish_reason: "stop"
tokens_prompt: 69
tokens_completion: 184
native_tokens_prompt: 75
native_tokens_completion: 204
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.00017856
Evaluation details
Result: 51.75%
Evaluator: Matches word count
Details: n/a
Meta Data:
neededClean: false
sentences: 12
wordCounts: [11, 14, 12, 10, 12, 12, 11, 7, 10, 10, 14, 11]
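
The per-sentence word counts in the evaluation metadata above can be reproduced with a simple splitter. The following is a minimal sketch, not the evaluator NC Bench actually runs: the function names, the sentence-splitting rule (break on whitespace that follows ".", "!" or "?"), and the strict exact-match metric are all assumptions made for illustration. The page does not show how the 51.75% score is derived. A strict exact-match fraction over the reported counts gives 25% (3 of 12 sentences have exactly 10 words), so the real evaluator presumably awards partial credit for near misses.

```python
import re


def sentence_word_counts(text: str) -> list[int]:
    """Count whitespace-separated tokens per sentence.

    Assumption: a sentence ends at ., ! or ? when followed by whitespace.
    A question mark inside a closing quote (e.g. '?" she') does not split.
    """
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    return [len(s.split()) for s in sentences]


def exact_match_fraction(counts: list[int], target: int) -> float:
    """Hypothetical strict metric: share of sentences with exactly `target` words."""
    if not counts:
        return 0.0
    return sum(c == target for c in counts) / len(counts)


# Word counts as reported in the evaluation metadata for this run.
reported = [11, 14, 12, 10, 12, 12, 11, 7, 10, 10, 14, 11]

# Two consecutive sentences taken from the model response.
sample = (
    'Dan checked his watch, his face grim. '
    '"Around midnight, we suspect," he replied, kneeling beside the body.'
)

print(sentence_word_counts(sample))        # [7, 10]
print(exact_match_fraction(reported, 10))  # 0.25
```

The sample sentences match entries 7 and 8 of the reported wordCounts (7 and 10 words), which suggests the evaluator also counts whitespace-separated tokens, although that is not confirmed by the page.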