NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-10-16-01JABHZ7CPBNKX4FHJZVR0Q9ZJ
Run Details
Model:
Llama 3.1 Nemotron 70B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
7.0s
Cost:
$0.000108
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
Warm light spilled from the bakery's display case, illuminating the sweet treats within. The scent of freshly baked croissants wafted through the air, enticing the sleepy town to wake up. "I heard the police found another clue at Willow Creek Manor." "Another clue, another dead end, if you ask me. That detective's been sniffing around for weeks." "Give him a break, the man's got a tough case. Especially with the victim being a... well, you know." "A recluse with more enemies than friends? Yeah, that narrows it down. Still, I think it's fishy the NYPD let him take lead. Small-town murder, small-town cops, if you ask me." "Hey, don't knock it till it works. Besides, Chef Élise seems to think he's on the right track." "Oh? What's the celebrated pastry chef know about police work?" "Just that she overheard him asking some very pointed questions about the catering staff... and the disputed cake order."
id
"gen-1729112284-7u9feyGfsmswzwQh7NM5"
upstream_id
"cmpl-RIQ0fpQRYMdD5goWxfszcqjT"
total_cost
0.0001084
cache_discount
(null)
created_at
"2024-10-16T20:58:11.86604+00:00"
10/16/2024, 8:58:11 PM
model
"nvidia/llama-3.1-nemotron-70b-instruct"
app_id
182717
streamed
true
cancelled
false
provider_name
"DeepInfra"
latency
72
moderation_latency
(null)
generation_time
6822
finish_reason
"stop"
tokens_prompt
103
tokens_completion
208
native_tokens_prompt
72
native_tokens_completion
208
native_tokens_reasoning
(null)
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0001084
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)