NC Bench
Overview
About
Tests
Models
Model
Llama 3.1 Euryale 70B v2.2
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-09-20-01J881E6BZ0065T6VW6NPG2X97
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Alright, so the party is at noon on Saturday?" Molly asked, studying the floor plan of the house. "That's right. And it's going to be a massacre of color and sugar," Bill replied, leaning back in his chair. "A massacre? It's just a kid's birthday party, Bill." "Geez, you've never been to one of those things. It's a war zone. Glitter, cake, screaming kids high on punch..." "We can handle it. Now, the challenge here will be the gifts." Molly pointed to the playroom on the plan. "That's where the mother lode is. But how do we get to it?" "I've thought about that. We come in through the backyard just before noon, dresses as caterers - you know, bow ties, silly hats, the works." "And then?" "Then we politely offer to help set up the food table. Once that's up, we excuse ourselves, and voila, we're in the house and headed to the loot." "That could work. And then what, dress up as bouncy clowns and escape with presents?" "Close... but moldy old boxing experience, we hire some clowns. I've got some guy from the short. He's got ounces and likes the peppermint schnapper." "It's schnapphire, Bill. Can he deliver?" "Yep. He's got a dozen waiting ben 'round the eyes. All we have to do is make a hollow in the sock and we're golden."
id
"gen-8ZWITSm5NAQsEV4x8ml6qJy19XhI"
total_cost
0.0005745
upstream_id
"chatcmpl-69ea4f37-111b-4413-980c-5866f28c2b13"
created_at
"2024-09-20T15:40:39.004316+00:00"
Friday, September 20th, 2024 at 3:40:39 PM GMT+00:00
model
"sao10k/l3.1-euryale-70b"
app_id
182717
streamed
true
cancelled
false
provider_name
"Infermatic"
latency
66
moderation_latency
(null)
generation_time
36190
finish_reason
"stop"
tokens_prompt
102
tokens_completion
310
native_tokens_prompt
72
native_tokens_completion
311
native_tokens_reasoning
(null)
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0005745
Evaluation details
Result
Evaluator
Details
Meta Data
2.0116%
Matches word count
n/a
neededClean
false
words
225
99.9980%
Dialogue to Total Word Ratio
Ratio: 89.33%, Deviation: 0.67%
neededClean
false
wordsTotal
225
wordsDialogue
201
51.0048%