NC Bench
Model: Lumimaid v0.2 8B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Run ID: 2024-09-20-01J881MZ34K8CFCPRX88W9P7JX
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
"Alright, let me see," Bill said, slowly pacing around the dusty storage room, his eyesnoĆci "Damn it, where did I put that stupid crowbar?" "Huh?" Asked Molly, placing down the backpack she was rummaging through, turning to look at Bill. "Where was I just looking? You? If you find it, you'll see!" "Oh no, I don't think it would be a good idea to just let anyone find it. " Bill chuckled, "Ah, quit being a pain. Come on, let's support you for a second while you test your height against it." Molly rubbed her hands together as she helped lift up the large ladder up, fiddling with the crowbar. She said "This, huh?" "Yes. This is it! Now empty that backpack!" "Huh?" "We need the space for the presents we're gonna steal. Take out all of your nuts, cookies and seeds! I want a lot of space for the presents! It's almost a children's party." "Ah, of course!" Molly cried, reaching into her bag and pulling out some nuts, tossing them one by one into her hand and then swallowing them. "Our plan depends on your hunger! This is it!" "Oops, we've got a small problem! Oh no!" "What is it?" Bill asked. "This totally new thing, oh yeah! But yeah, uh, w-where was I?" Molly sputtered. "I don't know, but the exact same here!" "I'm trying to find the crowbar again!" "Huh? How does that work again? First you can't find it and now you lost it again, eh?" "I'm just looking for it, don't make such jokes, uh?" "Hey"nameless, put the crowbar down because look up!" Molly turned around, looking up to see the large light above them. She shook her head. "Well, sorry for that. How goes along?" "Same here. I'm just tired. So should we pack up and be more efficient afterwards, if possible, to make sure we have what we need for the heist?" "Oh, absolutely," Molly agreed "Let's see if we could find a bag that won't get caught." "Easy it, there's that nylon bag over here!" "Grab it. I'm almost ready!" 
Molly turned around to make sure she had the last few items left in her backpack and saw Bill holding the nylonsack. "Oh shoot!" "Whoops, put the bag down!" "WHY?" asked Bill. "Sometimes we keep things you don't want in its hands. Uh-huh. Come here. I'll help you." Bill automatically put the bag down as Molly started going through the items that were in it. "Let's save time and see if there's important things we need!" "Ah, I've got some trackers...some explosives, a compass. Fancy. Some food. Oh, oh there's that crowbar." "I do love the detail," Bill said. "Yeah, shut up!" "Oh, rest assured that all my jokes have a purpose. Now the question is: do we use the rulers to navigate ourselves through the maze? Or is there a more creative way?" "You'll answer that later, it's night. Check in during the day, wish you were a part of the heist! Bye!"
id: "gen-ocnS2Glxgj0dZjbtDbuU7f755rrJ"
total_cost: 0.000795
upstream_id: "5fdbf8e0CSnn9eHT"
created_at: "2024-09-20T15:43:57.706563+00:00" (Friday, September 20th, 2024 at 3:43:57 PM GMT+00:00)
model: "neversleep/llama-3.1-lumimaid-8b"
app_id: 182717
streamed: true
cancelled: false
provider_name: "Mancer"
latency: 328
moderation_latency: (null)
generation_time: 12460
finish_reason: "stop"
tokens_prompt: 102
tokens_completion: 695
native_tokens_prompt: 72
native_tokens_completion: 696
native_tokens_reasoning: (null)
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.000795
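The timing and token fields above can be combined into a rough throughput figure. The sketch below assumes that generation_time is reported in milliseconds and that native_tokens_completion is the right token count to use; neither is stated on this page, and the function name is hypothetical.

```python
def tokens_per_second(tokens_completion: int, generation_time_ms: int) -> float:
    """Rough decode throughput; assumes generation_time is in milliseconds."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return tokens_completion / (generation_time_ms / 1000)


# Values copied from the metadata of this run.
rate = tokens_per_second(tokens_completion=696, generation_time_ms=12460)
print(f"{rate:.1f} tokens/s")  # prints "55.9 tokens/s"
```

If the unit assumption is wrong, the figure scales accordingly, so treat it as an estimate only.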
Evaluation details
Result | Evaluator | Details | Meta Data
99.9190% | Matches word count | n/a | neededClean: false; words: 503
0.0003% | Dialogue to Total Word Ratio | Ratio: 54.08%, Deviation: 15.92% | neededClean: false; wordsTotal: 503; wordsDialogue: 272
49.9597%
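The ratio and deviation reported for the dialogue evaluator can be reproduced from the word counts in its meta data. The sketch below is a minimal illustration, not the benchmark's own evaluator: the function names are hypothetical, and counting dialogue as whitespace-separated words inside straight double quotes is an assumption. It does not attempt to reproduce how the percentage scores in the Result column are derived.

```python
import re

TARGET_RATIO = 0.70  # the scenario asks for 70% dialogue


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words).

    Assumption: dialogue is whatever sits between pairs of straight
    double quotes, and words are whitespace-separated tokens.
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    return words_dialogue, words_total


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return the dialogue ratio and its absolute distance from the target."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return ratio, abs(ratio - target)


# Word counts copied from the evaluator's meta data for this run.
ratio, deviation = ratio_and_deviation(words_dialogue=272, words_total=503)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 54.08%, Deviation: 15.92%"
```

A quote-pair regex like this miscounts when a quotation mark is unbalanced, as happens in this response, so a production evaluator would likely need more careful parsing.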