NC Bench
Overview
About
Tests
Models
Model
Llama 3.2 11B (Vision)
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-09-26-01J8PYWWCJCWDVHY6W6425NPYE
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The sun beat down on the crowded children's party, the smell of sugary treats and sunscreen hanging heavy in the air. Molly and Bill, masked and dressed in colorful wigs, crouched behind a nearby inflatable bounce house, scanning the scene for vulnerabilities. "It's a kid's party," Bill whispered, "I'm sure there's somewhere to hide a stash of presents. A mom's gotta keep the little rugrats entertained, right?" "Unless they're just buying this as an excuse to drink a whole bottle of sparkling cider," Molly muttered, eyes scanning the tables. "Not exactly a holy war," Bill chuckled. "I found the storage room. Jackpot. It's got a display case, plus crates labeled 'RECESS SOCKS.'" Molly's eyes widened. "Sweet. Think we can crack this display case without some sad kid waking up and screaming?" "We'll need some creative hack work," Bill said, tucking a stray strand of his wig behind his ear. "Oh, I know. I'll just tell you the plan, and you whistle a little tune while I work my magic." "Ugh, fine. What's the plan?" "A hand-tooled defibrillator disguised as a Pin the Tail on the Donkey game piece," Bill said calmly, pulling a device from his breast pocket. "I refined it myself. I'm like a real-life phoenix from the ashes, minus the no-dying-part."
id
"gen-FaFIkgyT3N1zwlROGK3If2KbndpP"
total_cost
0.00005913
upstream_id
"chat-e0fd78bd0cff40bc9f693f143006c480"
created_at
"2024-09-26T10:44:16.531565+00:00"
Thursday, September 26th, 2024 at 10:44:16 AM GMT+00:00
model
"meta-llama/llama-3.2-11b-vision-instruct"
app_id
182717
streamed
true
cancelled
false
provider_name
"Together"
latency
142
moderation_latency
(null)
generation_time
1782
finish_reason
"stop"
tokens_prompt
68
tokens_completion
293
native_tokens_prompt
71
native_tokens_completion
294
native_tokens_reasoning
(null)
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00005913
Evaluation details
Result
Evaluator
Details
Meta Data
43.3783%
Matches word count
n/a
neededClean
false
words
217
1.1793%
Dialogue to Total Word Ratio
Ratio: 64.52%, Deviation: 14.52%
neededClean
false
wordsTotal
217
wordsDialogue
140
22.2788%