Model

Llama 3.2 11B (Vision)

Test

Dialogue tags

Scenario

Write 200 words with 50% dialogue

Run ID

2024-09-26-01J8PZ1MNYP47GJ27X8FWT01YE

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly hunched over the moldy pizza in the dingy slice joint, her eyes fixed on the violet-haired Barbie doll slumped on the face of the party venue's security schedule. "Okay, Bill, here's the plan – we get in through the service door at 3:14. I take point, grab the loot, and you cover my six." Bill poked at the pizza crust with a crusty breadstick, his Adam's apple bobbing as he swallowed. "Unless you can make me believe the ' six' thing doesn't sound like a superhero pose, I'm sticking with 'you get the presents, I watch the door'." Molly blinked for a beat, then shoved her hands up into the hood of her sweatshirt. "Fine, superhero pose or not, I do the Presents Gig, and we both get the opals and fully-loaded Nerf blasters. Just don't mess this up, 'Guardian of the Entryway'." Bill raised a vinyl finger at her, his forehead creasing. "Leafy greens tell the story, Molly – leave out the destinational disaster pixie evil rituals."
id"gen-eJKz8G7LOM95GnIz1tX6SuYB844i" 
total_cost0.000049734
upstream_id"chat-25aa2340efe2417cb25fe45225d5c1d9" 
created_at"2024-09-26T10:46:52.584327+00:00" Thursday, September 26th, 2024 at 10:46:52 AM GMT+00:00
model"meta-llama/llama-3.2-11b-vision-instruct" 
app_id182717
streamedtrue
cancelledfalse
provider_name"Together" 
latency377
moderation_latency(null)
generation_time1550
finish_reason"stop" 
tokens_prompt68
tokens_completion235
native_tokens_prompt71
native_tokens_completion236
native_tokens_reasoning(null)
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.000049734

Evaluation details

Result Evaluator Details Meta Data
0.0028% Matches word count n/a
neededCleanfalse
words168
83.2106% Dialogue to Total Word Ratio
Ratio: 56.55%, Deviation: 6.55%
neededCleanfalse
wordsTotal168
wordsDialogue95
41.6067%