NC Bench
Overview
About
Tests
Models
Model
Magnum v2 72B
Test
Dialogue tags
Scenario
Write 200 words with 10% dialogue
Run ID
2024-10-01-01J94R9HDG80V3D6BEJCQ3H707
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly studied the map of the backyard, her brow furrowed in concentration. "Okay, here's the plan. We go in through the back gate at 10 PM sharp, right as the clowns are finishing their final set." Bill, idly picking at his fingernails, glanced up. "Won't the kids be awake? Seems like poor timing." "Bill, think!" Molly smacked his arm. "The parents will likely be drunk off their wine coolers by then. Easy peasy." "Good point," Bill conceded. "Now, we need a disguise. Maybe nursery rhyme characters?" Bill nodded thoughtfully. "I could be Old King Cole, and you..." "A cooly dressed woman?" Molly suggested. "Uhh, sure. We'll be in the party favor bags faster than Humpty Dumpty fell off that wall."
id
"gen-1727810291-Sy0vmZ457J0YrxDk6UTd"
total_cost
0.0010665
upstream_id
"chatcmpl-8720da04-3969-4b88-96dc-196ebf14062a"
created_at
"2024-10-01T19:18:40.631128+00:00"
Tuesday, October 1st, 2024 at 7:18:40 PM GMT+00:00
model
"anthracite-org/magnum-v2-72b"
app_id
182717
streamed
true
cancelled
false
provider_name
"Infermatic"
latency
3445
moderation_latency
(null)
generation_time
25760
finish_reason
"stop"
tokens_prompt
67
tokens_completion
176
native_tokens_prompt
72
native_tokens_completion
177
native_tokens_reasoning
(null)
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0010665
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
120
0.0000%
Dialogue to Total Word Ratio
Ratio: 74.17%, Deviation: 64.17%
neededClean
false
wordsTotal
120
wordsDialogue
89
0.0000%