NC Bench
Overview
About
Tests
Models
Model
Qwen 2.5 72B
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-09-20-01J881DAXF66BMT1B30MK894GJ
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Are you sure this is the right house, Bill?" Molly whispered, squinting at the festive bunting. "Positive," Bill replied, adjusting his ski mask. "I scoped it out last night. The presents are under a giant pirate ship bouncy castle." "Seriously? A bouncy castle? How are we supposed to stealthily grab presents from a bouncy castle?" "We’ll have to bounce, of course. It’s the only way to reach them without looking suspicious." "Stealth through bouncing. Got it. What’s the plan once we get in?" "First, we distract the kids. I’ve got a bag of rubber chickens. They never fail." "Rubber chickens? That’s your big distraction?" "Trust me, they’ll be too busy laughing to notice us filling our backpacks." "Fine. But what if the parents catch us?" "We tell them it’s for a very important study on childhood joy and gift distribution. It’s foolproof." "Bill, you’re a genius. Or a lunatic." "Sometimes the line is blurry, Mol. Now, let’s bounce."
id
"gen-TZlBeKEnwSk4fTDtFFyfoyt0chQG"
total_cost
0.0001252
upstream_id
"chat-47ff090a292b447996a1859ad4c8e954"
created_at
"2024-09-20T15:39:40.236215+00:00"
Friday, September 20th, 2024 at 3:39:40 PM GMT+00:00
model
"qwen/qwen-2.5-72b-instruct"
app_id
182717
streamed
true
cancelled
false
provider_name
"Hyperbolic"
latency
319
moderation_latency
(null)
generation_time
5226
finish_reason
"stop"
tokens_prompt
78
tokens_completion
228
native_tokens_prompt
83
native_tokens_completion
230
native_tokens_reasoning
(null)
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0001252
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
158
99.9014%
Dialogue to Total Word Ratio
Ratio: 91.77%, Deviation: 1.77%
neededClean
false
wordsTotal
158
wordsDialogue
145
49.9507%