NC Bench
Model: EVA Qwen 2.5 14B
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Run ID: 2024-10-01-01J94RCM07Z2FZV9262Z2CYW24
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced back and forth, a maniacal grin on her face. Bill leaned against the wall, his arms crossed, a slight smirk playing on his lips. Molly: "Just imagine it, Bill. Little tykes playing, unaware of the party pooper in their midst." She twirled an imaginary lollipop. Bill: "And when the rain of presents pelts down from the ceiling? This will be our greatest crime of all!" Molly: "Au contraire, mon ami. The real joy comes from seeing their faces when we sweep in to... Bill: "*Right.*" He rolled his eyes. "Calming thoughts, Molly." Molly: "The mothers, innocent birds with their nests. We'll show them what humanity's really like." Bill: "You're a mean one, Molly Wun." Molly: "Ah, yes. The expressions of shock, deceit, and depression on those little faces. Oh, the villainies we could scheme!" Bill: "Uhh, *right*... but how do you propose we get in without... you know..." Molly: "Hall pass from a rebellious teacher, duh. Now, Bill, it's time to discuss our exit strategy. Because we're so going to—" Bill: "All right, calm down. Remember, we're not villains here. We're simply... misdirecting a situation." Molly: "A terrible plight, darling. Tell that to the parents forced to come up with new presents. *Argh!*" She collapsed onto the sofa, kicking her feet in frustration.
id: "gen-1727810392-pAGx1Kuj8OwR6RQ09OcZ"
total_cost: 0.0001825
upstream_id: "chatcmpl-bb1b24c7-1467-49c9-a463-58e1bc55886f"
created_at: "2024-10-01T19:20:10.576402+00:00" (Tuesday, October 1st, 2024 at 7:20:10 PM GMT+00:00)
model: "eva-unit-01/eva-qwen-2.5-14b"
app_id: 182717
streamed: true
cancelled: false
provider_name: "Infermatic"
latency: 192
moderation_latency: (null)
generation_time: 17836
finish_reason: "stop"
tokens_prompt: 67
tokens_completion: 329
native_tokens_prompt: 72
native_tokens_completion: 329
native_tokens_reasoning: (null)
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0001825
Evaluation details
Result | Evaluator | Details | Meta Data
60.2752% | Matches word count | n/a | neededClean: false, words: 215
0.0000% | Dialogue to Total Word Ratio | Ratio: 41.86%, Deviation: 48.14% | neededClean: false, wordsTotal: 215, wordsDialogue: 90
30.1376%
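
The arithmetic behind the reported figures can be checked with a short Python sketch. It assumes the target dialogue share is the 90% named in the scenario, that the deviation is the absolute gap in percentage points, and that the final 30.1376% is the unweighted mean of the two evaluator results. These readings match the numbers shown, but the page does not confirm them, and the function names are hypothetical. How the 60.2752% word-count score is derived is not stated, so it is taken as given.

```python
# Sketch only: reproduces the reported figures under the assumptions above.
# Function names are hypothetical, not taken from NC Bench.

def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Share of words that are dialogue, as a percentage."""
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 90.0) -> float:
    """Absolute gap between the measured ratio and the target, in points."""
    return abs(target_pct - ratio_pct)


def mean_score(scores: list[float]) -> float:
    """Unweighted mean of the evaluator results (assumed aggregation)."""
    return sum(scores) / len(scores)


# Values copied from the evaluation details.
ratio = dialogue_ratio(words_dialogue=90, words_total=215)
deviation = deviation_from_target(ratio)
overall = mean_score([60.2752, 0.0])

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Mean of evaluator results: {overall:.4f}%")
```

Under these assumptions the sketch agrees with the page: 90 of 215 words gives a ratio of 41.86%, which is 48.14 points short of the 90% target, and the mean of the two results is 30.1376%.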