NC Bench
Overview
About
Tests
Models
Model
EVA Qwen 2.5 14B
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2024-10-01-01J94RFV8FK4REG4C63S05P3H2
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped her fingers nervously on the polished wooden table, a small pile of maps and blueprints fanned out before her. Bill sat across, his bald head glistening in the overhead lights as he fished a cigarette from his jacket pocket. "Right then," Molly began, smoothing a crease in the creased map with twitching fingers. "So we're infiltrating as clown and bunny, got it?" Bill rolled his eyes, taking a long drag from his smoke. "Why the fuck do I have to be the bunny?" He stubbed it out, flicking burnt ash onto the floor. "I look like a bloody Easter striped sausage already." Molly ignored him, her large glasses sliding down her nose as she studied the blueprint. "Okay, entry point - stairwell at the back. Drop altitude, about twelve feet. Might wanna lay down some bubble wrap for your fall, _Bunny_." She mock coughed, grinning. Bill kicked his feet up on the table. "Just tell me straight - I don't land right and hurt myself, I'm seriously considering castrating you." "Well, it's a party," Molly said, scowling. "You're highly likely to end up rolling around in the confetti anyway - perfect cover." His glare softened. "Look, you know I'm not used to...concealing. Especially while thinking. What am I really meant to say when the kid pulls me into a grizzly hug?" "Be yourself, Bill. You're a fucking ham. No one's gonna be expecting a dud." Bill scowled, snapping the blueprint shut. "Your plan would've been better if a real bunny was hired." Molly bristled but before she could respond, her stomach growled. "Do you think they'll have cake?" Which wasn't really what she was getting at but it was hard to focus on details with her zipped up handbag at the front of her mind. "Birthday cake. I bet it's small, punch-sized cakes three high in those massive muffin trays..." "It's not going to be all about you, Molly," Bill chided, eye thickening with a smirk. "Who says it's even for a birthday?" "Well, I suppose a christening's next on the list, but we've got so little time." She turned back to her map, motion testing her front pockets for her keys. "Bill will probably make his way up to some kid's room - most likely very keen, beaming with adoration. And I'll say you need the toilet and then proceed on my merry way to the gift room - put in the confetti and Christmas tinsel and such.' Bill shook his head, gears turning in a way that made her want to crush his skull. "Should I bring silicone balls? Saw motion-detection alarms specifically for motion activated," he said. "No, just climb it like a fucking tree. And stockings, some kiddie pee pad. I've got the rest covered." The line for 'rest' made Bill's eyes widen. "You know what's funny? I just realised we've got a drink thief on the fucking _Jacob's Ladder_." "And you know what's funnier?" Molly snapped her bag top into place. "No one's hired parents yet. That's just a fucking loophole."
id
"gen-1727810497-ql5Rhl0nuW3lTTlPaW9Q"
total_cost
0.000363
upstream_id
"chatcmpl-36deb24a-5b85-4628-a7f3-336481413925"
created_at
"2024-10-01T19:22:32.536849+00:00"
Tuesday, October 1st, 2024 at 7:22:32 PM GMT+00:00
model
"eva-unit-01/eva-qwen-2.5-14b"
app_id
182717
streamed
true
cancelled
false
provider_name
"Infermatic"
latency
323
moderation_latency
(null)
generation_time
53939
finish_reason
"stop"
tokens_prompt
67
tokens_completion
690
native_tokens_prompt
72
native_tokens_completion
690
native_tokens_reasoning
(null)
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.000363
Evaluation details
Result
Evaluator
Details
Meta Data
99.3769%
Matches word count
n/a
neededClean
false
words
505
0.0025%
Dialogue to Total Word Ratio
Ratio: 55.45%, Deviation: 14.55%
neededClean
false
wordsTotal
505
wordsDialogue
280
49.6897%