NC Bench
Model: Phi-3 Mini 128k
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Run ID: 2024-07-19-01J35TD1TWNERDKJWHWGAF4QG4
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly: "Alright, this is it, Bill. We swipe gifts from the healthiest alarm system—a kid's party." Bill: "Relax, we’ve got this. No one—babies, kids, or parents—hear us outside if we chill near the patio.” Molly: "Ha, genius. So, emergency exit plan? There’s just a roundabout and it goes uphill…” Bill: "Nice breakdown. Let's use that. Perfect car chase material. And our getaway? What?" Molly: "There’s the ’82 Skylark in the garage—cheap insurance. No rush lights, plus a baby can't spot it." Bill: "Perfect. Can you bring a dubstep playlist? I need some adrenaline." Molly: "Playlist ready. Oh! And those kidney-shaped cups? Unmarked containers will make them look like our own. Smart.” Bill: "That, plus we’ll place our jargon in napkins as apology notes. 'Sorry kids, we lost the tickets'... delayed poetic justice.” (They review their plan with a mix of dread and excitement. Molly, with a sly smile: "Last thing, carry that awkward silence, will you?" Bill: "It's all about that. Keeps you alive through the upgrades. Alright, this is insane but... absolutely hilarious.”) Molly: "To healthier alternative medicine or the thrill of the heist, here we stand.” Bill: "And laughs, that’s our best piece of stolen jewelry, right? Let's roll."
id: "gen-Zr9Bg8bFv02aath0sdZ0wE8icYYt"
total_cost: 0.0000511
upstream_id: "cmpl-965db1a2ffdc43848ceec8c4f35b7eac"
created_at: "2024-07-19T15:40:11.941472+00:00" (Friday, July 19th, 2024 at 3:40:11 PM GMT+00:00)
model: "microsoft/phi-3-mini-128k-instruct"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Azure"
latency: 4783
moderation_latency: (null)
generation_time: 0
finish_reason: "stop"
tokens_prompt: 82
tokens_completion: 323
native_tokens_prompt: 83
native_tokens_completion: 428
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0000511
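As a quick sanity check on the metadata above, the reported cost can be divided by the native token counts to see what per-token rate it implies. This is only a sketch: the page does not state how the cost is computed, so the flat per-token rate is an assumption, and `implied_rate_per_token` is a hypothetical helper name.

```python
# Values copied from the run metadata above.
total_cost = 0.0000511
native_tokens_prompt = 83
native_tokens_completion = 428


def implied_rate_per_token(cost: float, prompt_tokens: int, completion_tokens: int) -> float:
    """Return cost divided by total native tokens.

    Assumption: prompt and completion tokens are billed at the same flat rate.
    The source page does not confirm this.
    """
    return cost / (prompt_tokens + completion_tokens)


rate = implied_rate_per_token(total_cost, native_tokens_prompt, native_tokens_completion)
print(f"Implied rate: {rate * 1_000_000:.4f} cost units per million native tokens")
```

Under that assumption, the 511 native tokens and the 0.0000511 cost line up with a rate of about 1e-7 per token. The currency is not given on the page.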
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 98.7124% | Matches word count | n/a | neededClean: false; words: 206 |
| 99.0727% | Dialogue to Total Word Ratio | Ratio: 86.89%, Deviation: 3.11% | neededClean: false; wordsTotal: 206; wordsDialogue: 179 |
98.8925%
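The dialogue ratio and deviation reported above can be reproduced from the two word counts (179 dialogue words out of 206) and the scenario's 90% target. The sketch below also shows one plausible way to count dialogue words, assuming dialogue means text between double quotes. That quoting rule, the function names, and the sample sentence are assumptions for illustration, not the evaluator's actual implementation, and the percentage scores in the Result column are not derived here because the page does not give their formula.

```python
import re

# Assumption: dialogue is any run of text between double quotes,
# either straight (") or curly (“ ”).
QUOTED = re.compile(r'["“]([^"“”]*)["”]')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def ratio_stats(words_dialogue: int, words_total: int, target_pct: float = 90.0) -> dict:
    """Compute the dialogue share and its absolute distance from the target."""
    ratio = 100.0 * words_dialogue / words_total
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": round(ratio, 2),
        "deviation": round(abs(target_pct - ratio), 2),
    }


def dialogue_stats(text: str, target_pct: float = 90.0) -> dict:
    """Count total and quoted words in a scene, then compute the ratio."""
    total = count_words(text)
    dialogue = sum(count_words(chunk) for chunk in QUOTED.findall(text))
    return ratio_stats(dialogue, total, target_pct)


# Reproduce the reported figures from the counts in the evaluation details.
reported = ratio_stats(words_dialogue=179, words_total=206)
print(reported["ratio"], reported["deviation"])

# Hypothetical sample sentence, not taken from the model response.
sample = dialogue_stats('Molly: "Let\'s roll." Bill nodded.')
print(sample)
```

With the reported counts, this gives a ratio of 86.89 and a deviation of 3.11, matching the Details column. The quote-based counter may not reproduce the 179 figure on the full response, since the evaluator's tokenization and its handling of mismatched quotes and parenthetical narration are not documented on this page.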