NC Bench
Model: Inflection 3 (PI)
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Run ID: 2024-10-13-01JA3119AD53VK7BM6GW45YMRH
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced the dimly lit motel room, brainstorming, while Bill lounged on a worn-out armchair. "We’ll strike at dusk, when they’re all hopped up on sugar," Molly declared. Bill raised an eyebrow. "You think two grown adults won’t look out of place at a kids’ party?" Molly smirked. "Ah, but we’ll blend in—I’ve got us costumes. You’ll be a giant purple dinosaur." Bill groaned. "Perfect. Just what I’ve always wanted." "I’ll be a fairy princess." "You’ve really given this some thought, haven’t you?" "Trust me, Bill, it’s going to be a piece of cake. We’ll sneak in, snatch the presents, and disappear like magic," Molly concluded with a mischievous grin. Bill shook his head in disbelief. "Well, let’s just hope this plan doesn’t end up being a comedy of errors."
id: "gen-1728826201-sOBH1BKIwdq45IqX66WW"
total_cost: 0.002135
cache_discount: (null)
created_at: "2024-10-13T13:30:10.419709+00:00" (Sunday, October 13th, 2024 at 1:30:10 PM GMT+00:00)
model: "inflection/inflection-3-pi"
app_id: 182717
streamed: true
cancelled: false
provider_name: "Inflection"
latency: 1847
moderation_latency: (null)
generation_time: 6846
finish_reason: (null)
tokens_prompt: 68
tokens_completion: 193
native_tokens_prompt: 74
native_tokens_completion: 195
native_tokens_reasoning: (null)
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.002135
Evaluation details

Evaluator: Matches word count
Result: 0.0000%
Details: n/a
Meta Data: neededClean: false, words: 132

Evaluator: Dialogue to Total Word Ratio
Result: 0.0000%
Details: Ratio: 71.21%, Deviation: 21.21%
Meta Data: neededClean: false, wordsTotal: 132, wordsDialogue: 94
0.0000%
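
The reported ratio and deviation can be reproduced from the counts above: 94 dialogue words out of 132 total is about 71.21%, which is 21.21 percentage points from the 50% target. The sketch below is one plausible way to compute such figures and is not NC Bench's actual evaluator. The function names, the whitespace-based word splitting, and the rule that dialogue is whatever sits between straight double quotes are all assumptions made for illustration.

```python
import re


def ratio_from_counts(words_dialogue: int, words_total: int, target: float = 0.5) -> dict:
    """Turn raw counts into a ratio and its deviation from the target (both in percent)."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "ratio_pct": round(ratio * 100, 2),
        "deviation_pct": round(abs(ratio - target) * 100, 2),
    }


def dialogue_stats(text: str, target: float = 0.5) -> dict:
    """Count words overall and inside straight double quotes (assumed dialogue marker)."""
    words_total = len(text.split())
    # Every span between a pair of straight double quotes is treated as dialogue.
    spoken = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in spoken)
    stats = ratio_from_counts(words_dialogue, words_total, target)
    stats.update({"wordsTotal": words_total, "wordsDialogue": words_dialogue})
    return stats


if __name__ == "__main__":
    # Counts taken from the evaluation meta data of this run.
    print(ratio_from_counts(94, 132))
    # A tiny hypothetical sample: 4 of its 8 words are dialogue.
    print(dialogue_stats('Molly smirked. "We strike at dusk," she said.'))
```

A real evaluator would likely also need to handle curly quotation marks and nested quotes, which this sketch ignores.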