NC Bench
Model
GPT-4.1 Mini
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-04-14-01JRTSTB6KQZYK14Y88RVJFY3H
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Alright, Bill, you sure about this? Sneaking into a kiddie party to steal gifts—feels a bit like stealing candy from, well, a bunch of kids.” “Exactly why it’s genius, Molly! No security, just sugar-fueled chaos and distracted adults. We swoop in, grab the goods, and bounce before anyone realizes the cake’s missing.” “Mmmm, cake. Wait, we’re not here for cake. We’re here for presents. Toys. Batteries included.” “Right. Batteries. Now, which room’s the motherlode? The one with the most screaming children or the one with the suspiciously quiet Aunt Edna?” “Quiet Aunt Edna is probably wrapping her own present in a shadowy corner. Too risky. The playroom’s a frenzy of toddlers playing musical chairs—and maybe some cash-value Xbox stuff.” “Kids have terrible attention spans. We get in with the clown, distract them with balloon animals, then—poof! presents gone.” “Do you know how to make balloon animals?” “Bill, the last thing I made looked like a deflated dachshund on a bad day.” “Perfect. Scare the kids into submission.” “Molly, you’re the brains, and also, apparently, the clown. I fetch the getaway car?” “I’ll bring the sugar-free gum. Won’t let those kids see us coming until it’s too late." “We pull this off, and we’re legends. Or grounded for life.” “Legends who never get invited to birthday parties again.” “Deal. Let’s do this heist.”
id
"gen-1744656215-SJHbeu9gqfr8EgaDjEPb"
upstream_id
"chatcmpl-BMJ4JQ71ZteDM4H3X70HOlbNHC2od"
total_cost
0.00055
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-14T18:43:39.267356+00:00"
Monday, April 14th, 2025 at 6:43:39 PM GMT+00:00
model
"openai/gpt-4.1-mini-2025-04-14"
app_id
182717
streamed
true
cancelled
false
latency
476
moderation_latency
88
generation_time
1942
tokens_prompt
68
tokens_completion
326
native_tokens_prompt
67
native_tokens_completion
327
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00055
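The token counts, timings, and cost above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the benchmark's or the provider's own accounting. It assumes the timing fields are in milliseconds (the listing gives no units) and uses per-million-token prices of 0.40 and 1.60 USD, which are assumptions not stated on this page. The helper names are hypothetical.

```python
# Values copied from the metadata listing above.
NATIVE_TOKENS_PROMPT = 67
NATIVE_TOKENS_COMPLETION = 327
GENERATION_TIME = 1942  # ASSUMPTION: milliseconds; the listing states no unit
TOTAL_COST = 0.00055

# ASSUMPTION: illustrative prices in USD per 1M tokens, not taken from this page.
ASSUMED_INPUT_PRICE_PER_M = 0.40
ASSUMED_OUTPUT_PRICE_PER_M = 1.60


def tokens_per_second(completion_tokens: int, generation_time_ms: float) -> float:
    """Completion throughput, given a generation time in milliseconds."""
    return completion_tokens / (generation_time_ms / 1000.0)


def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  input_price_per_m: float, output_price_per_m: float) -> float:
    """Estimated cost in USD from token counts and per-million-token prices."""
    return (prompt_tokens * input_price_per_m
            + completion_tokens * output_price_per_m) / 1_000_000


throughput = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME)
estimated_cost = estimate_cost(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION,
                               ASSUMED_INPUT_PRICE_PER_M, ASSUMED_OUTPUT_PRICE_PER_M)

print(f"throughput: {throughput:.1f} tokens/s")
print(f"estimated cost: {estimated_cost:.5f} USD (reported: {TOTAL_COST})")
```

Under those assumed prices the estimate rounds to the reported total_cost. That is consistent with the listing but does not confirm how the cost was actually computed.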
Evaluation details
Result | Evaluator | Details | Meta Data
0.2141% | Matches word count | n/a | neededClean: false, words: 228
36.7879% | Dialogue to Total Word Ratio | Ratio: 100.00%, Deviation: 10.00% | neededClean: false, wordsTotal: 228, wordsDialogue: 228
18.5010%
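The ratio and deviation reported above can be reproduced from the word counts. The sketch below assumes the deviation is the absolute difference between the measured ratio and the 90% target named in the scenario. The page does not say how a deviation is turned into a percentage result, so that step is not modeled. The last check only notes that the final figure is arithmetically equal to the mean of the two evaluator results; whether the benchmark aggregates that way is an assumption. Function names are hypothetical.

```python
# Values copied from the evaluation details above.
WORDS_TOTAL = 228
WORDS_DIALOGUE = 228
TARGET_RATIO = 0.90  # from the scenario: "90% dialogue"
EVALUATOR_RESULTS = [0.2141, 36.7879]  # percentages, one per evaluator
FINAL_RESULT = 18.5010  # percentage shown last


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that are dialogue."""
    return words_dialogue / words_total


def deviation(ratio: float, target: float) -> float:
    """ASSUMPTION: deviation is the absolute distance from the target ratio."""
    return abs(ratio - target)


ratio = dialogue_ratio(WORDS_DIALOGUE, WORDS_TOTAL)
dev = deviation(ratio, TARGET_RATIO)
mean_result = sum(EVALUATOR_RESULTS) / len(EVALUATOR_RESULTS)

print(f"Ratio: {ratio:.2%}, Deviation: {dev:.2%}")
print(f"Mean of evaluator results: {mean_result:.4f}% (final shown: {FINAL_RESULT}%)")
```

Because every word of the response was counted as dialogue, the ratio is 100%, which overshoots the 90% target by 10 percentage points.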