NC Bench
Overview
About
Tests
Models
Model
GPT-4.1 Mini
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-04-14-01JRTSZSGHSVY2PNX1H6Q543NT
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Alright, Molly, you sure about this? Breaking into a kids’ party? Might as well be stealing candy from a sleeping bear.” “Bill, please. It’s not just any kids’ party. It’s Jenny’s eighth birthday. You know what that means? Mountains of presents. Cash value? Off the charts.” “Yeah, but kids screaming, parents hovering, clowns lurking... I’m not exactly feeling the glamour heist vibe.” “Relax. We’re not stopping the party. We’re blending in. I brought balloons.” “You brought balloons? Molly, you’re a criminal mastermind, not a party clown.” “Exactly! Nobody suspects the one with extra helium. I walk in, I mingle, I nab the loot. You’re on lookout. Easy.” “Lookout at a kids’ party? What, guard the punch bowl?” “Guard the punch bowl, guard the cake, whatever. Just don’t drop the bags or make a scene.” “Alright, but if a kid asks why I’m eyeing their gift like it’s a candy stash, I’m blaming you.” “Deal. Just remember, Bill, fortune favors the bold... and those who aren’t allergic to glitter.” “Great, because there’s definitely going to be glitter.” “Now grab a hat. We’re crashing a party, and I intend to leave with more than crumbs.”
id
"gen-1744656394-qxBD9Z8uk5JtqfCy31Mw"
upstream_id
"chatcmpl-BMJ7CgKJdTyqxtKkusY2mHed4PiGN"
total_cost
0.000478
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-14T18:46:37.11951+00:00"
Monday, April 14th, 2025 at 6:46:37 PM GMT+00:00
model
"openai/gpt-4.1-mini-2025-04-14"
app_id
182717
streamed
true
cancelled
false
latency
455
moderation_latency
130
generation_time
1471
tokens_prompt
68
tokens_completion
281
native_tokens_prompt
67
native_tokens_completion
282
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000478
Evaluation details
Result
Evaluator
Details
Meta Data
97.6276%
Matches word count
n/a
neededClean
false
words
193
36.7879%
Dialogue to Total Word Ratio
Ratio: 100.00%, Deviation: 10.00%
neededClean
false
wordsTotal
193
wordsDialogue
193
67.2078%