NC Bench
Model: GPT-4.1
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Run ID: 2025-04-14-01JRTSQQV6XMCZ8EYSPX90GZPB
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“You want me to do what?” Bill hissed, tugging his cap lower. “Party infiltration, Bill. We blend in. We nab the gifts.” “These are kids. They’ve got the sniffles. They eat paste.” “Exactly! No one suspects thieves at a pony party. We’ll be legends.” “I don’t do glitter. It gets in my beard. It stays for months.” “You’ll survive. It’s for the loot. Did you see that pile? Giant gift mountain.” “We’ll stick out! I can’t even fit into a bouncy castle.” “Think positive. You’re tall. You could say you’re the magician’s assistant.” “I’m not wearing a cape.” “Bill, come on. What if we’re already in disguise? Party hats. Maybe a balloon animal or two.” “A balloon dog won’t stop an angry mom.” “No one suspects a guy handing out cupcakes.” “I’m not baking, either.” “Fine. Then you get the piñata. Bet it’s loaded.” “Last time I hit a piñata, candy hit me in the face. Hard.” “Should’ve ducked. Anyway, you’ll handle distractions. I grab the bags.” “So, you get the loot, I get trampled by toddlers.” “Teamwork. And we split the haul.” “Fine, Molly. But one sprinkle lands on me, I’m out. And I’m not singing happy birthday.”
id: "gen-1744656130-kFPS4pAXTteWNliWg44x"
upstream_id: "chatcmpl-BMJ2wwJyclSIzuDOH022OJ6ecV0ku"
total_cost: 0.002486
cache_discount: (null)
provider_name: "OpenAI"
created_at: "2025-04-14T18:42:15.044612+00:00" (Monday, April 14th, 2025 at 6:42:15 PM GMT+00:00)
model: "openai/gpt-4.1-2025-04-14"
app_id: 182717
streamed: true
cancelled: false
latency: 752
moderation_latency: 103
generation_time: 3187
tokens_prompt: 68
tokens_completion: 293
native_tokens_prompt: 67
native_tokens_completion: 294
native_tokens_reasoning: 0
num_media_prompt: (null)
num_media_completion: (null)
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.002486
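As a rough illustration of how the generation metadata above can be read, the sketch below derives throughput and cost per 1,000 tokens from the recorded values. It assumes that `generation_time` is in milliseconds and that `total_cost` covers both prompt and completion tokens; neither is stated on this page, and the function name `summarize_generation` is hypothetical.

```python
# Values copied from the metadata above.
meta = {
    "generation_time": 3187,          # assumed to be milliseconds
    "native_tokens_prompt": 67,
    "native_tokens_completion": 294,
    "total_cost": 0.002486,           # assumed to cover prompt + completion
}


def summarize_generation(meta: dict) -> dict:
    """Derive throughput and unit cost from one generation record."""
    seconds = meta["generation_time"] / 1000
    total_tokens = meta["native_tokens_prompt"] + meta["native_tokens_completion"]
    return {
        # Completion tokens produced per second of generation time.
        "tokens_per_second": meta["native_tokens_completion"] / seconds,
        # Cost normalized to 1,000 native tokens (prompt and completion combined).
        "cost_per_1k_tokens": meta["total_cost"] / total_tokens * 1000,
    }


summary = summarize_generation(meta)
print(f"{summary['tokens_per_second']:.2f} tokens/s")
print(f"{summary['cost_per_1k_tokens']:.6f} per 1k tokens")
```

Under those assumptions the run works out to roughly 92 completion tokens per second.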
Evaluation details
Result: 99.9990%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean = false, words = 199

Result: 0.0000%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 96.98%, Deviation: 46.98%
Meta Data: neededClean = false, wordsTotal = 199, wordsDialogue = 193
49.9995%
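The reported ratio and deviation can be reproduced from the word counts in the evaluation details. The following is a minimal sketch, not the benchmark's actual evaluator: it assumes dialogue words are those enclosed in double quotes (curly or straight), that words are split on whitespace, and that deviation is the absolute distance from the 50% target. The function names are hypothetical. How each evaluator turns its measurement into a percentage score is not shown on this page, so that step is left out.

```python
import re

TARGET_RATIO = 0.5  # the scenario asks for 50% dialogue


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return (dialogue ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return ratio, abs(ratio - target)


def dialogue_stats(text: str) -> dict:
    """Count whitespace-separated words inside double quotes (assumed rule)."""
    spoken = re.findall(r'[“"]([^”"]*)[”"]', text)
    words_dialogue = sum(len(segment.split()) for segment in spoken)
    words_total = len(text.split())
    ratio, deviation = ratio_and_deviation(words_dialogue, words_total)
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": deviation,
    }


# Reproduce the reported figures from the recorded counts.
ratio, deviation = ratio_and_deviation(193, 199)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 96.98%, Deviation: 46.98%"
```

The six non-dialogue words are consistent with the single narration fragment in the response ("Bill hissed, tugging his cap lower."). The final figure of 49.9995% equals the arithmetic mean of the two evaluator results, (99.9990 + 0.0000) / 2, which suggests, but does not confirm, that the overall score is a simple average.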