NC Bench
Run: 2026-03-12-01KKHZ1QF8J879QRZHFWET0ZBE
Run Details
Model: Grok 4.20 (Beta, Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 47.7s
Cost: $0.069654
Provider: xAI
Evaluation summary
Score      Evaluator
0.00%      Matches word count
0.08%      Dialogue to Total Word Ratio
0.0403%    Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The two experienced thieves had come up with many plans in their career but this one took the cake for being the most absurd and potentially hilarious. They had chosen a warm Saturday afternoon when the party was scheduled to start, making sure to arrive early enough to observe but not so early as to arouse suspicion from local residents. Molly and Bill were parked in a shady spot across from the large suburban home where the children's party was taking place. The yard was filled with inflatable games, tables laden with snacks, and of course the impressive pile of presents stacked high next to the birthday cake stand. Children in party clothes ran around chasing each other and playing with bubbles being blown by an adult. The sun was shining brightly, making the whole scene look like something from a magazine rather than a target for two seasoned thieves. Molly had spent days planning this, researching the family, the schedule of the party, and even the types of presents likely to be there based on the invitation list she had managed to obtain through dubious means. Bill was less enthusiastic but always went along with her schemes because they usually worked out in funny ways that avoided serious trouble. They had their bag of tricks ready, including fake IDs for the entertainment company, colorful wigs, and a variety of silly props that would allow them to mingle without raising too much suspicion. The plan involved entering during a specific game, using the distraction to access the gifts, and then making their escape while the party was still in high gear. It was a bold move but they believed the comedic element would play in their favor as no one would expect such a brazen act at such an innocent event. The flowers in the garden were in full bloom adding to the cheerful vibe. Birds sang in the trees seemingly laughing at the ridiculous plan. "Molly, this is the worst idea you've ever had. Stealing from a children's party? 
We'll be the villains of the neighborhood." Bill's voice was filled with skepticism as he stared at the happy scene. Molly didn't seem phased as she applied the last touches to her disguise in the mirror. The tension was mounting but they both knew there was no turning back now. "It's not that bad. Kids get too many presents anyway. They'll never miss a few. Besides, it's all in good fun." She smiled confidently while adjusting her bright red nose. Bill paced in the limited space of the van, his mind racing with possible things that could go wrong during the operation. He wondered if this would be the heist that finally got them caught. "Good fun? What if they have security or one of those watchful moms? And how do we carry all those presents out without looking suspicious?" His concerns were valid but Molly had answers for everything as she always did in these situations. She pulled out a large bag filled with balloon animals and other items. The props looked ridiculous but that was the point. "We'll use our clown disguises and the big red wagon as our prop. During the magic show, we load up and say it's part of the illusion. Trust me, it will work." The words seemed to calm Bill a little though he still had reservations about targeting such a wholesome event. He looked at the house again and imagined the aftermath of their actions. "I trust you but this feels wrong. Maybe we should just rob a bank like normal people." Molly finished her preparation and turned to him with a determined expression, ready to launch into the heist. Her eyes sparkled with the thrill of the upcoming adventure. "Where's the comedy in that? This is our chance to pull off the ultimate prank heist. Now stop worrying and let's finalize the details." They continued to discuss the finer points as they prepared to exit the van and begin their infiltration, the air filled with the sounds of the ongoing party that they were about to turn upside down in their unique way. 
The neighbors were none the wiser and the kids continued their games, oblivious to the comedic criminals in their midst planning to make off with their treasures. With everything set, they stepped out and headed toward the party with smiles on their faces and mischief in their hearts. The entire neighborhood seemed peaceful but soon it would be the scene of one of the most unusual heists ever conceived by the pair of comical crooks who specialized in such lighthearted but illegal escapades.
created_at: "2026-03-12T21:25:39.438Z" (3/12/2026, 9:25:39 PM)
model: "x-ai/grok-4.20-beta-20260309"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 45184
moderation_latency: (null)
generation_time: 47507
tokens_prompt: 77
tokens_completion: 1128
native_tokens_prompt: 184
native_tokens_completion: 11586
native_tokens_completion_images: 0
native_tokens_reasoning: 10727
native_tokens_cached: 128
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.0696536
router: (null)
provider_responses:
  0:
    endpoint_id: "33f49f24-a0de-4dcf-872d-ca514e83fbae"
    id: "b2eceb58-6edb-3ec8-b140-289bb5b272b8"
    is_byok: false
    latency: 283
    model_permaslug: "x-ai/grok-4.20-beta-20260309"
    provider_name: "xAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773350739-ixyo8EQQmtVZOZMUvywt"
upstream_id: "b2eceb58-6edb-3ec8-b140-289bb5b272b8"
total_cost: 0.0696536
cache_discount: 0.0002304
upstream_inference_cost: 0
provider_name: "xAI"
Evaluation details
Result: 0.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 775

Result: 0.08%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 18.06%, Deviation: 11.94%
Meta Data: neededClean: false, wordsTotal: 775, wordsDialogue: 140

Overall: 0.0403%
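
The ratio and deviation reported for the dialogue evaluator can be reproduced from the word counts in its metadata. The sketch below is illustrative only: the function names, the quote-matching rule, and the use of an absolute difference against the 30% target are assumptions, not NC Bench's actual evaluator code. It also does not try to reproduce how a deviation is converted into the final score, since the page does not state that formula.

```python
import re


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target_ratio: float = 0.30) -> tuple[float, float]:
    """Return (dialogue ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    return ratio, abs(target_ratio - ratio)


def dialogue_stats(text: str, target_ratio: float = 0.30) -> dict:
    """Count total words and words inside double quotes.

    Assumption: "dialogue" means any text between straight or curly
    double quotes, and words are whitespace-separated tokens.
    """
    words_total = len(text.split())
    quoted = re.findall(r'["“]([^"”]*)["”]', text)
    words_dialogue = sum(len(q.split()) for q in quoted)
    if words_total == 0:
        ratio, deviation = 0.0, target_ratio
    else:
        ratio, deviation = ratio_and_deviation(
            words_dialogue, words_total, target_ratio)
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": deviation,
    }


# Counts taken from the evaluation details of this run.
ratio, deviation = ratio_and_deviation(140, 775)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

With the counts from this run (140 dialogue words out of 775), this gives a ratio of 18.06% and a deviation of 11.94% from the 30% target, matching the figures in the details above.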