NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-12-01KKHYYWTRG81JTDPMDX8KYX7H
Run Details
Model:
Grok 4.20 (Beta, Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
46.6s
Cost:
$0.058662
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
1.68%
Dialogue to Total Word Ratio
(details)
0.8398%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill sat in a booth at the back of the local coffee shop, poring over their notes for the upcoming job. The plan was unconventional even for them, but Molly was convinced it would work. Bill was not so sure. He fidgeted with his coffee cup, spilling a little on the table as he shifted in his seat. "Molly I can't believe we're doing this. A kids party? To steal presents?" Bill asked with disbelief in his voice. She leaned forward, her eyes sparkling with the thrill of the scheme, ignoring his concerns as she always did when she had a crazy idea. "It's the ultimate heist Bill. All the gifts are in one place and the security is just a few parents. Easy pickings," Molly replied confidently. Molly rolled her eyes and took a sip of her drink before launching into her prepared speech about the merits of their plan. "Feel bad? These kids have more presents than they need. We're doing them a favor by teaching them not to take things for granted. Now for the plan you will dress as a clown and I will be the magician," Molly said. Bill's face turned pale at the thought of the costume. He had always hated clowns since he was a child and now he had to become one. "A clown? That is humiliating. I don't want to wear the makeup and the big shoes," Bill protested. She ignored his protest and continued to describe the scene in vivid detail, gesturing with her hands to illustrate the movements they would make. "It's either that or we abort the whole thing. The clown will provide the laughs while I use my magic tricks to distract and then we grab the loot during the singing of the birthday song," Molly explained. He nodded slowly, starting to see the logic despite himself. The coffee was cold now but he drank it anyway to have something to do. "What if the kids see us taking the presents?" Bill questioned. Molly smiled and pulled out a small notebook from her bag, showing him some sketches she had made of the room layout. "We will tell them it's a special magic trick where the gifts disappear and reappear later. But they won't notice because they will be too busy with the games and food. The timing is critical," Molly answered. "How do we escape after?" Bill wondered. She pointed to the map on the table, tracing the route with her finger. "The van will be waiting in the alley. We load the bags quickly and drive away playing the ice cream truck music to blend in. It's genius," Molly said with a laugh. Bill finally smiled a little, caught up in her enthusiasm despite his initial reservations. "This could go wrong in so many ways but I guess I'm in. Let's go over the details again," Bill agreed reluctantly. The pair spent the rest of the afternoon refining their comedic heist strategy, laughing at the absurdity of it all and imagining the looks on the children's faces when they discovered their missing gifts. They knew it was risky but the potential payoff in both money and laughs made it irresistible for the two seasoned thieves.
created_at
"2026-03-12T21:24:06.622Z"
3/12/2026, 9:24:06 PM
model
"x-ai/grok-4.20-beta-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
44775
moderation_latency
(null)
generation_time
46446
tokens_prompt
77
tokens_completion
776
native_tokens_prompt
184
native_tokens_completion
9754
native_tokens_completion_images
0
native_tokens_reasoning
9140
native_tokens_cached
128
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0586616
router
(null)
provider_responses
0
endpoint_id
"33f49f24-a0de-4dcf-872d-ca514e83fbae"
id
"e6e88a72-0322-602b-8deb-b99f0eeb9131"
is_byok
false
latency
272
model_permaslug
"x-ai/grok-4.20-beta-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773350646-6DJqkkoKLNCyQgdqkTqS"
upstream_id
"e6e88a72-0322-602b-8deb-b99f0eeb9131"
total_cost
0.0586616
cache_discount
0.0002304
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
542
1.68%
Dialogue to Total Word Ratio
Ratio: 40.96%, Deviation: 9.04%
neededClean
false
wordsTotal
542
wordsDialogue
222
0.8398%