NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-01-01KQHS40V9ZEWXJWTRBMA9EX4P
Run Details
Model:
Grok 4.20 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 30s
Cost:
$0.032815
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
17.56%
Dialogue to Total Word Ratio
(details)
8.7801%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill huddled in their dimly lit apartment, blueprints and sketches covering the old wooden table. The air smelled of coffee and anticipation as they finalized their comedic heist plan. The pair had spent days observing the party preparations from afar, noting the times when the children would be most distracted. Their van was loaded with props and bags for the loot, parked discreetly nearby. This job was supposed to be their big break from small time cons. They had come up with the idea after seeing the advertisement for the lavish birthday bash in the local paper. It seemed like the perfect target for their particular set of skills in thievery and performance arts. Molly traced paths with her finger while Bill fidgeted nervously. "Bill, we go in during the magic show slot. The kids will be mesmerized," Molly said. Bill shook his head, rubbing his temples as the clock ticked loudly. "But Molly, breaking into a children's party? It feels wrong. Those presents are for little Timmy's birthday," he replied. She waved dismissively and began pacing the room, her footsteps echoing her excitement. "Wrong? It's an opportunity. All those shiny boxes stacked high. We can be in and out before the ice cream is served. You handle the entertainment part," Molly continued. Bill pictured sticky fingers and screams, frowning as he crossed his arms. "What if the kids want face painting? I can't paint for crap," Bill interjected. Molly laughed, pulling ridiculous costumes from a bag. The outfits sparkled garishly. "You'll be fine. Wear the clown suit. I'll be the strict assistant keeping order. We'll say the real entertainers sent us as backups. During the chaos of opening presents, we load them into the goody bags we brought. It's brilliant," she explained. Bill fingered the fabric doubtfully, holding it up to his frame. "Brilliant? Last brilliant idea had us running from guard dogs. These are children's parties with watchful eyes everywhere. One wrong move and we're done," Bill said. Molly mapped the escape route with markers on paper, ignoring him. "Done? No way. The van is parked behind the bushes. We use the bouncy castle to block the view. Grab the big ones first, the ones with bows. Leave the homemade stuff. Think of the money, Bill. No more petty thefts," Molly stated. He examined the plan, greed slowly overtaking his reluctance. "Okay, but we don't take the cake. That's sacred. And if a kid cries, we give back one present. Those are my terms," Bill negotiated. Molly slapped his shoulder, smiling widely with shared mischief. "Agreed. No cake theft and one present return policy for criers. Now let's rehearse our act. You trip over your feet for laughs while I direct the crowd," Molly suggested. "Sounds like a plan, Molly. Let's make those kids remember this birthday for all the wrong reasons," Bill said with a smirk. They practiced their routine in the cluttered space, mixing laughter and bickering while perfecting silly tricks and signals. As the clock approached the hour, nervousness mixed with excitement in the air. They knew the risks but the lure of all those unopened gifts was too strong to resist. With their plan set, they donned their disguises and headed out into the afternoon sun, ready to crash the celebration in style.
created_at
"2026-05-01T12:44:50.415Z"
5/1/2026, 12:44:50 PM
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
87448
moderation_latency
(null)
generation_time
88927
tokens_prompt
77
tokens_completion
3985
native_tokens_prompt
184
native_tokens_completion
13061
native_tokens_completion_images
(null)
native_tokens_reasoning
12381
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0328153
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"7b8baf14-9cd9-931d-b72c-d04186eed161"
is_byok
false
latency
80
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777639490-vNfS1zcVOua5C6PMLl92"
session_id
(null)
api_type
"completions"
id
"gen-1777639490-TnquBruwRmORhNBvHFCs"
upstream_id
"7b8baf14-9cd9-931d-b72c-d04186eed161"
total_cost
0.0328153
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
551
17.56%
Dialogue to Total Word Ratio
Ratio: 44.10%, Deviation: 5.90%
neededClean
false
wordsTotal
551
wordsDialogue
243
8.7801%