NC Bench
Run: 2026-05-01-01KQHSYD8V9FTFHSDH2D2CF099
Run Details
Model: Grok 4.20 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 2m 22s
Cost: $0.032008
Provider: xAI
Evaluation summary
Score | Evaluator
0.00% | Matches word count
10.62% | Dialogue to Total Word Ratio
5.3078% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
In the shadow of large oak trees that lined the property, Molly and Bill crouched low behind a row of neatly trimmed bushes, their binoculars trained on the chaotic scene before them. The children's party was a whirlwind of activity, with young ones in pointed hats chasing each other across the grass, parents sipping punch under a canopy, and in the center of it all, an enormous table piled high with wrapped boxes and bows of every color imaginable. This was no ordinary heist for the pair of experienced thieves. Usually they went after valuable art or cash from safes, but recent bad luck had led them to this desperate and comical idea of robbing a kids' birthday bash for all the presents. They had spent days observing the family routines and knew the schedule by heart. "Bill, we strike when they bring out the cake. Everyone gathers around then," Molly whispered, her eyes gleaming with anticipation. "Stealing from kids feels wrong. Can't we find another target?" Bill replied, his voice filled with hesitation and doubt. "Nonsense. The presents are worth a fortune. We'll be in and out before they sing happy birthday," she replied confidently, brushing off his moral qualms as she always did. "Fine. But how do we avoid detection? There are like twenty rugrats out there," he muttered, scanning the area with a worried frown. "Simple. You distract them with magic tricks and I'll haul the gifts to the cart disguised as a pony ride," Molly explained, her voice steady and assured as she outlined the details of their crazy scheme. "What if the pony cart idea fails? Ponies don't usually show up at these things," Bill countered worriedly, imagining all the ways their plan could go south in spectacular fashion. "But what about the parents? They might notice two new clowns," he added after a moment of thought, his anxiety showing clearly. "We act like we were hired. I'll say the regular guy got sick," Molly responded with a shrug, unconcerned by the potential flaws. "Pickle? Okay. 
This better work or we're the laughingstock of the thief world," he said with a nervous chuckle, recalling their earlier discussion about a code word that had seemed funny at the time. "Trust me. It'll be the easiest score yet," Molly assured him, patting his shoulder in a show of support. "This is insane. But let's do it. For the loot," he added finally, resigning himself to the plan despite his reservations. After finishing their discussion, the two thieves turned their attention to the final preparations for the job that lay ahead of them. Molly carefully checked the large empty bags they would use for transporting the stolen goods, making sure they were sturdy enough for a quick getaway. Meanwhile, Bill practiced pulling a stuffed rabbit from his sleeve with little success, fumbling the trick several times and cursing under his breath. The comedic duo then bickered lightly over who would wear the most embarrassing disguise, with Bill eventually conceding to the oversized clown nose that made him look completely ridiculous and out of place. Their plan involved sneaking through the side fence during a rousing game of musical chairs, using the natural chaos of the children's event to their advantage without drawing too much unwanted suspicion from the adults. Bill would perform a few lame jokes and simple magic tricks to draw the children's attention away from the gift table while Molly swiftly transferred the presents into large sacks that they had labeled as trash bags for better camouflage. They even had a backup plan involving releasing a bunch of balloons to create a colorful diversion if things began to go awry at any point. One wrong move and they could find themselves being chased across the lawn by a horde of angry parents and a swarm of confused, crying children demanding their toys back immediately.
created_at
"2026-05-01T12:59:15.102Z"
5/1/2026, 12:59:15 PM
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
133580
moderation_latency
(null)
generation_time
141961
tokens_prompt
77
tokens_completion
4580
native_tokens_prompt
184
native_tokens_completion
12765
native_tokens_completion_images
(null)
native_tokens_reasoning
12014
native_tokens_cached
128
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0320081
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"c402a38f-1e5c-8df9-e170-e4ce581316f3"
is_byok
false
latency
67
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777640355-6rIaDVKkhy4FInfQkCBR"
session_id
(null)
api_type
"completions"
id
"gen-1777640355-SCVQr33FOXyA7XcJUrwZ"
upstream_id
"c402a38f-1e5c-8df9-e170-e4ce581316f3"
total_cost
0.0320081
cache_discount
0.0001344
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 648
10.62% | Dialogue to Total Word Ratio | Ratio: 23.30%, Deviation: 6.70% | neededClean: false, wordsTotal: 648, wordsDialogue: 151
5.3078%
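The ratio and deviation reported above can be reproduced from the two word counts. The following is a minimal sketch, not the benchmark's actual evaluator: the function name `dialogue_ratio`, the whitespace-based word split, and the rule that dialogue is any text between straight double quotes are all assumptions made for illustration. How the evaluator turns a 6.70% deviation into a 10.62% score is not stated on this page, so that step is left out.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a piece of prose.

    Assumptions (hypothetical, not taken from NC Bench):
    - words are whitespace-separated tokens
    - dialogue is any text enclosed in straight double quotes
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


# Figures taken from the evaluation details: 151 dialogue words out of 648.
reported_ratio = 151 / 648
target_ratio = 0.30  # the scenario asks for 30% dialogue
reported_deviation = abs(reported_ratio - target_ratio)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {reported_deviation:.2%}")
# prints "Ratio: 23.30%, Deviation: 6.70%"
```

Calling `dialogue_ratio` on the model response may not return exactly 151 and 648, since the evaluator's own tokenization and quote handling are unknown. The word-count check is simpler: the response has 648 words against a requested 500, which the page scores as 0.00%.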