NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-01-01KQHSRCAMD6XZFM4XQ6TB5KMW
Run Details
Model:
Grok 4.20 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 24s
Cost:
$0.018005
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
13.35%
Dialogue to Total Word Ratio
(details)
6.6761%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill sat in their cluttered van, surrounded by an array of ridiculous props including oversized shoes, colorful wigs, and a collection of whoopee cushions that threatened to go off at any moment. The vehicle was parked a safe distance from the vibrant party venue where colorful streamers adorned the entrance and joyous music played loudly from within. Molly, with her notebook open on her lap, was busy jotting down last minute ideas for distractions while Bill stared out the window at the arriving guests carrying wrapped boxes of various sizes, a look of sheer panic etched across his face. The plan to infiltrate the children's party and make off with all the presents was one that had evolved over several late night sessions filled with pizza and poor decisions into a full fledged operation complete with costumes, code names, and multiple backup plans. They knew the risks were high with parents and excited children everywhere but the potential payoff in pilfered video games, bicycles, and designer clothing made it irresistible for the pair of down on their luck thieves who specialized in the unusual. "Okay, let's go over this one more time," Molly said. "We arrive as the replacement entertainers after we call in a fake complaint about the original clown." "And I have to wear the wig?" Bill asked. "Yes, the purple one with bells," Molly replied. "It completes the look." "I feel silly already," Bill said with a groan. "That's the point. While the kids are laughing at your terrible jokes, I sneak the presents into the sacks disguised as party favors," Molly explained. Bill scratched his head. "How do we explain the van full of toys if someone asks?" "It's for the magic act. We pull them from hats or something," Molly answered. "Sounds fishy to me," Bill said. "Have faith in the plan. It's foolproof," Molly responded confidently. "Foolproof or fool hardy?" Bill questioned with raised eyebrows. "Both. Now stop worrying," Molly said. "What if we get caught by a kid?" Bill asked. "Offer them candy and run," Molly said. "This is the worst idea ever," Bill muttered under his breath. "But it'll be hilarious," Molly shot back with a wink. "Promise me we'll be quick," Bill pleaded. "In and out like pros," Molly assured him. Their conversation stretched on as they covered every possible scenario from a sudden rainstorm ruining their cardboard getaway props to an overeager child wanting to join their act. The van was filled with the sound of their voices mixing with the rustle of map papers and the occasional accidental honk from a clown horn. Molly described in vivid detail how Bill would perform a series of bumbling tricks that would keep the audience captivated while she worked her way to the gift table under the guise of assisting with decorations. She had even prepared a fake resume for their entertainment business that boasted performances at several non existent royal events. As the minutes ticked by, the two hardened criminals who usually dealt in more adult oriented capers found themselves laughing at the absurdity of donning face paint and giant bow ties to pull off the ultimate party prank heist. The setting sun painted the sky in hues of pink and gold, providing an ironically beautiful backdrop for their scheming. With the plan as polished as it was going to get, Molly and Bill shared a final high five before donning their disguises and stepping out into the unknown world of screaming kids and tempting treasures. This heist would either make them legends in the comedic crime world or have them fleeing from an army of angry parents armed with party hats.
created_at
"2026-05-01T12:55:57.529Z"
5/1/2026, 12:55:57 PM
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
78561
moderation_latency
(null)
generation_time
83630
tokens_prompt
77
tokens_completion
1774
native_tokens_prompt
184
native_tokens_completion
7137
native_tokens_completion_images
(null)
native_tokens_reasoning
6404
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0180053
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"9ee2efa7-0e9b-95ff-9ffa-2d667f6d3cb1"
is_byok
false
latency
78
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777640157-IdGHTt9olaTxCaMoUH2f"
session_id
(null)
api_type
"completions"
id
"gen-1777640157-hGDae5f8edwSVMmeFwxw"
upstream_id
"9ee2efa7-0e9b-95ff-9ffa-2d667f6d3cb1"
total_cost
0.0180053
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
613
13.35%
Dialogue to Total Word Ratio
Ratio: 23.65%, Deviation: 6.35%
neededClean
false
wordsTotal
613
wordsDialogue
145
6.6761%