NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-01-01KQHSDFPQE4D422TYW37CMQM2
Run Details
Model:
Grok 4.20 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 59s
Cost:
$0.027755
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
26.92%
Dialogue to Total Word Ratio
(details)
13.4623%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The flickering light from the old laptop screen illuminated the faces of Molly and Bill as they huddled in their cluttered hideout, surrounded by maps, blueprints, and half-eaten pizzas from last night's stakeout. This particular job had them both on edge, not because of the danger, but due to the unconventional target of a lively children's birthday party complete with clowns, games, face painting, and a massive pile of brightly wrapped presents that could fetch a pretty penny on the black market for stolen goods. Molly, the meticulous planner with her sharp wit and years of experience in petty crime, was sketching out the final details on a napkin while Bill, her burly but often bumbling partner in crime, munched loudly on a thick sandwich, sending crumbs cascading onto the dusty floor like tiny confetti. Molly looked up from her notes with a determined glint in her eye. "Bill, we hit the party at four. Sneak in as guests, grab the presents fast." Bill swallowed hard, his eyes widening in disbelief. "From kids? Molly, that's harsh. What if they chase us?" She waved off his concerns with a flourish of her hand. "We'll wear disguises. You as a magician, me as the assistant. Create a diversion with tricks then load up the van." He shifted uncomfortably in his creaky wooden chair. "Magician? I can't do magic. And those little monsters are unpredictable. Remember the last time we dealt with children?" "Stop worrying. The plan is foolproof. Parents will be busy with the cake. We enter through the side gate during musical chairs. Exit with the loot while they're singing happy birthday. Simple as pie," Molly assured him confidently, her voice steady despite the absurdity of their scheme. Bill nodded slowly, though doubt still lingered on his weathered face. "Alright, but no clown makeup. I draw the line there. Split the take fifty fifty?" "Fifty fifty after my cut for planning. Now let's go over the escape route one more time," she replied, pointing at the crudely drawn map. "And remember to act natural around the pony rides," Molly added as an afterthought. "Pony rides? This is getting too complicated for a bunch of toys," he grumbled in response. The conversation continued on as they fine-tuned their outrageous strategy, incorporating elements like using a remote controlled car borrowed from a previous job as a distraction for the youngsters and blending in seamlessly with the other distracted adults at the event. The room filled with their back and forth banter and occasional bursts of laughter, turning the serious business of thievery into what felt more like a comedic routine. Molly emphasized the importance of precise timing to avoid any mishaps with overzealous party planners, while Bill kept interjecting with his silly fears about sticky fingers and sudden tantrums that could derail everything. Despite the humorous nature of targeting an innocent children's celebration for all the presents, they treated the planning with the same level of professionalism as their previous bank jobs. The heist promised to be one for the record books, whether for the substantial haul of new toys, video games, and gift cards or for the inevitable chaos that would surely ensue when the theft was discovered amid the colorful party streamers, scattered wrapping paper, and deflated balloons. They imagined the parents scratching their heads in confusion and the children perhaps not even noticing if their presents vanished, maybe even blaming it on a magical unicorn or the birthday boy's sneaky dog. With each passing minute their plan grew increasingly elaborate and ridiculous, ensuring that this preparatory scene would set the stage for a heist destined to go hilariously wrong.
created_at
"2026-05-01T12:50:00.54Z"
5/1/2026, 12:50:00 PM
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
113604
moderation_latency
(null)
generation_time
119181
tokens_prompt
77
tokens_completion
2878
native_tokens_prompt
184
native_tokens_completion
11037
native_tokens_completion_images
(null)
native_tokens_reasoning
10311
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0277553
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"1f1cefaf-fa46-cab6-253b-b6501472d06f"
is_byok
false
latency
99
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777639800-aoCWYWy6YYyJxPDa8g52"
session_id
(null)
api_type
"completions"
id
"gen-1777639800-7AoQKAJfs7yAtha59Wxw"
upstream_id
"1f1cefaf-fa46-cab6-253b-b6501472d06f"
total_cost
0.0277553
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
609
26.92%
Dialogue to Total Word Ratio
Ratio: 24.88%, Deviation: 5.12%
neededClean
false
wordsTotal
611
wordsDialogue
152
13.4623%