NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-01-01KQHSDTFCRZFR6AQCA2RYAKX4
Run Details
Model:
Grok 4.3 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
5m 2s
Cost:
$0.063778
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
5.19%
Dialogue to Total Word Ratio
(details)
51.9508%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Under the cover of night, parked in an inconspicuous station wagon across the street from the festive house with colorful balloons tied to the mailbox and loud music pumping from inside, the two thieves Molly and Bill leaned over a hand drawn map on the dashboard. Molly whispered urgently about their daring plan to her partner. The plan was simple yet full of potential for comedic mishaps if anything went even slightly wrong during the infiltration of the happy home. The success depended on timing and luck in equal measure. "Bill, listen up," she began, "this kids' birthday bash is our golden ticket. All those presents wrapped in shiny paper under the tree in the living room, just waiting for us to swipe them while the little brats are busy with pin the tail on the donkey." Bill chuckled softly while adjusting his fake mustache disguise which had a tendency to slip off during intense moments of the heist, and replied, "Yeah, but how do we get past the moms with their eagle eyes? I mean, we can't just waltz in as Uncle Bob and Aunt Sue again like last time. That ended with us getting stuck in a bouncy castle." The dim streetlights flickered overhead as Bill's words hung in the air, highlighting the risks involved in their reckless endeavor. Molly nodded thoughtfully, then continued, "Exactly, so here's the genius part: we'll pose as the balloon animal artists. I bring my pump and make a few dachshunds out of balloons to distract, you grab the gifts and load them into the wagon while everyone's oohing and aahing." Bill scratched his head, "But what if one of the kids recognizes us from that embarrassing clown school dropout story? Remember when I popped all the balloons trying to make a giraffe? Total disaster." Molly rolled her eyes, "Oh come on, Bill, it's not like we're taking their favorite teddy bears. These are corporate gift exchanges gone wrong anyway. Think of it as redistributing wealth from spoiled toddlers to us, the struggling heist artists. Now, for the exit strategy: once we have the stash, we use the confetti cannon as a smoke screen to escape. Perfect, right?" Bill sighed, "Alright, I'm in, but only if you promise no more hot glue from your crafting phase. This time, keep it simple, no gadgets." The two thieves laughed quietly in their vehicle, their elaborate plan solidifying in the shadows as the joyful kids' laughter echoed from the party house next door, filling them with a mix of anticipation and nervousness for the upcoming escapade that they knew would be their most hilarious yet and perhaps score some leftover cake from the party kitchen to celebrate their victory later. Their hearts raced with excitement as they went over every detail one last time in their heads, making sure nothing was left to chance. The whole thing seemed a bit crazy but they were determined to pull it off now.
created_at
"2026-05-01T12:50:11.571Z"
5/1/2026, 12:50:11 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
287736
moderation_latency
(null)
generation_time
301683
tokens_prompt
77
tokens_completion
6407
native_tokens_prompt
184
native_tokens_completion
25446
native_tokens_completion_images
(null)
native_tokens_reasoning
24865
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0637778
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"9c677090-9756-6f5e-2dac-b1eb03f7c0e7"
is_byok
false
latency
76
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777639811-KXu2uf2itDdmL495lSKr"
session_id
(null)
api_type
"completions"
id
"gen-1777639811-FIeNxzQF94jK2X2lAC0P"
upstream_id
"9c677090-9756-6f5e-2dac-b1eb03f7c0e7"
total_cost
0.0637778
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
494
5.19%
Dialogue to Total Word Ratio
Ratio: 57.69%, Deviation: 7.69%
neededClean
false
wordsTotal
494
wordsDialogue
285
51.9508%