NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-01-01KQHSJ1NZERVQRSN959EY83EY
Run Details
Model:
Grok 4.3 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
3m 51s
Cost:
$0.059155
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
99.98%
Matches word count
(details)
99.61%
Dialogue to Total Word Ratio
(details)
99.7948%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill sat hunched over in the back of their beat up van that was disguised with a fake pizza delivery sign to avoid suspicion from any nosy locals. The community center across the street was decked out with streamers and balloons for a big children's birthday party that had attracted a crowd of hyperactive kids and tired parents. This was no ordinary job for the comedic duo of thieves whose previous heists had gone hilariously wrong. Today their target was the mountain of brightly wrapped presents stacked near the front of the party room, which they planned to liberate during the height of the festivities when chaos reigned supreme. Bill had brought along a comically large bag that he claimed was for laundry but was actually for stuffing gifts, while Molly had rigged up a simple plan involving sneaking through the service entrance. "Bill, this is going to be a cakewalk if we time it right," Molly said as she peered through a crack in the van window. "But what if one of those kids with sticky hands grabs onto me? I sure don't want frosting all over my getaway outfit," Bill replied with a worried tone. She laughed softly before responding, "Just keep moving and act like you're looking for the bathroom. I'll handle the grabbing while you watch for any approaching adults." "Are you up for this or should I do the heavy lifting?" she continued. Continuing the setup they had discussed for hours, she proposed they blend in by wearing the extra silly hats from a previous failed job as party supply. The plan was to enter separately, make it to the table during the blindfolded game, load up the bag, and exit through separate doors before meeting back at the van to divide the spoils which they hoped would include gadgets and games that could be sold for a quick buck. Molly emphasized the importance of not getting sidetracked by the tempting birthday cake which always smelled amazing but could slow them down. "Accepting some cake might help us blend in but don't get seconds like last time," Bill chimed in. "Don't forget the extra sacks are in the back. We can't leave any present behind because each one is a potential payday for us," she replied quickly. "Yeah, and don't trip on balloons or we'll get caught in this silly party heist," he added. She nodded in agreement. "Now let's go over the signals again. If all is clear I'll nod twice then you move in for the heist of the presents." He agreed with a thumbs up from the shadows of the van interior. They knew this heist had to go smoothly to add to their quirky reputation. With one last check of their tools including a pair of kid sized gloves for better grip on gift boxes they prepared to make their move as the sounds of music started blaring from the party inside signaling the start of activities.
created_at
"2026-05-01T12:52:30.02Z"
5/1/2026, 12:52:30 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
228408
moderation_latency
(null)
generation_time
230901
tokens_prompt
77
tokens_completion
6495
native_tokens_prompt
184
native_tokens_completion
23597
native_tokens_completion_images
(null)
native_tokens_reasoning
23039
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0591553
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"04966e8b-9381-9f1e-9de8-4314473f66a7"
is_byok
false
latency
120
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777639950-GGCsC6QO2YQEJdhQXl2a"
session_id
(null)
api_type
"completions"
id
"gen-1777639950-fiw1b0rXhd49yIvkjAKW"
upstream_id
"04966e8b-9381-9f1e-9de8-4314473f66a7"
total_cost
0.0591553
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.98%
Matches word count
n/a
neededClean
false
words
498
99.61%
Dialogue to Total Word Ratio
Ratio: 29.72%, Deviation: 0.28%
neededClean
false
wordsTotal
498
wordsDialogue
148
99.7948%