NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8ZSVSEN6PN3Q85BZE369SA
Run Details
Model:
Grok 4.3
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
7.5s
Cost:
$0.001450
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted at the map spread across the diner booth, chewing on the end of her pen until the cap split. "The bouncy castle’s the sweet spot," she said. "One kid pops the wrong glove, and the whole thing deflates like a sad balloon animal." Bill topped up her coffee from a chipped Thermos and shook his head. "No, the trick is the birthday-boy table. They pile every gift there like it’s a miniature Fort Knox, all wrapped in glitter tape that gives you away if you tear too loud." He tapped the paper with his spoon and winced. "We hit it right after the pinata swings." Molly laughed once, low and quick, then folded the map smaller. "Last time we tried the last-minute grab, the aunt with the razor glare cornered us. This year, we go in as clowns, slip behind the face-painting booth, and load the van while they’re busy screaming at the magician for missing swords." Bill’s eyebrows jumped. "Clowns? You hate makeup. Besides, those red noses squeeze your sinuses so you sound like a dying duck every time you breathe. We’d give ourselves away in ten seconds." Molly leaned forward, voice dropping the way she did when the job finally felt real. "So we skip the makeup. Borrow the chef coats from delivery day and push a sheet cake cart into the gym. People don’t notice the cake guy; they notice the frosting. You push, I hide inside the shell with a bundle of duffel bags. When the lights go for the wish, I crawl out, fill the bags, and you roll everything back out the service door." Bill tilted his head, counting seconds the way he always did before nodding. "Cake cart works. But the tiny villains always want a piece while we’re rolling. One sneeze and cake becomes a crime scene." She snorted into her mug. "Then we hit them with the decoy confetti cannons first. Seventy percent glitter, thirty percent itch powder. They’ll be too busy crying to notice we’re gone." Bill folded his arms and leaned into the fake leather seat, letting the plan settle. "And when we exit, the DJ’s fog machine will cover us better than any smoke grenade. Kids scream, parents panic, we coast out with fifty thousand dollars’ worth of Pokémon cards and plush dragons." Molly snapped the pen cap back on and grinned, the booth light catching the grease streak on her chin. "One party, one exit, zero witnesses who can spell our names."
created_at
"2026-05-10T13:04:09.53Z"
5/10/2026, 1:04:09 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
471
moderation_latency
(null)
generation_time
6894
tokens_prompt
77
tokens_completion
598
native_tokens_prompt
182
native_tokens_completion
516
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
service_tier
"default"
usage
0.0014503
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"14f84b64-7ba4-9f63-b44a-f985fe68f40e"
is_byok
false
latency
148
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778418249-Tpv5joo1K5BKxZJM79mO"
session_id
(null)
api_type
"completions"
id
"gen-1778418249-rA7AyzIW0glDFRLsO4MY"
upstream_id
"14f84b64-7ba4-9f63-b44a-f985fe68f40e"
total_cost
0.0014503
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
417
0.00%
Dialogue to Total Word Ratio
Ratio: 70.00%, Deviation: 40.00%
neededClean
false
wordsTotal
420
wordsDialogue
294
0.0000%