NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8W96QNH9A4GKG7HZGC0PK0
Run Details
Model:
Grok 4.3
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
11.5s
Cost:
$0.001368
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slouched against the battered van’s hood, tracing a finger across the crumpled blueprint. “Think about it, Bill. The bouncy castle’s the weak spot—those parents won’t leave the punch bowl unguarded for more than twenty seconds.” Bill dropped a duffel of black bags and yanked the zipper open. “Yeah, but the moment we step inside they’ll peg us as the creepy uncle and aunt who forgot the RSVP. I say we go in as clowns. Gives us face paint and an excuse to carry oversized bags.” Molly snorted, wiping imaginary dust from her jacket. “Clowns terrify kids, not parents. We want the grown-ups relaxed enough to ignore the giant sacks under the table. I’m thinking balloon guys handing out free helium. Toddlers love it; the adults look away.” “Helium tanks weigh a ton,” Bill said, hefting an imaginary cylinder. “Unless you plan on huffing fake voices all night, we’ll need real muscle. Maybe just dress as maintenance.” “They called the maintenance guy last week. Unless you plan on arriving with a busted trampoline and sounding like you fix moon bounce springs for fun.” Bill sighed and checked his cheap digital watch. “Party starts at three. Presents go on the folding table by the sliding glass door exactly when the magician starts. His big finale is confetti cannons. Everyone claps, heads turn, we dive.” Molly tapped the paper. “I’ll create the diversion—drop dad’s peanut butter cup into the dog bowl. When the mutt goes nuts, every parent flocks to the allergy announcement board posted on the garage door. You snake in, grab the gift bags stacked alphabetically, and load them into the wagon we stashed under the neighbor’s oak.” Bill raised an eyebrow. “You sure the dogs won’t swallow aluminum foil and blow our cover?” “Positive. The retriever hasn’t seen peanut butter since Christmas. We’ll be heroes by the time they notice the presents are history.” Bill folded the blueprint and shoved it into his pocket. “Okay, but we bail the second the mother starts counting loot bags. I don’t need some forty-year-old in yoga pants tackling me for a Spider-Man lunchbox.” Molly cracked a grin. “Relax. By then we’ll be two blocks down in the van, laughing our way to the pawn shop that never asks for receipts.”
created_at
"2026-05-10T12:02:37.949Z"
5/10/2026, 12:02:37 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
775
moderation_latency
(null)
generation_time
11401
tokens_prompt
77
tokens_completion
605
native_tokens_prompt
182
native_tokens_completion
483
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
service_tier
"default"
usage
0.0013678
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"8e496657-cb71-98c6-b826-6eb541297c2f"
is_byok
false
latency
356
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414557-XhX8MEJTARHoLxH7xS9p"
session_id
(null)
api_type
"completions"
id
"gen-1778414557-q6csjHz6nXj6sNZDDfxO"
upstream_id
"8e496657-cb71-98c6-b826-6eb541297c2f"
total_cost
0.0013678
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
380
0.00%
Dialogue to Total Word Ratio
Ratio: 81.77%, Deviation: 51.77%
neededClean
false
wordsTotal
384
wordsDialogue
314
0.0000%