NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8WVK6GSPXTT82JWQHZW8Y2
Run Details
Model:
Grok 4.3
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
10.4s
Cost:
$0.002053
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.01%
Dialogue to Total Word Ratio
(details)
0.0040%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"You're sure about this, Bill?" Molly hissed, leaning over the crumpled invitation. "It's a kiddy party. No guards, no alarms—sounds too easy." "It is too easy," Bill replied, tapping the napkin sketch of the garden layout. "But that means twice the loot. The Millers spent half the neighborhood budget on these presents. One good sweep and we're done before the cake is sliced." Molly frowned. "What if a parent spots us? We’re not exactly invisible." Bill shrugged. "We wear the silly hats from last year’s parade. Instant camouflage. Every adult there will be too busy filming the clown." "The clown? You’re joking." "Serious as taxes. The kids won’t notice an extra hat in the crowd. We blend in, smile, steal." Molly wrapped her knuckles on the table. "All right. Phase one: slip through the side hedge while the magician distracts everyone. Phase two: locate the gift table." "Phase three," Bill added, "load the bags. Remember to leave the wrapping paper—less noise, less evidence." "What about the security camera by the garage?" "Already scoped it. A quick spray of that fake snow from the craft aisle fogs the lens for ten minutes. Kids love the stuff, so no one questions why it’s there." Molly smirked. "And you thought the smoke machine from the magic store was overkill." "That was your idea," Bill said, grinning. "Mine was the toy-walkie-talkies. Every mom will think we’re dads phoning upstairs." Molly rolled her eyes. "They’ll see our faces." "Bandanas under the hats. Temporary, stylish, gone in a blink." Bill tapped his pen on the diagram again. "The biggest danger isn’t cameras—it’s the parents’ parking lot. Too many eyes backing up minivans. So we enter on foot, exit on foot." "With twenty presents each?" "Light bags, heavy pockets. Stuff the small toys first. Electronics go in my coat lining. You handle plushies—nobody counts teddy bears." Molly folded the map. "We recon at noon tomorrow. I’ll bring coffee, you bring zip-ties for quick bundling." "And the kid-sized backpacks?" "I’ve got four. Bright colors, perfect misdirection. Nobody questions a couple hauling ‘next door’ gifts to the car." Bill leaned back. "One last thing. If the birthday girl recognizes us—" "She won’t. We’ll be in full birthday disguise: balloons, cheap sunglasses, forced smiles." "They’ll smell fear." Bill laughed. "Fear smells like frosting—we’ll reek of it anyway." Molly tapped her watch. "We do this clean, we’re out in eight minutes flat. You time it." "I already did. Longest recorded magician set is twelve minutes. Plenty of margin." She sighed. "You’re too confident. What if the weather turns?" Bill held up a folded poncho. "Emergency party poncho. Bright yellow, matches the theme. We look like helpful volunteers, not thieves." Molly shook her head, but grinned. "Fine. Hats, bags, ponchos, timing. We’ve got the heist checklist." "We’ve also got sixty other kids’ presents on that list. Let’s try not to trip over a piñata this time." "Last time was the laser museum, not a birthday," Molly muttered. Bill raised his coffee cup. "To small children, big hauls, and terrible memories." She clinked her mug against his. "To eight quick minutes and absolutely no clowns involved after this." Bill winked. "Cheers to that, partner. Now go practice smiling like you mean it."
created_at
"2026-05-10T12:12:40.538Z"
5/10/2026, 12:12:40 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
481
moderation_latency
(null)
generation_time
10185
tokens_prompt
77
tokens_completion
873
native_tokens_prompt
182
native_tokens_completion
757
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
service_tier
"default"
usage
0.0020528
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"d10f96fd-4293-940a-9054-0da92e221c70"
is_byok
false
latency
140
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778415160-AGqFV67MfTSt95Slroj5"
session_id
(null)
api_type
"completions"
id
"gen-1778415160-cPFshQIbIZOUzVpEqBm5"
upstream_id
"d10f96fd-4293-940a-9054-0da92e221c70"
total_cost
0.0020528
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
543
0.01%
Dialogue to Total Word Ratio
Ratio: 83.73%, Deviation: 13.73%
neededClean
false
wordsTotal
547
wordsDialogue
458
0.0040%