NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8WK9NX4DZ672BPPTW567BV
Run Details
Model:
Grok 4.3
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
12.3s
Cost:
$0.001478
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill sat hunched over the crumpled blueprints of the Pevensie house on a folding table lit by a single crooked lamp. "Right," Molly said, tapping a crayon-scribbled square marked "Ball Pit." "We go in during the musical-chairs round when everyone’s shrieking. You distract the clown, I slip behind the gift table." Bill snorted. "Distract him how? Juggle three whoopie cushions? I’m a thief, not Bozo." "It’s a children’s party, Bill. No one questions a grown man flailing about. Just spill some juice and act upset about the rug." He squinted at the diagram. "At least the goodie bags are near the door. We could grab those too, fewer witnesses." Molly rolled her eyes. "We’re not touching the goodie bags. This isn’t about loot profit; it’s about the big haul under the tree. Last year they gave away iPads. iPads, Bill." "Last year’s party was the Johnsons," he muttered. "They’ll learn." She traced a line from the kitchen window to the living-room sofa. "In, grab, out through the patio before the parents break the piñata. Forty seconds max." Bill leaned back, chair creaking. "What if some snot-nosed five-year-old starts crying?" "You stuff a balloon in his mouth and keep walking." "That’s your plan—balloon gag?" "Better than yours, which was honking like a goose while carting a sack through the front hall." He grinned despite himself. "Honking does draw focus." Molly slid two black drawstring bags across the table. "These are for the loot. Silent, flexible, no rustle like canvas. No fingerprints either, if you manage to keep your gloves on longer than thirty seconds." Bill sighed, rubbing his temple. "One day we’ll hit a bank like normal criminals." "One day," she agreed, "but today we’re going after the mother lode of wrapped electronics. You got the clown mask?" He patted the duffel at his feet. "Pink, polka-dotted, smells faintly of funnel cake." "Good. We trade him a bigger nose for his absence." Bill checked his watch. "Party starts in four hours. Two hundred screaming six-year-olds, one exhausted magician, and a mountain of paper with bows. You ready?" Molly zipped her jacket. "Born ready. Just remember—no presents, no payday."
created_at
"2026-05-10T12:08:08.646Z"
5/10/2026, 12:08:08 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
452
moderation_latency
(null)
generation_time
12186
tokens_prompt
77
tokens_completion
597
native_tokens_prompt
182
native_tokens_completion
527
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
service_tier
"default"
usage
0.0014778
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"859b4201-7003-9712-a605-a45d267ea2db"
is_byok
false
latency
117
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414888-8QJEAUdhTUY8lHBtE93r"
session_id
(null)
api_type
"completions"
id
"gen-1778414888-yEYj51PIcZacFkjTpFN4"
upstream_id
"859b4201-7003-9712-a605-a45d267ea2db"
total_cost
0.0014778
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
361
0.00%
Dialogue to Total Word Ratio
Ratio: 73.78%, Deviation: 43.78%
neededClean
false
wordsTotal
370
wordsDialogue
273
0.0000%