NC Bench
Run: 2026-02-20-01KHY8PH29NK8432NJTBKXZR33
Run Details
Model: Grok 4 Fast
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 6.3s
Cost: $0.000300
Provider: xAI
Evaluation summary
Score | Evaluator
0.00% | Matches word count
91.90% | Dialogue to Total Word Ratio
45.9521% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunched over the rickety kitchen table in their dingy apartment, blueprints of the community center spread out like a treasure map. Bill paced back and forth, his mustache twitching with excitement. The plan was insane: crash a birthday bash for twenty screaming toddlers and swipe every gift under the rainbow piñata. "Listen, Bill," Molly said, jabbing a finger at the diagram. "We dress as clowns—big red noses, floppy shoes. No one suspects clowns at a kids' party." Bill snorted, adjusting his imaginary bow tie. "Clowns? What if I honk my nose at the wrong moment and blow our cover? Nah, I'm thinking piñata repairmen. We sneak in with a giant fake one, swap it for the real deal, and bash our way to the goodies." Molly rolled her eyes, sketching a quick balloon animal on the edge of the paper. "Repairmen? These kids'll swarm us like piranhas. We'll need diversions—maybe a smoke bomb that smells like cotton candy. In and out in ten minutes, grab the bikes, bikes full of Barbies and Legos." "Genius," Bill grinned, high-fiving her. "But what about the cake? One kid sees us fleeing with his remote-control dinosaur, he'll wail louder than a siren." "Easy," Molly whispered, leaning in. "We leave a decoy—a single sock monkey. Let the rugrats fight over that while we pedal off rich." Bill chuckled, folding the blueprint. "To the heist of the century! For the love of ill-gotten gains." As they clinked coffee mugs like champagne glasses, Molly couldn't shake the image of tiny fists pounding after them—pure chaos, pure comedy gold.
created_at
"2026-02-20T19:33:47.733Z"
model
"x-ai/grok-4-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2452
moderation_latency
(null)
generation_time
6155
tokens_prompt
77
tokens_completion
405
native_tokens_prompt
216
native_tokens_completion
560
native_tokens_completion_images
0
native_tokens_reasoning
203
native_tokens_cached
154
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0003001
router
(null)
provider_responses
0
endpoint_id
"c9586d9d-97ff-40ca-ae94-22504d6f9b7e"
id
"805de1bf-0149-84c8-125f-a14e406309c0"
is_byok
false
latency
1098
model_permaslug
"x-ai/grok-4-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771616027-QdSORM6zoboiVC9KqXDd"
upstream_id
"805de1bf-0149-84c8-125f-a14e406309c0"
total_cost
0.0003001
cache_discount
0.0000231
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 269
91.90% | Dialogue to Total Word Ratio | Ratio: 55.39%, Deviation: 5.39% | neededClean: false, wordsTotal: 269, wordsDialogue: 149
45.9521% | Overall
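
The ratio and deviation figures above can be reproduced from the reported wordsDialogue and wordsTotal counts. The sketch below is illustrative only: the function name and the 50% target constant are assumptions taken from the scenario text, not the benchmark's actual code. The page does not state how the deviation is mapped to the 91.90% score, so that step is left out. The last lines assume the overall score is the unweighted mean of the two evaluator scores, which matches the displayed 45.9521% to two decimal places but is not confirmed by the page.

```python
from statistics import mean


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return the fraction of all words that are dialogue (hypothetical helper)."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


# Target taken from the scenario "Write 200 words with 50% dialogue".
TARGET_RATIO = 0.50

# Counts reported in the evaluation details.
ratio = dialogue_ratio(words_dialogue=149, words_total=269)
deviation = abs(ratio - TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints: Ratio: 55.39%, Deviation: 5.39%

# Assumption: the overall score is the plain average of the evaluator scores.
overall = mean([0.00, 91.90])
print(f"Overall (assumed unweighted mean): {overall:.2f}%")
```

Because the evaluator scores are shown rounded to two decimals, the assumed mean agrees with the reported overall value only to that precision.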