NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY84QW4H0EKT0C8D295RFFZ
Run Details
Model:
Grok 4 Fast
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
3.2s
Cost:
$0.000270
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
62.77%
Dialogue to Total Word Ratio
(details)
31.4001%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned over the rickety kitchen table, blueprints of the backyard spread out like a treasure map. "Okay, Bill, listen up. The Anderson's kid turns five tomorrow—whole neighborhood's invited. Presents piled high under that piñata. We slip in as caterers, snag the loot during the cake smash, and poof, we're gone with a van full of Barbie dreams and Lego fortunes." Bill scratched his stubbled chin, eyeing the diagram. "Caterers? Molly, you can't even boil water without setting off the smoke alarm. What if some snot-nosed brat spots us? Kids are like tiny detectives with super senses." She grinned, waving a fake mustache. "That's why we go deluxe: balloon animals for distraction. You twist 'em into getaway cars while I stuff the bags. Parents'll be too busy filming the chaos to notice." He snorted. "Balloon animals? Last time I tried that, it looked like a deflated sausage. And the dog? That mangy mutt at their place barks at shadows. One woof, and we're piñata fodder—whacked by overcaffeinated moms." "Details, details," Molly said, punching his arm. "We dose the punch with sleepy-time herbs, blame it on the caterer—us! By midnight, we'll be fencing stolen teddy bears on the black market. Richer than a kid's piggy bank." Bill sighed, but smirked. "Fine, but if I end up in a clown suit, you're explaining it to the cops."
created_at
"2026-02-20T19:24:04.876Z"
2/20/2026, 7:24:04 PM
model
"x-ai/grok-4-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1053
moderation_latency
(null)
generation_time
2876
tokens_prompt
77
tokens_completion
349
native_tokens_prompt
216
native_tokens_completion
499
native_tokens_completion_images
0
native_tokens_reasoning
183
native_tokens_cached
154
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0002696
router
(null)
provider_responses
0
endpoint_id
"c9586d9d-97ff-40ca-ae94-22504d6f9b7e"
id
"9340363b-0088-e5f1-246d-8ed45dd84bd3"
is_byok
false
latency
80
model_permaslug
"x-ai/grok-4-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771615444-lDnQi3BqZlckfBFHN2Ty"
upstream_id
"9340363b-0088-e5f1-246d-8ed45dd84bd3"
total_cost
0.0002696
cache_discount
0.0000231
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
230
62.77%
Dialogue to Total Word Ratio
Ratio: 81.74%, Deviation: 8.26%
neededClean
false
wordsTotal
230
wordsDialogue
188
31.4001%