NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY98941VAZANF2ZZY9K3HQP
Run Details
Model:
Grok 4.1 Fast
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
7.0s
Cost:
$0.000315
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
57.09%
Dialogue to Total Word Ratio
(details)
77.3577%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunched over the diner booth table, blueprints of "Little Timmy's Birthday Bash Palace" spread out like a treasure map. Bill slurped his coffee, eyes gleaming. "Bill, it's genius," Molly whispered. "Hundred screaming rugrats, piñata stuffed with cash gifts. We crash as party entertainers—me the magician, you the clown." Bill snorted. "Clown? I'd look like a deranged balloon animal. What if the kids mob me?" "Perfect distraction!" Molly grinned. "You juggle pies in their faces while I sleight-of-hand the present pile into duffel bags under the bounce house." "But the parents? Those soccer moms with wine boxes?" "Punch bowl's our alibi. Spike it with 'magic potion'—sleepy-time juice. Harmless, just drowsy." Bill leaned in. "Entry: bounce house unzip from inside? Or piñata drop-line from the chandelier?" "Chandelier! Candy explosion covers our grab. Out the back via cake delivery van." "Van's gold—'Happy Birthday from Uncle Thief!'" Molly high-fived him. "At midnight, we're rich. Kids cry, we laugh to the bank." Bill chuckled. "Best heist ever. No alarms, just tears and frosting." "Deal sealed. Costumes tomorrow?" "Deal. But if I get glitter in my crack, you're waxing it out."
created_at
"2026-02-20T19:43:29.421Z"
2/20/2026, 7:43:29 PM
model
"x-ai/grok-4.1-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
624
moderation_latency
(null)
generation_time
6785
tokens_prompt
77
tokens_completion
392
native_tokens_prompt
216
native_tokens_completion
589
native_tokens_completion_images
0
native_tokens_reasoning
298
native_tokens_cached
154
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0003146
router
(null)
provider_responses
0
endpoint_id
"9821aa0e-dce3-4c27-9716-161359123d0b"
id
"9cca7e2c-ebdc-d0da-64f0-2e188b878142"
is_byok
false
latency
377
model_permaslug
"x-ai/grok-4.1-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771616609-kpIqQkW6ZDQNCbcq0KF5"
upstream_id
"9cca7e2c-ebdc-d0da-64f0-2e188b878142"
total_cost
0.0003146
cache_discount
0.0000231
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
193
57.09%
Dialogue to Total Word Ratio
Ratio: 81.35%, Deviation: 8.65%
neededClean
false
wordsTotal
193
wordsDialogue
157
77.3577%