NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8W47X281CSJ0B48CT509QS
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 1s
Cost:
$0.000253
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprints spread across the grease-stained table were not of a bank vault or a diamond exchange, but a brightly colored layout of the Miller residence, complete with a designated zone for a bouncy castle. Molly tapped a manicured finger on the living room corner, her eyes narrowing with the intensity of a general planning a blitzkrieg. This is the epicenter, Bill. The primary target zone. It’s a high-density cluster of wrapped cardboard and ribbons. Bill squinted at the map, adjusting his spectacles, which were currently held together by a piece of neon pink bubblegum. Is that a drawing of a dinosaur? It’s a tactical representation of a prehistoric-themed obstacle course, Molly snapped. Focus. The objective is simple: total gift extraction. We move in during the peak distraction phase, which is precisely when the clown begins the balloon animal demonstration. The clown? Bill asked, leaning back until his chair groaned in protest. Are we really going up against a professional entertainer? Those guys are unpredictable. They have high mobility and a complete lack of respect for personal space. Molly rolled her eyes, pacing the cramped basement. We aren't fighting the clown, you idiot. We are using him. He is our diversion. While the targets are mesmerized by a poodle made of latex, we slip into the gift corner. How do we handle the security? Bill inquired, rubbing his chin. I’m assuming the parents are the primary sentries. They’re hyper-vigilant, especially when sugar levels start to spike. We bypass the sentries by utilizing the tactical distraction of the cake cutting, Molly explained, her voice dropping to a conspiratorial whisper. Once the singing begins, the sensory overload will create a window of opportunity. Bill scribbled a note on a napkin. Sensory overload. Got it. What’s our extraction method? If we try to carry twenty-five oversized boxes through a backyard, we’re going to look like a very suspicious moving company. Molly pulled a crumpled, bright yellow plastic wagon from beneath the table. The Trojan Horse approach. A wagon? Bill stared at the toy. Molly, we are professional thieves. We are legends in the underworld. We cannot rob a six-year-old's birthday party using a sandbox accessory. It’s a heavy-duty, multi-terrain transport vehicle, she hissed. And it’s inconspicuous. No one suspects a person pulling a wagon filled with colorful debris. You’re insane, Bill muttered, though he was already reaching for his camouflage party hat. I’m prepared, Molly corrected. Now, check your pockets. Do you have the distraction device? Bill reached into his vest and pulled out a handful of high-grade, multi-colored confetti. All present and accounted for. Then let's move, Molly commanded. We have a deadline, and I refuse to be late for the juice boxes.
created_at
"2026-05-10T11:59:55.305Z"
5/10/2026, 11:59:55 AM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
282
moderation_latency
(null)
generation_time
60666
tokens_prompt
77
tokens_completion
736
native_tokens_prompt
79
native_tokens_completion
608
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00025347
router
(null)
provider_responses
0
endpoint_id
"91f05fe1-9a18-4bcf-8cfa-093c369d8cea"
id
"chatcmpl-abb8f252ab335109"
is_byok
false
latency
282
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414395-1vVSY1aSO9e7uqLfjxo2"
session_id
(null)
api_type
"completions"
id
"gen-1778414395-QtVtALK8rpA8fqVtN89y"
upstream_id
"chatcmpl-abb8f252ab335109"
total_cost
0.00025347
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
451
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
460
wordsDialogue
0
0.0000%