NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8WZ7YQS05F2NH59XH1EZD7
Run Details
Model:
Gemma 4 26B (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 57s
Cost:
$0.003351
Provider:
Cloudflare
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the brightly colored map across the grease-stained table, smoothing the creases with a frantic intensity. It was not a bank vault blueprint, but a crayon-drawn floor plan of a suburban living room. The dim light of the basement flickered, casting long, dramatic shadows against the concrete walls. Listen up, Bill, Molly whispered, tapping a red circle on the paper. The target is the central gift pile, located beneath the cupcake tower. Bill squinted at the diagram through thick, fogged glasses. He leaned forward into the pool of light, his breathing heavy in the cold air of the garage. Exactly, Molly said, her eyes narrowing. She gripped the edge of the table. One wrong move and we are coated in pink sparkles for a month. I hate sparkles, Molly, Bill muttered, rubbing his weary eyes. They are impossible to get out of a jumpsuit. Focus, she snapped, pointing a gloved finger at the map. She ignored his whining, her mind already three steps ahead. We enter through the sliding door when the bouncy castle distraction begins. Bill scratched his chin, looking deeply concerned. What about the sentries? Molly rolled her eyes. The parents? They will be distracted by the cheese and the inevitable meltdown. And the kids? Bill asked, pulling a small, battered notebook from his pocket. He scribbled a quick note before looking up again. They are unpredictable, Molly. Molly leaned in close, her voice dropping to a conspiratorial hiss. We use the Sleeping Dragon maneuver. We strike during the sugar high. What if someone sees us? Bill asked, his voice trembling slightly. I am not built for stealth in a hat. Molly shoved a tiny, conical hat toward him. You will wear the disguise. It is perfect camouflage. It is a neon blue hat, Molly, Bill said, poking the felt with a finger. It screams thief. It screams festive guest to anyone with a juice box, she countered. Bill sighed, staring at the list of loot. Legos, remote-controlled cars, chemistry sets. This is high-stakes. It is the biggest score of the season, Molly insisted. The resale is astronomical. I am worried about the dogs, Bill admitted, glancing at the door. There are no dogs, Molly said. Only a confused retriever named Buster. That sounds even more dangerous, Bill whispered. Molly grabbed her flashlight and checked the batteries with a professional click. Check your gear. Do you have the gift bags? I have the reinforced ones, Bill replied. They hold dinosaurs. Good, Molly said. Do not touch the bouncy castle. We are not here to play. Bill looked at the bouncy castle icon on the map. Can I at least bounce once? For morale? No bouncing, Bill! Only stealing!
created_at
"2026-05-10T12:14:40.096Z"
5/10/2026, 12:14:40 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
238
moderation_latency
(null)
generation_time
116668
tokens_prompt
77
tokens_completion
11409
native_tokens_prompt
81
native_tokens_completion
11142
native_tokens_completion_images
(null)
native_tokens_reasoning
10705
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0033507
router
(null)
provider_responses
0
endpoint_id
"91f05fe1-9a18-4bcf-8cfa-093c369d8cea"
is_byok
false
latency
181
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"Parasail"
status
429
1
endpoint_id
"e2a16fa6-1f65-413e-9ecb-93901089ddb1"
id
"id-1778415280425"
is_byok
false
latency
157
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"Cloudflare"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778415280-1zMYkKFVX2ROCNA8dUc4"
session_id
(null)
api_type
"completions"
id
"gen-1778415280-xsBhZlvU28PuaRDqIHoG"
upstream_id
"id-1778415280425"
total_cost
0.0033507
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Cloudflare"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
451
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
455
wordsDialogue
0
0.0000%