NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR91KCRBZS5PYJTYS6QKXXAN
Run Details
Model:
Gemma 4 26B (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
1m 2s
Cost:
$0.001535
Provider:
Cloudflare
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint, a color-coded printout of a Pinterest party theme, across the dashboard. "Target is the Henderson residence," she whispered. "Seven-year-old Timmy’s birthday. High density of gift-wrapped goods, high probability of high-value electronics." Bill leaned back, picking a piece of lint off his black turtleneck. "It’s a bouncy castle, Molly. We’re breaking into a bouncy castle." "It’s a tactical, inflated obstacle course, Bill! Keep up." "There’s going to be juice boxes. And those little sandwiches with the crusts cut off. I can't perform a high-stakes heist on an empty stomach." "Focus! The window of opportunity is narrow. We strike during the cake ceremony. That’s when the parents are distracted by the singing and the inevitable sugar-induced chaos of the guests." "What if a kid sees us?" "Then you deploy the distraction protocol." "The distraction protocol? You mean the bubbles?" "Yes, Bill. The professional-grade, high-output bubble machine. You deploy, they chase the suds, we grab the Lego Technic sets and exit via the side gate." "And the clown?" Molly paused, her eyes narrowing. "The clown is the wildcard. He’s unpredictable. He knows balloon animals, Bill. He knows how to manipulate shape and volume. He could be an undercover agent." "He's a guy named Gary in a polyester wig, Molly. He's currently struggling to make a poodle." "That's what he wants you to think! It's a psychological play." Bill sighed, rubbing his temples. "I’m telling you, the risk-to-reward ratio is skewed. We could be hitting a jewelry store. Instead, we're dodging toddlers in dinosaur pajamas." "The jewelry store has lasers, Bill! This place has streamers. It's much more festive, and therefore, much more volatile." "I'll bring the extra-large grabber tool," Bill muttered. "In case the prize is on a high table." "And the tactical diaper bag?" "Already packed. It’s disguised as a floral tote. Very inconspicuous." "Good. Now, remember: no sudden movements. If a child asks if you're a superhero, you nod and say, 'Not today, citizen.' Do not engage in conversation about Minecraft. It’s a rabbit hole we cannot afford." "Understood. No Minecraft. Just loot and bubbles." "Precisely. Now, check your perimeter. I think I see a juice box being opened. The operation is imminent."
created_at
"2026-05-10T13:35:34.676Z"
5/10/2026, 1:35:34 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
302
moderation_latency
(null)
generation_time
62052
tokens_prompt
77
tokens_completion
5294
native_tokens_prompt
81
native_tokens_completion
5090
native_tokens_completion_images
(null)
native_tokens_reasoning
4674
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0015351
router
(null)
provider_responses
0
endpoint_id
"e2a16fa6-1f65-413e-9ecb-93901089ddb1"
id
"id-1778420134937"
is_byok
false
latency
257
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"Cloudflare"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778420134-lVBRgXPmfc3MecDsyQwH"
session_id
(null)
api_type
"completions"
id
"gen-1778420134-CHVx5cUHpSNCJe8BX65m"
upstream_id
"id-1778420134937"
total_cost
0.0015351
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Cloudflare"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
366
0.00%
Dialogue to Total Word Ratio
Ratio: 88.86%, Deviation: 18.86%
neededClean
false
wordsTotal
377
wordsDialogue
335
0.0000%