NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR919K07BBRANCX2Y129TDWQ
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
5.2s
Cost:
$0.000201
Provider:
Cloudflare
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the sticky picnic table, her eyes narrowed. "This is the most high-security perimeter I have ever encountered, Bill. There are bouncy castles everywhere." Bill leaned in, chewing on a piece of stolen licorice. "It’s a tactical nightmare. Look at that obstacle course. It’s designed to slow down anyone with a heavy gait." "We aren't just any thieves," Molly hissed, pointing to a drawing of a pink tent. "We are professionals. The target is the central gift pile. Located directly behind the clown station." "The clown?" Bill whispered, his voice trembling. "Molly, the clown is a wildcard. He has unpredictable movements and possibly greasepaint-based weaponry." "Focus, Bill. The clown is a distraction. The real threat is the sugar rush. If the kids enter a frenzy, we won't be able to move the loot through the crowd." "What’s the extraction plan?" Bill asked, wiping sweat from his forehead. "We can't just walk out with three crates of LEGO and a motorized dinosaur." "We use the balloon animal diversion," Molly explained, tapping the map. "I’ll create a flurry of balloon poodles near the snack table. While they are distracted by the squeaking, you grab the heavy hitters. The electronics, the remote-controlled cars, the designer dollhouses." "And the loot bags?" Bill queried. "The ones with the tiny plastic whistles and the stickers?" "Those are secondary objectives. We prioritize high resale value. No whistles, Bill. I am not risking a felony for a whistle." "What if a child spots us?" Bill asked, looking nervously toward the imaginary party. "They have eyes like hawks, Molly. And they don't fear authority." "If a child approaches, you deploy the emergency protocol," Molly commanded. "Tell them we are the professional gift-wrapping consultants. They won't question our presence if we have rolls of shiny paper." "That’s thin, Molly. Really thin. What if they ask for a gift wrap?" "Then you wrap the damn gift, Bill! Just keep them busy!" Bill sighed, pulling a tiny screwdriver from his pocket. "I should have stayed in the jewelry business. At least diamonds don't throw juice boxes at you." "Jewelry is predictable," Molly countered, folding the blueprint with surgical precision. "Children are chaos. And chaos is where the profit lies." "I hate chaos," Bill muttered. "I really do." "Do you have the getaway wagon?" Molly asked, her eyes gleaming with greed. "It's disguised as a motorized ice cream truck," Bill confirmed. "But the music is catchy. It might compromise our stealth." "Then turn the volume down and keep your eyes on the prize. We strike at two o'clock, right after the cake is served." "Copy that," Bill said, standing up. "May the odds be ever in our favor, and may the clown be easily distracted."
created_at
"2026-05-10T13:30:13.391Z"
5/10/2026, 1:30:13 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
120
moderation_latency
(null)
generation_time
4958
tokens_prompt
77
tokens_completion
732
native_tokens_prompt
79
native_tokens_completion
643
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0002008
router
(null)
provider_responses
0
endpoint_id
"e2a16fa6-1f65-413e-9ecb-93901089ddb1"
id
"id-1778419813504"
is_byok
false
latency
101
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"Cloudflare"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778419813-HPiMOXTxBsbofMLJ0m7M"
session_id
(null)
api_type
"completions"
id
"gen-1778419813-tyZe6rEJMfX6CLFkP7oV"
upstream_id
"id-1778419813504"
total_cost
0.0002008
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Cloudflare"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
456
0.00%
Dialogue to Total Word Ratio
Ratio: 79.78%, Deviation: 29.78%
neededClean
false
wordsTotal
460
wordsDialogue
367
0.0000%