NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR91F832NJ7TGVG5PPPYMXAN
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
6.1s
Cost:
$0.000263
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her tactical goggles, which were actually just oversized swimming goggles she had spray-painted black. She pointed a laser pointer at a crudely drawn map of a suburban backyard. "The target is the bouncy castle, Bill. That is where the concentrated wealth is located," Molly whispered, her voice dropping into a dramatic rasp. Bill chewed on a piece of stale licorice, looking unimpressed. "Molly, it is a six-year-old’s birthday party. The 'wealth' is mostly plastic dinosaurs and scented erasers." "Do not underestimate the resale value of a limited-edition Lego set, you amateur," she snapped. "I am not an amateur. I am a realist. We are breaking into a gathering of toddlers," Bill replied. "It is not a gathering of toddlers. It is a high-security perimeter of sugar-fueled chaos," Molly corrected. "Now, look at the layout. The cake is the diversion. While the adults are distracted by the strawberry frosting, we strike the gift pile." Bill leaned in, squinting at the map. "What is the plan for the lookout? We can't exactly have a guy in a trench coat standing by the juice boxes." "I have already handled that. I will be disguised as a disappointed aunt," Molly said firmly. Bill stared at her. "You? An aunt? You look like you belong in a heavy metal band." "It is called method acting, Bill. Get with the program. You will be the clown." "I am not wearing the face paint, Molly. Absolutely not," Bill declared, crossing his arms. "The clown is the perfect cover! Who suspects a man in oversized shoes of grand larceny?" "They suspect anyone in oversized shoes. It is a biological imperative," Bill argued. Molly slammed her hand on the table, causing a pile of glitter to scatter. "Focus! We enter through the side gate during the 'Pin the Tail on the Donkey' segment. That is our window of opportunity." "And if a kid catches us?" Bill asked. "What is our extraction protocol for a tantrum?" "We use the distraction technique. Throw a handful of confetti and run toward the driveway," Molly instructed. "That is the most unprofessional escape plan I have ever heard," Bill muttered. "It is foolproof! Now, do you have the heavy-duty burlap sacks?" "I have grocery bags. They have little cartoon ducks on them," Bill said. Molly sighed, rubbing her temples. "Fine. The ducks will add a layer of psychological warfare. It will confuse the parents." "Great. We are stealing presents in duck bags. We are legends," Bill said sarcastically. "We are professionals," Molly insisted, pointing at the map one last time. "Tomorrow, we feast on the spoils of childhood joy." "I am mostly just looking forward to the leftover pizza," Bill added. "Eyes on the prize, Bill! Eyes on the prize!"
created_at
"2026-05-10T13:33:18.827Z"
5/10/2026, 1:33:18 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1051
moderation_latency
(null)
generation_time
5802
tokens_prompt
77
tokens_completion
718
native_tokens_prompt
79
native_tokens_completion
633
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00026268
router
(null)
provider_responses
0
endpoint_id
"eb382d64-dce1-42d5-b23d-6fefb91948a3"
id
"chatcmpl-b5e33e26386eb5f2"
is_byok
false
latency
1051
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"SiliconFlow"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778419998-lbeKfFnTOfX4Yg9kABEf"
session_id
(null)
api_type
"completions"
id
"gen-1778419998-DYKZEO5AWVSCW4L72jr5"
upstream_id
"chatcmpl-b5e33e26386eb5f2"
total_cost
0.00026268
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
458
0.00%
Dialogue to Total Word Ratio
Ratio: 74.62%, Deviation: 24.62%
neededClean
false
wordsTotal
465
wordsDialogue
347
0.0000%