NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR9378DF88STQ7QQQZQMQ38J
Run Details
Model:
Gemma 4 26B (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
5m 14s
Cost:
$0.010377
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
82.23%
Dialogue to Total Word Ratio
(details)
41.1127%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn crayon map across the grease-stained workbench. The flickering fluorescent light above them cast long, dramatic shadows across the cluttered garage, making the tiny drawings of a suburban backyard look like a high-stakes battlefield. She tapped a red circle with a jagged fingernail, her expression grim. Bill rubbed his chin, his eyes darting nervously toward the colorful sketch. He looked less like a master thief and more like a man waiting for a dental appointment. He shifted uncomfortably in his creaky metal chair, the screech of the legs echoing through the empty space. "Do we really have to go in through the bouncy castle?" Bill asked. "It seems structurally unsound for a professional heist." Molly adjusted her tactical vest, which was disproportionately stuffed with extra batteries and several packs of fruit snacks. She paced the small concrete floor with the intensity of a drill sergeant. "The castle is our insertion point," Molly insisted. "The bouncing will mask our heavy footsteps." "I don't know, Molly. What if a kid lands on me mid-extraction? I have a very sensitive lumbar region." Molly rolled her eyes and let out a long, weary sigh. She leaned over the table, her shadow looming large against the brick wall. "Then you hold still and pretend to be a giant, misunderstood plushie. We need those LEGO sets, Bill." Bill's eyes widened, and he leaned closer to the map, sniffing the air as if he could smell the plastic bricks. "What about the snack situation? I heard there's a chocolate fountain. A professional needs sustenance." Molly snapped her fingers, the sound echoing in the hollow garage. She stood up straight, pointing at the target zone. "Focus! We aren't here for the brownies. We want the Nintendo Switches and the drones." "And the glitter bombs?" Bill whispered, his voice trembling slightly. He gripped the edge of the workbench until his knuckles turned white. "The intel suggests a high concentration of sequins and tiny whistles." Molly pulled a pair of dark goggles from her belt. "We'll wear tactical goggles. Now, listen. I go left, past the juice boxes. You go right, through the wrapping paper." "Is there a backup plan if the birthday boy sees us?" "If Timmy sees us, you initiate Protocol Piñata." "Protocol Piñata? That sounds unnecessarily violent." "Just dance and hope they mistake you for an entertainer. It's foolproof." Bill slumped in his chair, looking utterly defeated. "I'm not a very good dancer. I have the grace of a startled moose." "Move with purpose! Once the cake is cut, that's our window." "A sensory vacuum? Is that a legitimate term or just heist jargon?" "It's a heist term. Just grab the boxes and get to the van." "The van is parked behind the clown's rental truck, right?" "Yes. And Bill?" "Yeah?" "If you touch the gummy bears, the deal is off." "I make no promises, Molly." Molly grabbed her gear and headed for the door, her heavy boots clicking against the cold floor. "Let's get moving. The cake waits for no man." Bill scrambled after her, tripping over a discarded wrench and a pile of old newspapers. "Wait, do we have a getaway driver?" "You're the driver, Bill. That's why we're in a van." "Right. I'll go check the tire pressure and the juice box inventory."
created_at
"2026-05-10T14:03:54.169Z"
5/10/2026, 2:03:54 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1217
moderation_latency
(null)
generation_time
313755
tokens_prompt
77
tokens_completion
22510
native_tokens_prompt
82
native_tokens_completion
25917
native_tokens_completion_images
(null)
native_tokens_reasoning
21643
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.01037664
router
(null)
provider_responses
0
endpoint_id
"eb382d64-dce1-42d5-b23d-6fefb91948a3"
id
"chatcmpl-aa535006c1ce2768"
is_byok
false
latency
1209
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"SiliconFlow"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778421834-RI43LTkhQLadHq8l7jfX"
session_id
(null)
api_type
"completions"
id
"gen-1778421834-H4hAjnyGNqchlRuSYVHo"
upstream_id
"chatcmpl-aa535006c1ce2768"
total_cost
0.01037664
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
552
82.23%
Dialogue to Total Word Ratio
Ratio: 51.98%, Deviation: 1.98%
neededClean
false
wordsTotal
556
wordsDialogue
289
41.1127%