NC Bench
Run: 2026-05-10-01KR90CSXFA2SSANH0GZYD8DMY
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
14.6s
Cost:
$0.000212
Provider:
DeepInfra
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the sticky kitchen table, pointing a grease-stained finger at the bouncy castle. "This is the perimeter, Bill. The inflatable fortress is our primary obstacle." Bill leaned in, squinting through his cracked spectacles. "Why is the obstacle a bouncy castle, Molly? It looks like a giant, neon-pink marshmallow." "It’s tactical, you idiot," Molly snapped. "The rhythmic jumping creates seismic interference. It’ll scramble our motion sensors." "I thought the motion sensors were for the jewelry store we hit last month," Bill said, scratching his chin. "This is a high-stakes birthday party, Bill. The loot is concentrated in the gift corner, right next to the juice box station." Bill pulled a small, plastic magnifying glass from his pocket. "What’s our extraction plan for the LEGO sets? They’re notoriously jagged. They’ll shred our tactical gloves." "We use the velvet-lined laundry baskets," Molly whispered, her eyes gleaming with larceny. "We swoop in during the 'Pin the Tail on the Donkey' segment. That’s when the visual surveillance is at its lowest." "And the birthday boy?" Bill asked, his voice trembling slightly. "What if he’s a screamer? I can’t handle high-pitched acoustic alerts." "You’ll wear the noise-canceling headphones I bought you at the pharmacy," Molly reassured him. "Just focus on the unwrapped boxes. Avoid the glitter bombs. They’re a distraction tactic used by the parents." Bill sighed, leaning back in his creaky chair. "Is it really worth it? We’re risking felony charges for a collection of dinosaur figurines and scented markers." "It’s not about the markers, Bill," Molly said, her voice dropping to a gravelly serious tone. "It’s about the principle. No child should have more fun than a professional thief." "That’s a very thin moral compass you’re navigating by," Bill muttered. "Silence! Do you have the disguises?" Molly demanded. Bill reached into a brown paper bag and pulled out two oversized, glittery party hats. 
"I found these in the clearance bin. They’re festive, yet inconspicuous." "They’re neon green, Bill. We’ll look like targets, not guests." "It’s called camouflage, Molly. In a room full of chaos, the brightest object is the hardest to track." Molly stared at the hat, then at the blueprints, then back at Bill. "Fine. But if we get caught because you start eating the party snacks, I’m leaving you to face the toddlers alone." "I make no promises regarding the cocktail sausages," Bill warned. "Then let’s move," Molly commanded, grabbing her gear. "Operation: Naptime begins at fourteen-hundred hours."
created_at
"2026-05-10T13:14:30.198Z"
5/10/2026, 1:14:30 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
96
moderation_latency
(null)
generation_time
14506
tokens_prompt
77
tokens_completion
698
native_tokens_prompt
79
native_tokens_completion
607
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00021191
router
(null)
provider_responses
0
endpoint_id
"d71c6981-347d-4622-a86b-2c9e5d0f2c6c"
id
"chatcmpl-R2QqzYXmpQgYgx4io0i90OXi"
is_byok
false
latency
47
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778418870-N7Rzw8WgMfB4muGVNZUw"
session_id
(null)
api_type
"completions"
id
"gen-1778418870-Gkk3IsMi9yrjNYlvyJZl"
upstream_id
"chatcmpl-R2QqzYXmpQgYgx4io0i90OXi"
total_cost
0.00021191
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
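The timing fields above can be cross-checked against the reported Duration of 14.6s. The following is a minimal sketch, assuming that `latency` and `generation_time` are both in milliseconds (the page does not state units) and that the outer `latency` value of 96 is the one that contributes to the total. The function name `summarize_timing` is hypothetical and not part of the benchmark.

```python
def summarize_timing(latency_ms: int, generation_time_ms: int, completion_tokens: int) -> dict:
    """Combine timing fields into a total duration and a rough throughput.

    Assumes both timing inputs are milliseconds (not stated in the source).
    """
    total_ms = latency_ms + generation_time_ms
    tokens_per_second = completion_tokens / (generation_time_ms / 1000)
    return {
        "total_seconds": total_ms / 1000,
        "tokens_per_second": tokens_per_second,
    }


# Values copied from the metadata of this run.
stats = summarize_timing(
    latency_ms=96,
    generation_time_ms=14506,
    completion_tokens=607,  # native_tokens_completion
)
print(f"total: {stats['total_seconds']:.1f}s")
print(f"throughput: {stats['tokens_per_second']:.1f} tokens/s")
```

Under these assumptions the two fields sum to 14602 ms, which agrees with the 14.6s Duration shown in Run Details, and the throughput comes out at roughly 42 native completion tokens per second.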
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 410
0.00% | Dialogue to Total Word Ratio | Ratio: 72.18%, Deviation: 22.18% | neededClean: false, wordsTotal: 417, wordsDialogue: 301
0.0000% | Overall
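The reported ratio and deviation follow directly from the word counts: 301 dialogue words out of 417 total, measured against the 50% target in the scenario. The sketch below reproduces that arithmetic and shows one naive way to count dialogue words by treating text inside double quotes as dialogue. The function names and the quote-based counting rule are assumptions for illustration. The benchmark's actual tokenization and scoring rule are not shown on this page, and the two evaluators report different totals (410 and 417), so this sketch is not expected to match either count exactly.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) using a naive quote-based rule.

    Words are whitespace-separated tokens; anything between straight or curly
    double quotes counts as dialogue. This is an assumed rule, not the
    benchmark's evaluator.
    """
    total_words = len(text.split())
    spans = re.findall(r'["“]([^"”]*)["”]', text)
    dialogue_words = sum(len(span.split()) for span in spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Reproduce the reported figures from the counts in the table above.
reported_ratio = 301 / 417
print(f"Ratio: {reported_ratio:.2%}, Deviation: {deviation_from_target(reported_ratio):.2%}")
# prints "Ratio: 72.18%, Deviation: 22.18%"

# Small usage example on a hypothetical sentence.
print(dialogue_ratio('Molly nodded. "Let us go now," she said.'))
# prints (4, 8, 0.5)
```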