NC Bench
Run: 2026-05-10-01KR90CTJC0MSVQ0G1V2WQPAK8
Run Details
Model: Gemma 4 26B (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 1m 50s
Cost: $0.002400
Provider: DeepInfra
Evaluation summary
Score | Evaluator | Details Link
0.00% | Matches word count | (details)
54.50% | Dialogue to Total Word Ratio | (details)
27.2500% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The dim light of the single bulb flickered over the blueprints, which were actually a series of frantic, colorful crayon drawings on the back of a dinosaur-themed coloring book. Molly traced the perimeter of the living room with a sharpened pencil, her eyes narrowed in a look of professional, life-or-death intensity. The target was a suburban ranch-style house, currently under siege by twelve sugar-crazed six-year-olds. To an outsider, it looked like a standard weekend afternoon, but to Molly, it was a high-security fortress. "The perimeter is soft," Molly whispered, tapping a red circle around the snack table. "Too soft," Bill replied. "The bouncy castle provides too much cover for the host." "Ignore the castle. We focus on the gifts under the dining table. We need to be ghosts, Bill. Invisible, silent, and smell-free." "What about the toddler in the sticky pajamas?" "He is a wildcard," Molly said. "Avoid his path at all costs." "I will try, but I cannot promise I won't get distracted by the cupcakes," Bill muttered. Molly ignored him, pointing her pencil at a cluster of brightly wrapped boxes. The haul was legendary. They were looking at high-end building blocks, remote-controlled drones, and potentially a handheld gaming console. It was the score of a lifetime, provided they did not get caught by a frantic mother armed with a juice box or a lukewarm chicken nugget. Every box represented a potential payday, a way out of their current life of low-stakes petty theft. "We move during the Piñata distraction," Molly instructed. "The chaos could be overwhelming," Bill cautioned. "The sugar rush is a variable we cannot control." "It is the only way. Once the donkey breaks, we sweep the hallway." "And if the birthday boy sees us?" "Then you deploy the distraction." Bill reached into his tactical vest and pulled out a small, crinkly plastic bottle. "The bubble solution?" he asked. "Exactly. Pop some bubbles and keep him occupied while I bag the Lego sets." 
Bill sighed, staring at the crayon-drawn living room. "I feel like a criminal, Molly." "You are a criminal, Bill. Now, grab the bags." He nodded, though his eyes wandered to a stray blue crayon. He picked it up and began shading in the bouncy castle with grim determination. Molly watched him, wondering if she could have picked a partner who was not so easily distracted by the allure of coloring within the lines. The heist was set. Tomorrow, they would strike the most dangerous crowd in the tri-state area: a group of children who had just discovered the magic of chocolate cake.
created_at
"2026-05-10T13:14:30.867Z"
5/10/2026, 1:14:30 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
133
moderation_latency
(null)
generation_time
109971
tokens_prompt
77
tokens_completion
7416
native_tokens_prompt
82
native_tokens_completion
7043
native_tokens_completion_images
(null)
native_tokens_reasoning
6740
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00240036
router
(null)
provider_responses
0
endpoint_id
"d71c6981-347d-4622-a86b-2c9e5d0f2c6c"
id
"chatcmpl-RJxQzW49ooo1mHHVyQeoomd9"
is_byok
false
latency
46
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778418870-rrBhNkb4ei4tj9iaYBFM"
session_id
(null)
api_type
"completions"
id
"gen-1778418870-5suNYWbBLG2vxLTJAqAH"
upstream_id
"chatcmpl-RJxQzW49ooo1mHHVyQeoomd9"
total_cost
0.00240036
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
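The headline figures in the run details can be cross-checked against the raw metadata. The sketch below is a minimal illustration, assuming that `generation_time` is reported in milliseconds and that the displayed cost is `usage` rounded to six decimal places; both are inferences from the values on this page, not documented behavior. The helper names are hypothetical. The last figure is simply the ratio of the two reported token counters and makes no assumption about whether reasoning tokens are a subset of completion tokens.

```python
def format_duration(ms: int) -> str:
    """Render milliseconds as 'Xm Ys', rounded to the nearest second."""
    total_seconds = round(ms / 1000)
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


def format_cost(usd: float) -> str:
    """Render a dollar amount with six decimal places."""
    return f"${usd:.6f}"


# Values copied from the metadata on this page.
generation_time_ms = 109971
usage_usd = 0.00240036
native_tokens_completion = 7043
native_tokens_reasoning = 6740

duration = format_duration(generation_time_ms)
cost = format_cost(usage_usd)

# Ratio of the two reported counters, as a percentage.
reasoning_to_completion = round(
    100 * native_tokens_reasoning / native_tokens_completion, 2
)

print(duration, cost, reasoning_to_completion)
```

Under these assumptions the formatted duration and cost agree with the "Duration" and "Cost" fields shown in the run details.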
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 429
54.50% | Dialogue to Total Word Ratio | Ratio: 33.48%, Deviation: 3.48% | neededClean: false, wordsTotal: 442, wordsDialogue: 148
27.2500% | Overall | |
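The ratio and deviation reported above follow directly from the word counts. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. It is an illustration under stated assumptions, not the benchmark's actual evaluator: the quote-matching rule, the whitespace tokenization, and the function names are all hypothetical, and treating the overall score as the unweighted mean of the two evaluator scores is an inference that happens to match the numbers on this page. How the 3.48% deviation maps to the 54.50% score is not stated in the source and is not modeled here.

```python
import re

TARGET_RATIO = 30.0  # percent of dialogue requested by the scenario


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed tokenization)."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed rule)."""
    quoted = re.findall(r'"([^"]*)"', text)
    return sum(count_words(segment) for segment in quoted)


def ratio_percent(dialogue: int, total: int) -> float:
    """Dialogue share of all words, as a percentage with two decimals."""
    return round(100 * dialogue / total, 2)


# Counts reported in the evaluation details.
words_dialogue = 148
words_total = 442

ratio = ratio_percent(words_dialogue, words_total)
deviation = round(abs(ratio - TARGET_RATIO), 2)

# Assumption: the overall score is the plain mean of the evaluator scores.
overall = (0.00 + 54.50) / 2

# Small usage example on a made-up sentence.
sample = 'She nodded. "Grab the bags," Molly said. "Now."'
sample_dialogue = count_dialogue_words(sample)
sample_total = count_words(sample)

print(ratio, deviation, overall, sample_dialogue, sample_total)
```

Note that the two evaluators report different totals (429 words versus 442), so they presumably tokenize the response differently; the sketch does not attempt to reproduce either count from the text.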