NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8WEQH5K635FZVRYRD6JE2S
Run Details
Model:
Gemma 4 26B (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
1m 26s
Cost:
$0.003032
Provider:
Io Net
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0007%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly smoothed the crinkled construction paper over the dashboard of the beat-up van. The interior smelled of stale coffee and nervous energy. "Alright, listen up. This is the most high-stakes job of our careers. One slip, one squeaky sneaker, and we are toast." "It’s a five-year-old’s birthday, Molly," Bill said, leaning back with a bag of stale pretzels. "The most dangerous thing in there is the juice boxes." "That’s exactly what they want you to think," Molly hissed. "The juice boxes are a distraction. While they are all sticky-handed and distracted by the fruit punch, that is when we strike the gift table." "What is the extraction plan?" Bill asked, crunching loudly. "The bouncy castle is a nightmare for heavy lifting. I will bounce, I will hit the ceiling, and the whole thing goes down." "We are not using the bouncy castle," Molly snapped. "We use the side entrance, through the sliding glass door. We grab the LEGO Technic sets, the remote-controlled drones, and the high-end building blocks. We leave the stuffed animals. They are too bulky and have zero resale value." "What about the clown?" Bill whispered, his eyes widening. Molly paused. "What about the clown?" "He is a wildcard, Molly. I saw him at the last gig. He makes balloon animals. You know how fast those things can be? He has precision. He has reach." "The clown is not our problem," Molly insisted. "Our problem is the birthday boy, Leo. He is a tactical genius. I heard he can spot a hidden snack from fifty paces." "He is a kid, not a sniper," Bill countered. "He is a kid with a high-frequency scream, Bill! If he sounds the alarm, the parents descend like hawks. We need to be in and out during the Happy Birthday song. That is the sonic cover we need." "And the cake?" Bill asked. "Do we secure the cake?" Molly stared at him, deadpan. "We are thieves, Bill, not gluttons. We are here for the plastic gold, not the buttercream." "I am just saying, a distraction involving a rogue cupcake could be effective," Bill muttered. "Focus! We enter at 14:00 hours. We move in a low-profile crawl under the snack table. We bypass the piñata—that is a booby trap if I ever saw one—and we head straight for the pile of wrapped glory." "What if the piñata is filled with high-grade candy?" Bill asked. "I could facilitate a secondary theft." "No secondary thefts! We follow the protocol. Grab, bag, and vanish before the singing ends." "Got it," Bill said, grabbing his mask. "Steal the toys, avoid the clown, ignore the cake. Easy." "It is not easy!" Molly yelled. "It is a minefield of glitter and chaos!"
created_at
"2026-05-10T12:05:38.989Z"
5/10/2026, 12:05:38 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
565
moderation_latency
(null)
generation_time
85305
tokens_prompt
77
tokens_completion
5944
native_tokens_prompt
82
native_tokens_completion
6039
native_tokens_completion_images
(null)
native_tokens_reasoning
5260
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0030318
router
(null)
provider_responses
0
endpoint_id
"0bc24d30-bd26-4fa8-968b-260c0011a4a5"
id
"chatcmpl-936d2a965caf048f"
is_byok
false
latency
565
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"Io Net"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414739-2xeEFJlcLFYw6t83xifL"
session_id
(null)
api_type
"completions"
id
"gen-1778414739-owwPLxSlvGFrLRiI1djJ"
upstream_id
"chatcmpl-936d2a965caf048f"
total_cost
0.0030318
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Io Net"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
454
0.00%
Dialogue to Total Word Ratio
Ratio: 84.91%, Deviation: 14.91%
neededClean
false
wordsTotal
464
wordsDialogue
394
0.0007%