NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8Y1NF8KCQZ66FV2YDYKH77
Run Details
Model:
Gemma 4 26B (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 36s
Cost:
$0.002579
Provider:
Io Net
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly smoothed the glitter-streaked map across the sticky diner table. "Listen closely, Bill. This is the big one. Operation: Naptime." "Operation: Naptime?" Bill leaned in, his eyes wide. "Is there a tactical nap involved? Because I could really use one." "We are stealing the loot, Bill. We are not napping." "Right, right. The presents. The high-end Legos, the remote-controlled drones, the limited-edition action figures. Got it." "Exactly. Now, the entry point. We avoid the sliding glass door. The mother is stationed there with a tray of organic juice boxes. She’s a hawk." "A juice-box-wielding hawk? Terrifying." "We go through the laundry room window. It’s unlatched, assuming the toddler hasn't jammed it with a rogue crayon." "And the security detail?" Bill whispered, glancing nervously at a nearby waitress. "I heard reports of a clown." Molly sighed deeply. "The clown is our primary tactical obstacle. He is unpredictable, highly mobile, and smells like face paint and existential dread. We bypass him entirely." "What if he tries to make a balloon animal? I have a very low threshold for balloon animals, Molly. It’s a psychological vulnerability." "Focus! Once we’re inside the living room, we strike the present mountain. We bag everything, move to the kitchen, and exit through the garage." "How do we handle the gift-opening ritual?" Bill asked. "We don't," Molly replied. "We strike while the birthday boy is busy being the center of attention. By the time he's tearing through wrapping paper, we're halfway to the safe house with the Lego Star Wars set." "Wait, what about the bouncy castle?" Bill asked. "The seismic activity from the jumping could trigger a perimeter alarm." "It’s a backyard, Bill, not a nuclear silo. There are no alarms. Just hyperactive seven-year-olds." "Seven-year-olds are dangerous," Bill muttered. "They have no respect for tactical boundaries or personal space." "Precisely. We move during the cake ceremony. The singing provides the necessary acoustic cover. While they’re distracted by the sugar rush, we sweep the mountain." "And the getaway?" "You'll be idling in the minivan around the corner." Bill slumped in his seat. "A minivan? I feel like a heist requires more menace. A black sedan? A motorcycle with sidecars?" "You drive a beige Honda Odyssey with a Baby on Board sticker, Bill. Accept your reality." "Fine. But if the clown corners me, I’m claiming I’m part of the traveling circus." "Just grab the loot and get in the van," Molly snapped. "And for the love of God, do not touch the bouncy castle."
created_at
"2026-05-10T12:33:28.049Z"
5/10/2026, 12:33:28 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4443
moderation_latency
(null)
generation_time
156418
tokens_prompt
77
tokens_completion
5442
native_tokens_prompt
82
native_tokens_completion
5134
native_tokens_completion_images
(null)
native_tokens_reasoning
4756
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0025793
router
(null)
provider_responses
0
endpoint_id
"0bc24d30-bd26-4fa8-968b-260c0011a4a5"
id
"chatcmpl-bf860dac83b4f748"
is_byok
false
latency
4443
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"Io Net"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778416408-HlP39rUzu4GTmGD7vq0N"
session_id
(null)
api_type
"completions"
id
"gen-1778416408-f4dMxlGRj6qp7k5qAe2O"
upstream_id
"chatcmpl-bf860dac83b4f748"
total_cost
0.0025793
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Io Net"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
415
0.00%
Dialogue to Total Word Ratio
Ratio: 89.91%, Deviation: 19.91%
neededClean
false
wordsTotal
426
wordsDialogue
383
0.0000%