NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8X65JE9NZ38GCB1DQ4R4ZD
Run Details
Model:
Gemma 4 26B (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 22s
Cost:
$0.003130
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
49.9982%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The basement was sweltering, smelling of damp concrete and the faint, lingering scent of a strawberry-scented unicorn candle that served as their only light source. Molly leaned over the tactical map, her eyes narrowed with the grim determination of a seasoned operative. The map was actually a child's drawing of a suburban bungalow, rendered in erratic strokes of cerulean and crimson crayon, but under the flickering flame, it looked like a legitimate intelligence briefing. She traced a finger along the path to the living room, her movements slow and deliberate. Across the table, Bill sat in a state of heightened anxiety, polishing his glasses with a napkin that was dangerously close to being a piece of used deli paper. He stared at the drawing of the bouncy castle, contemplating the structural dangers of inflatable architecture with the intensity of a theoretical physicist. "Listen up," Molly whispered. "The target is the Henderson residence. We strike during the birthday festivities, specifically in the window between the cake and the sugar meltdown." Bill adjusted his glasses, leaning into the flickering light. "What about the bouncy castle? The rhythmic thumping could trigger a seismic event." Molly tapped the paper with a heavy finger. "The castle is a distraction. While the kids defy gravity, we hit the gift corner for the heavy hitters." "And the clown?" Bill asked. "An external variable," Molly replied. "Ignore him. We want drones, LEGOs, and dinosaurs. Leave the sweaters for the amateurs." Bill nodded, though his gaze drifted to a stray gummy bear near the edge of the pizza box. He reached for a heavy-duty flashlight and a pair of specialized climbing gloves, checking the equipment for any sign of dust or sticky residue. He was a man who believed that even a heist for toys required the highest level of tactical preparation and a very clean set of hands. "And the exit?" Bill asked. "Standard extraction," Molly said. "We exit through the laundry room, bypass the toddler, and vanish." Bill stared at her, his expression shifting from terror to grim acceptance. "It is madness. I will pack the wet wipes." Molly smirked, folding the crayon map with practiced, professional precision. "That is the spirit. Let us go get those LEGOs."
created_at
"2026-05-10T12:18:27.029Z"
5/10/2026, 12:18:27 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
281
moderation_latency
(null)
generation_time
141654
tokens_prompt
77
tokens_completion
9764
native_tokens_prompt
82
native_tokens_completion
9188
native_tokens_completion_images
(null)
native_tokens_reasoning
9178
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00312966
router
(null)
provider_responses
0
endpoint_id
"d71c6981-347d-4622-a86b-2c9e5d0f2c6c"
id
"chatcmpl-RDRTfx2EKq70zS9AWcRgEEsI"
is_byok
false
latency
93
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778415507-wmMBW49GV1no1Vfoi6Y3"
session_id
(null)
api_type
"completions"
id
"gen-1778415507-fJqBeHxr13bBMSQTqaf3"
upstream_id
"chatcmpl-RDRTfx2EKq70zS9AWcRgEEsI"
total_cost
0.00312966
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
371
100.00%
Dialogue to Total Word Ratio
Ratio: 30.03%, Deviation: 0.03%
neededClean
false
wordsTotal
373
wordsDialogue
112
49.9982%