NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR91H66KF5WRMNHNX6YGZJWQ
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
11.5s
Cost:
$0.000205
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned over the blueprints of the Sunnyvale Community Center, stabbing a finger at a diagram of the bouncy castle. "We hit the sugar rush, Bill. That is the window of opportunity," Molly whispered. "The sugar rush?" Bill squinted at the map. "You mean when the toddlers enter a state of hyperactive psychosis?" "Exactly. When the parents are distracted by the chaos and the kids are vibrating at a frequency that defies physics, we move in." "Molly, this is a birthday party. We are professional thieves. We usually hit banks, or high-end art galleries, or at least a jewelry store that doesn't smell like grape juice." "The payoff is the same, Bill! Think of the volume. It is not just one diamond; it is forty-five high-end, battery-operated, brand-name toys. It is a logistical goldmine." "It is a logistical nightmare. I am not wearing a clown suit." "Who said anything about a clown suit? You are going undercover as the professional face-painter." "I do not have the temperament for face-painting, Molly. I have a resting crime face. I cannot draw a butterfly on a six-year-old without looking like I am marking my territory." "Just wear the apron. Keep your head down. When the cake is cut, you grab the pile of wrapped boxes near the gift table and bring them to the van." "And you?" "I will be the exhausted mother in the corner, weeping softly into a juice box. No one questions a woman who has clearly reached her limit with toddlers." "That is uncomfortably close to reality." "Focus, Bill! The target is the pile of presents. Once the singing starts, the perimeter is unguarded." "What if a kid asks me for a Spiderman mask? I cannot do Spiderman. I can barely draw a circle." "Then you tell them you are out of red pigment and move to the next victim. Do not engage in conversation." "What if they cry?" "Then you create a diversion by dropping a tray of cupcakes. Use the chaos." "I hate this plan. I really do. My dignity is plummeting faster than your moral compass." "Dignity does not pay the rent, Bill. Now, do you have the heavy-duty gift bags?" "I have them. But I am telling you, if I end up on a playground bully's TikTok, I am retiring." "Just grab the Lego sets and keep moving. Are we clear?" "Crystal. But I want a larger cut for the emotional trauma of dealing with a bouncy castle." "Fine. Twenty percent extra. Now, let's go get some face paint."
created_at
"2026-05-10T13:34:22.426Z"
5/10/2026, 1:34:22 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
107
moderation_latency
(null)
generation_time
11156
tokens_prompt
77
tokens_completion
636
native_tokens_prompt
79
native_tokens_completion
587
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00020511
router
(null)
provider_responses
0
endpoint_id
"d71c6981-347d-4622-a86b-2c9e5d0f2c6c"
id
"chatcmpl-RmgbLKc5x9hnE5iKCrUgOEIz"
is_byok
false
latency
54
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778420062-0O7OhqYgbgAqcf9HKZWZ"
session_id
(null)
api_type
"completions"
id
"gen-1778420062-EvWxLuBPPqy2lEApZM0q"
upstream_id
"chatcmpl-RmgbLKc5x9hnE5iKCrUgOEIz"
total_cost
0.00020511
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
424
0.00%
Dialogue to Total Word Ratio
Ratio: 93.78%, Deviation: 23.78%
neededClean
false
wordsTotal
434
wordsDialogue
407
0.0000%