NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8W9ZWSGJGEFCDJVE2RT10Z
Run Details
Model:
Gemma 4 26B (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
7m 18s
Cost:
$0.003351
Provider:
Novita
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
4.39%
Dialogue to Total Word Ratio
(details)
2.1946%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprint was taped to the grease-stained table, illuminated by a single, flickering bulb that hummed with a low, rhythmic anxiety. Molly tapped a red circle drawn around the living room with a gloved finger. "This is the kill zone, Bill," she whispered. "It is a playdate, Molly," Bill replied, picking a piece of lint off his tight, tactical turtleneck. "It is a high-density gift repository, Bill. Do not be pedestrian," she snapped. "But there is a bouncy castle in the perimeter," Bill argued, gesturing toward the backyard sketch. "How are we supposed to move stealthily through inflatable architecture? The squeaking alone will alert the sentries." Molly ignored him, sliding a grainy, long-lens photo of a clown across the table. The clown's painted smile looked predatory under the dim light, and his oversized shoes seemed to threaten the very concept of silence. "The clown is a wildcard," she said. "His movements are erratic and his balloon-animal production is unpredictable." "He is just a guy named Gary in face paint," Bill sighed. "He is probably more worried about his balloon animals than us." "That is exactly what a professional infiltrator wants you to think," Molly countered. "We need precision, Bill. Precision and sugar." "I just want to know if there are any security dogs," Bill added. "Just a Golden Retriever named Buddy," Molly replied. "He is easily bribed with peanut butter." Bill adjusted his black mask, which kept slipping down his nose. "What about the biological hazards? I heard there is a high concentration of spilled juice boxes and sticky fingers." "We use the tactical wipes," Molly said, her eyes narrowing. "And the heavy-duty, non-see-through trash bags. We strike during the Happy Birthday song. It is the perfect sonic cover." "The singing will be off-key, Molly," Bill pointed out. "It will be more distracting than a siren." "Exactly. Chaos is our best friend," she replied. Molly pulled out a small, velvet-lined box. Inside sat a single, high-tech laser pointer and a bag of premium, organic gummy worms. "Why the worms?" Bill asked, leaning in. "Distraction," Molly said. "If the toddlers become a physical obstacle, we deploy the glucose. It will create a sugar-induced frenzy, giving us a forty-second window to extract the LEGO sets and the remote-controlled drones." "It feels morally questionable," Bill muttered, staring at the worms. "It is high-stakes larceny, Bill. Now, grab the tactical cake server," Molly commanded. Bill stared at the silver utensil. He wondered if they could have just robbed a bank.
created_at
"2026-05-10T12:03:03.713Z"
5/10/2026, 12:03:03 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2648
moderation_latency
(null)
generation_time
437853
tokens_prompt
77
tokens_completion
8770
native_tokens_prompt
81
native_tokens_completion
8351
native_tokens_completion_images
(null)
native_tokens_reasoning
8095
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00335093
router
(null)
provider_responses
0
endpoint_id
"03b96766-ef7e-454f-9519-c76b70654b49"
id
"f739ef30370ea42b1dfa267a9168fe20"
is_byok
false
latency
2495
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"Novita"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414583-6UFCSUYiQ9eQAzoZIKOt"
session_id
(null)
api_type
"completions"
id
"gen-1778414583-KOQE5qUnZRyfcJhpQDOY"
upstream_id
"f739ef30370ea42b1dfa267a9168fe20"
total_cost
0.00335093
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
416
4.39%
Dialogue to Total Word Ratio
Ratio: 57.91%, Deviation: 7.91%
neededClean
false
wordsTotal
430
wordsDialogue
249
2.1946%