NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR909YGYMRNHNBMVH9HFG9FZ
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
13.6s
Cost:
$0.000221
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprint was spread across the grease-stained table, though it looked less like a tactical map and more like a frantic coloring book. Molly tapped a manicured fingernail against the centerpiece of the drawing: a massive, multi-colored fortress of cardboard and wrapping paper. "It’s a logistical nightmare, Bill," Molly whispered, her eyes narrowing at the tactical layout of the bouncy castle. "The security perimeter is comprised entirely of hyperactive six-year-olds. They have no respect for personal space or the laws of physics." Bill leaned in, adjusting his headset, which was actually just a pair of disconnected gaming headphones he’d found in a junk drawer. "We go in during the cake phase. That’s when the sugar crash hits. That’s our window of opportunity." "And the clown?" Molly asked, her voice dropping an octave. "I’ve seen his kind before. They’re unpredictable. They work for the chaos." "The clown is a distraction, Molly. A red-nosed diversion," Bill insisted, pointing to a circled zone near the snack table. "Once the 'Happy Birthday' song hits the bridge, we execute the 'Gift Grab' maneuver. I’ll create a diversion by pretending to trip over a rogue balloon animal. While the crowd is focused on my simulated agony, you sweep the pile." Molly sighed, rubbing her temples. "And if the birthday boy sees us? He’s a toddler, Bill. They have nothing to lose. They’ll scream, and then the parents—the real heavy hitters—will descend." "Not if we’re wearing the disguises," Bill said proudly, pulling two brightly colored party hats from his bag. "We aren't thieves. We are merely eccentric uncles here to inspect the quality of the party favors." "You look like a man who’s about to be asked to help clean up spilled juice," Molly countered. "We need more subtlety. We need stealth. We need to blend into the background of this pastel-colored purgatory." "Subtlety doesn't win prizes, Molly. Aggression wins prizes," Bill argued, though his eyes flickered toward the pile of Lego sets depicted on the map. "Think about the loot. High-end remote-controlled cars. Educational building blocks. Even some premium scented markers. It’s a goldmine." "It’s a felony in a room full of juice boxes," Molly muttered, though she began sketching a more efficient route through the obstacle course. "Fine. But if that clown starts making balloon poodles, we abort. I am not being interrogated by a man in oversized shoes." "Agreed," Bill said, saluting her with a half-eaten pretzel. "To the party. For glory. For the Lego sets." "For the silence," Molly corrected, grabbing her tactical glitter-glue. "Now, let's go over the exit strategy one more time. If we get cornered by the piñata, what's the move?"
created_at
"2026-05-10T13:12:56.613Z"
5/10/2026, 1:12:56 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
167
moderation_latency
(null)
generation_time
13452
tokens_prompt
77
tokens_completion
712
native_tokens_prompt
79
native_tokens_completion
635
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00022143
router
(null)
provider_responses
0
endpoint_id
"d71c6981-347d-4622-a86b-2c9e5d0f2c6c"
id
"chatcmpl-RiSe9tTcSFWYC5EKYkIKdPUe"
is_byok
false
latency
60
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778418776-4zPb2La5tnQKPSvNZKNx"
session_id
(null)
api_type
"completions"
id
"gen-1778418776-cI3maEtya5YqyB9IAF3w"
upstream_id
"chatcmpl-RiSe9tTcSFWYC5EKYkIKdPUe"
total_cost
0.00022143
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
443
0.00%
Dialogue to Total Word Ratio
Ratio: 64.16%, Deviation: 34.16%
neededClean
false
wordsTotal
452
wordsDialogue
290
0.0000%