NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8X8D58R8PFZVHZZZFGFENQ
Run Details
Model:
Gemma 4 31B (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3m 1s
Cost:
$0.002145
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
95.01%
Dialogue to Total Word Ratio
(details)
90.6936%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread a detailed hand-drawn map of the suburban living room across the scarred coffee table. She pointed a red laser pointer at a sketched pile of balloons. "The primary target is the gift table," Molly whispered. "High visibility, low security, and completely unguarded during the festivities." Bill chewed on a piece of peppermint gum, looking deeply skeptical. He shifted his weight, making the old floorboards creak. "Are we really sure about the timing? The cake comes out at three. That's when the sugar-fueled chaos usually peaks." Molly glared at him. She adjusted her thick glasses, which were sliding down her nose from the oppressive heat. "Exactly, Bill. When the sugar rush hits, the perimeter collapses. We strike precisely during the Pin the Tail on the Donkey event." Bill scratched his chin. He looked at the blueprints, which were actually just crayon sketches on a piece of salvaged cardboard. "What about the parents? They have eyes everywhere, and some of those soccer moms look like they could take us in a fight." Molly smirked. She reached into her oversized bag and pulled out two neon-colored, polyester clown suits that smelled of mothballs. "We blend in. We are the hired entertainment. I’m the professional balloon artist, and you’re the clumsy, silent mime." Bill stared at the shimmering fabric with visible horror. He stepped back, nearly knocking over a rickety floor lamp. "I can't be a mime, Molly. I have a thing about absolute silence. It makes me nervous and twitchy." Molly sighed and slammed her hand on the table, rattling the laser pointer and a half-empty bag of chips. "It’s a stealth mission, Bill! Now, let's discuss the getaway. The stolen tricycle is far too slow for a clean escape." Bill pointed toward the window where a modified golf cart sat idling loudly in the driveway. "I upgraded the engine with parts from a heavy-duty lawnmower. We can hit thirty miles per hour if we don't hit a curb." Molly nodded, finally satisfied. She circled the gift table in a heavy red marker with aggressive precision. "Perfect. Now, what exactly is the contingency plan if a toddler catches us in the act?" Bill shrugged. He looked genuinely confused by the prospect of a child obstacle. "I don't know. Maybe a lollipop? A handful of glitter to blind them while we bolt for the fence?" Molly shook her head. She looked at the wall clock, noting they were behind schedule. "We use the distraction technique. Throw a handful of confetti, scream like a banshee, and run for the exit." Bill looked at the clown suit again, his expression softening as he imagined the costume. "Do I at least get to wear the big red nose?" Molly rolled her eyes and pushed the suit toward him with a sigh of resignation. "Yes, Bill. You get the nose, the oversized shoes, and the giant bow tie. Now put it on and let's move."
created_at
"2026-05-10T12:19:40.335Z"
5/10/2026, 12:19:40 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
141
moderation_latency
(null)
generation_time
180972
tokens_prompt
77
tokens_completion
5302
native_tokens_prompt
82
native_tokens_completion
5617
native_tokens_completion_images
(null)
native_tokens_reasoning
4544
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00214512
router
(null)
provider_responses
0
endpoint_id
"9d4644e9-a5e5-41f4-b810-d75f484d2c24"
is_byok
false
latency
16
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Parasail"
status
429
1
endpoint_id
"29b94af5-d8f8-419a-9cf1-ae1f2831142b"
is_byok
false
latency
138
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Venice"
status
429
2
endpoint_id
"4149ab03-3b69-45a0-b94c-6c253b8641a1"
id
"chatcmpl-RRop2WPDdD8H5ADH8obFlFaq"
is_byok
false
latency
54
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778415580-kT4hnwj1PsYyZSlwS3Ns"
session_id
(null)
api_type
"completions"
id
"gen-1778415580-hvHoCYetqTtURUxrVaUC"
upstream_id
"chatcmpl-RRop2WPDdD8H5ADH8obFlFaq"
total_cost
0.00214512
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
489
95.01%
Dialogue to Total Word Ratio
Ratio: 51.01%, Deviation: 1.01%
neededClean
false
wordsTotal
494
wordsDialogue
252
90.6936%