NC Bench
Run: 2026-05-10-01KR90WJ66Y7WRH70BY652HTQD
Run Details
Model:
Gemma 4 31B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
27.2s
Cost:
$0.000243
Provider:
DeepInfra
Evaluation summary
Score      Evaluator
0.00%      Matches word count
0.00%      Dialogue to Total Word Ratio
0.0000%    Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the kitchen table, which was actually just a piece of plywood balanced on two crates of stolen laundry detergent. She pointed a laser pointer at a crude drawing of a suburban living room. The target was a seven-year-old’s birthday bash, a chaotic vortex of sugar and screams that provided the perfect tactical cover for a smash-and-grab. "The cake is the pivot point," Molly whispered, her eyes narrowing. "Once the candles are lit, every single adult in the room will suffer from tunnel vision. That is our window." Bill was currently struggling to fit into a giant, plush dinosaur costume. He looked less like a prehistoric predator and more like a lime-green beanbag chair with limbs. He let out a muffled grunt as he tried to zip the polyester hide over his belly. "Why am I the dinosaur?" Bill asked, his voice echoing inside the foam head. "Because you're the distraction, Bill. You’re going to initiate a 'prehistoric rampage' in the backyard. While the children are fleeing in terror or trying to ride you like a pony, I’ll be sliding through the dog door." Bill paused, his tiny felt arms flapping uselessly. "What if they like me? What if the kids want to hug the dinosaur?" Molly sighed and rubbed her temples. "Then you use the 'T-Rex Tantrum.' Knock over a bowl of chips, scream like a banshee, and lure them toward the trampoline. Just don't get stuck in the fence again." Bill shifted his weight, causing the costume to make a loud, swishing sound. "I'm just saying, a ninja outfit would have been more aerodynamic. This thing smells like a damp carpet." "Ninjas don't blend in at a themed party, Bill. Dinosaurs do. Now, let's review the loot. We have three Lego sets, a miniature drone, and a rumored high-end gaming console. It’s a goldmine." Bill looked down at his oversized claws. "How am I supposed to signal you when the coast is clear? I can't exactly use a walkie-talkie while I'm pretending to be an extinct reptile." 
"Three loud roars," Molly instructed. "Two for 'all clear,' three for 'run for the getaway van,' and one long, mournful bellow if you accidentally sit on the birthday cake." Bill let out a tentative, high-pitched squeak. "Like this?" Molly stared at him for a long beat, then slowly lowered her laser pointer. "We are definitely losing our professional reputation in this industry." "I think I look majestic," Bill replied, attempting to strike a menacing pose and accidentally tipping backward, crashing into the crates of detergent with a heavy thud. Molly didn't look up from the map. "Just get in the van, you oversized lizard."
created_at
"2026-05-10T13:23:06.574Z"
5/10/2026, 1:23:06 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
166
moderation_latency
(null)
generation_time
27138
tokens_prompt
77
tokens_completion
681
native_tokens_prompt
79
native_tokens_completion
613
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00024321
router
(null)
provider_responses
0
endpoint_id
"4149ab03-3b69-45a0-b94c-6c253b8641a1"
id
"chatcmpl-Rh1EKJEJfKo8Xr5j7DznoLuU"
is_byok
false
latency
60
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778419386-wzfilF9FoG2xZHQTWXpx"
session_id
(null)
api_type
"completions"
id
"gen-1778419386-qm4WY8CgwOViggmuFLDA"
upstream_id
"chatcmpl-Rh1EKJEJfKo8Xr5j7DznoLuU"
total_cost
0.00024321
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result     Evaluator                       Details                             Meta Data
0.00%      Matches word count              n/a                                 neededClean: false, words: 447
0.00%      Dialogue to Total Word Ratio    Ratio: 53.51%, Deviation: 23.51%    neededClean: false, wordsTotal: 456, wordsDialogue: 244
0.0000%    Overall
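The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator can be reproduced from the word counts in its metadata. The sketch below is illustrative only: the function name `dialogue_ratio`, the quote-matching rule (text between double quotes counts as dialogue), and the whitespace-based word split are assumptions, not the evaluator's actual implementation. The 30% target comes from the scenario prompt.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a prose passage.

    Assumption: dialogue is any span enclosed in straight or curly
    double quotes, and words are whitespace-separated tokens.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'["“]([^"”]*)["”]', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


# Word counts taken from the evaluator metadata on this page.
words_dialogue, words_total = 244, 456
target = 0.30  # "30% words of dialogue" from the prompt

ratio = words_dialogue / words_total
deviation = abs(ratio - target)
summary = f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}"
print(summary)  # Ratio: 53.51%, Deviation: 23.51%
```

Running `dialogue_ratio` on the model response itself may not give exactly 244 and 456, since the evaluator's tokenization and quote handling are not shown on this page.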