NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8Z4MJQRQTD1VJWNXW7665A
Run Details
Model:
Gemma 4 31B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
24.1s
Cost:
$0.000253
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.01%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0050%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill stared at the blueprint of the suburban living room with the intensity of a man planning the liberation of a small nation. He tapped a pencil against a hand-drawn X marking the gift table. Molly leaned over his shoulder, chewing on a piece of neon-green bubblegum. "The perimeter is guarded by three golden retrievers and a temperamental toddler," Bill whispered, his voice trembling with dramatic gravity. Molly popped a bubble. "It's a five-year-old's birthday party, Bill. Not the Louvre." "That's exactly what the authorities want us to think," he replied, sliding a photograph of a bounce house across the table. "Look at the structural integrity of this inflatable fortress. One wrong step and we're trapped in a nylon purgatory of primary colors." Molly sighed and began sorting through their gear. She laid out a pair of oversized sunglasses, a bag of high-grade confetti, and two oversized clown shoes. Bill looked at the shoes with genuine suspicion. "Why the footwear?" he asked. "Camouflage," she explained. "We enter as 'Boffo and Bingo,' the freelance entertainment. Nobody questions a clown who wanders toward the presents. It's practically a legal requirement." Bill frowned, crossing his arms. "I don't do slapstick. I'm a professional infiltrator." "You're a man who once got his head stuck in a banister while trying to steal a decorative vase," Molly countered. "Just put on the oversized shoes and try not to sneeze." Bill sighed and pointed to the timing chart. "The window of opportunity is narrow. The cake is served at 2:00 PM. That is the Moment of Maximum Distraction. While the children are entering a sugar-induced frenzy, we strike." Molly checked her watch and smirked. "And the exit strategy?" "We slip out through the sliding glass door, utilizing the hedge maze for cover," Bill explained, his eyes gleaming. "Then we vanish into the suburbs like ghosts in polka-dot ties." Molly reached for a nearby bag of party favors and tossed a plastic whistle to him. "You're taking this way too seriously. We're stealing Lego sets and Barbie dream houses, not the Crown Jewels." "The thrill is in the execution, Molly!" Bill hissed, accidentally blowing the whistle. The shrill blast echoed through the room, causing both of them to jump. "Great," Molly muttered, heading for the door. "Now the neighbors know the professionals have arrived." Bill scrambled to put on his oversized shoes, tripping over his own feet in the process. He looked up from the floor with a look of grim determination. "The plan is still a go," he declared, though his voice was muffled by a stray piece of confetti stuck to his lip. Molly rolled her eyes and led the way to the getaway sedan, wondering if it was too late to recruit a thief who didn't treat a suburban cul-de-sac like a war zone.
created_at
"2026-05-10T12:52:34.016Z"
5/10/2026, 12:52:34 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
364
moderation_latency
(null)
generation_time
24028
tokens_prompt
77
tokens_completion
731
native_tokens_prompt
79
native_tokens_completion
640
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00025347
router
(null)
provider_responses
0
endpoint_id
"9d4644e9-a5e5-41f4-b810-d75f484d2c24"
is_byok
false
latency
18
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Parasail"
status
429
1
endpoint_id
"4149ab03-3b69-45a0-b94c-6c253b8641a1"
id
"chatcmpl-RLxYDfHFUSbdSbfaYBxTfkK4"
is_byok
false
latency
53
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778417554-b3pSEq4T6VdswPns9SNH"
session_id
(null)
api_type
"completions"
id
"gen-1778417554-KAsrqniPiXl5j4QPabSW"
upstream_id
"chatcmpl-RLxYDfHFUSbdSbfaYBxTfkK4"
total_cost
0.00025347
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.01%
Matches word count
n/a
neededClean
false
words
469
0.00%
Dialogue to Total Word Ratio
Ratio: 46.03%, Deviation: 16.03%
neededClean
false
wordsTotal
478
wordsDialogue
220
0.0050%