NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR91M021B3Z4AKZJ89Z83ZAQ
Run Details
Model:
Gemma 4 31B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
16.0s
Cost:
$0.000247
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"It is a tactical strike, Bill. A precision operation," Molly said, slapping a blueprint of a suburban living room onto the table. "It is a five-year-old's birthday party, Molly. Why do we have a blueprint?" "Because the layout of the snack table is critical to our extraction route. Now, look. Entry is through the sliding glass door. We slip in during the cake singing. That is when the adults are distracted by the sheer sonic horror of twenty children screaming in unison." "I don't like the uniforms," Bill muttered, gesturing to the neon-pink tutu and oversized sunglasses lying on the chair. "They are disguises! We are blending in as eccentric aunts from the coast." "I am a six-foot-four man with a beard, Molly. I cannot blend in as an eccentric aunt." "Put on the tutu, Bill. Do you want the LEGO Star Wars sets or not?" Bill sighed, picking up the tulle skirt. "I just feel like we are targeting a vulnerable demographic. These are toddlers." "They have an abundance of resources they cannot possibly utilize! Half of those toys will be abandoned by Tuesday. We are basically providing a recycling service." "And the plan for the presents? Just grab and run?" "Hardly. We use the 'Confusion Diversion.' I drop a bag of glitter in the center of the room, you scoop the gifts into the oversized laundry hamper, and we exit via the patio." "What if the parents catch us?" "I have a decoy. A recording of a child crying in the upstairs bathroom. The moment I hit play, every adult in the room will sprint toward the stairs in a panic. We will have a forty-five second window of absolute anarchy." "You've thought about the glitter cleanup?" "Bill, we are thieves, not janitors." "I'm just saying, glitter is forever. We'll be sneezing sparkles for a month. Also, why are we stealing the presents? We could just rob a bank. A small bank. One with a slow vault." "Banks are boring. Banks have lasers and silent alarms. You know what doesn't have lasers? A suburban bungalow in the suburbs of Ohio. Plus, the resale value on limited edition building blocks is skyrocketing." "I still think the tutu is a step too far." "It's a fashion statement, Bill. Now, practice your 'eccentric aunt' voice. Give me a 'Oh, look at the darling little cupcake!'" "Oh, look at the darling little cupcake," Bill droned, sounding like a depressed robot. "No passion! More whimsy! More 'I’ve spent too much time in Sedona' energy!" "I hate you so much," Bill said, stepping into the tutu. "You love the LEGOs. Now, let's go. We have to hit the craft store for more glitter."
created_at
"2026-05-10T13:35:54.441Z"
5/10/2026, 1:35:54 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
724
moderation_latency
(null)
generation_time
15990
tokens_prompt
77
tokens_completion
680
native_tokens_prompt
79
native_tokens_completion
624
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00024739
router
(null)
provider_responses
0
endpoint_id
"4149ab03-3b69-45a0-b94c-6c253b8641a1"
id
"chatcmpl-Ri1RM6mnF59C3zpaBOoAQK0G"
is_byok
false
latency
113
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778420154-aHHwwdqdtNmvUABD6WOW"
session_id
(null)
api_type
"completions"
id
"gen-1778420154-7yV7JmIqwqrsIsGXkmF0"
upstream_id
"chatcmpl-Ri1RM6mnF59C3zpaBOoAQK0G"
total_cost
0.00024739
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
449
0.00%
Dialogue to Total Word Ratio
Ratio: 89.45%, Deviation: 19.45%
neededClean
false
wordsTotal
455
wordsDialogue
407
0.0000%