NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8WF71DQRR93JTE5QAWW4BT
Run Details
Model:
Gemma 4 31B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
22.6s
Cost:
$0.000249
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread a crayon-drawn map across the kitchen table. She pointed to a circle labeled The Danger Zone. "The bouncy castle is our primary blind spot," Molly whispered. Bill squinted at the map. "Why are we stealing toys, Molly? It is a crime against innocence." "Innocence doesn't pay the rent, Bill. Those are high-end educational blocks. The resale value on the secondary market is astronomical." Bill sighed, leaning back in his chair. "What is the entry point?" "The hedge. We slip through the azaleas, bypass the juice box station, and strike during the cake cutting." "The cake cutting?" Bill asked. "That is when the chaos peaks." "Exactly," Molly replied. "While the children are fighting over frosting, we liberate the gift pile." Bill tapped his chin. "What about security?" "A golden retriever named Buster. He is easily bribed with premium beef jerky." "And the parents?" Bill questioned. "They will notice a mountain of presents vanishing into thin air." Molly smirked, pulling out two oversized neon-pink tutu dresses. "We blend in. We are the eccentric aunts from out of town." Bill stared at the tulle with genuine horror. "I am not wearing a tutu, Molly." "It is a tactical garment, Bill. It provides maximum concealment for the loot bags." "I have a reputation in the underworld," Bill groaned. "I cannot be known as the man in the pink fluff." "Do you want the blocks or not?" Bill looked at the tutu, then at the map. "Will there be actual cake?" "Three tiers of vanilla sponge," Molly confirmed. "Fine," Bill muttered. "But I get to keep the LEGO Star Wars set." "Greedy," Molly noted. "Now, let's discuss the extraction. We exit via the slide." "The slide is for toddlers," Bill countered. "The slide is a high-speed getaway vehicle, Bill. Get with the program." Bill stood up and gingerly held the pink fabric. "How do I put this thing on?" "Step into the circle and pray for the best," Molly instructed. Bill tugged the tutu over his hips, sighing as the elastic snapped against his waist. "I feel a draft." "That is the wind of success, Bill." "It feels more like a breeze on my thighs," he complained. Molly checked her watch and grabbed her bag of jerky. "Check your pockets. Do you have the distraction balloons?" "Yes," Bill replied, sounding defeated. "I have the balloons." "Good. Now, remember the signal. If I sneeze twice, we abandon the teddy bears and run for the fence." "What if I actually sneeze?" Bill asked. "Then you are on your own," Molly said, heading for the door. "Let's go make some toddlers cry."
created_at
"2026-05-10T12:05:54.869Z"
5/10/2026, 12:05:54 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
148
moderation_latency
(null)
generation_time
22416
tokens_prompt
77
tokens_completion
692
native_tokens_prompt
79
native_tokens_completion
629
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00024929
router
(null)
provider_responses
0
endpoint_id
"69279254-6a96-4e9d-abc6-ea76e6bf8e97"
is_byok
false
latency
73
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Novita"
status
429
1
endpoint_id
"4149ab03-3b69-45a0-b94c-6c253b8641a1"
id
"chatcmpl-RzUoDKcH6KFgCRukzQNWHesc"
is_byok
false
latency
48
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414754-t8Y9uEYEOEMMoXWV3awJ"
session_id
(null)
api_type
"completions"
id
"gen-1778414754-kOOrSGTSg3wxetWdJ6pm"
upstream_id
"chatcmpl-RzUoDKcH6KFgCRukzQNWHesc"
total_cost
0.00024929
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
436
0.00%
Dialogue to Total Word Ratio
Ratio: 70.45%, Deviation: 20.45%
neededClean
false
wordsTotal
440
wordsDialogue
310
0.0000%