NC Bench
Run: 2026-05-10-01KR91T4729BE0AJA7G64M60H1
Run Details
Model: Gemma 4 31B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 28.8s
Cost: $0.000259
Provider: DeepInfra
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread a crayon-drawn map of the suburban backyard across the kitchen table. She pointed a sharp fingernail at a plastic slide. "This is the insertion point," Molly whispered. Bill squinted at the drawing. "Why is the dog a giant red circle?" "Because Barnaby is a menace, Bill. He’s a golden retriever with a god complex." Bill leaned back, crossing his arms. "I still don't get the angle. It's a six-year-old's birthday. Why the tactical gear?" "Because six-year-olds are feral," Molly snapped. "They have the sensory awareness of bats and the cruelty of Roman emperors. We go in fast, we grab the pile under the gift table, and we vanish before the cake is sliced." Bill sighed, eyeing his oversized neon-yellow raincoat. "I feel ridiculous. I look like a human highlighter." "You're blending in! It's a theme party. You're a 'Cheerful Sun.' Now, do you have the diversion?" Bill reached into his pocket and produced a small, remote-controlled bubble machine. "The Bubble-Tron 3000. One click and the perimeter is engulfed in soapy chaos." "Perfect. Now, the extraction." Molly traced a line toward the fence. "We exit via the hedge. We avoid the trampoline at all costs." "Why the trampoline?" Bill asked. "Once you hit the bounce, you're visible from space. You'll be a beacon for the parents." Bill frowned. "What about the presents? They're wrapped in heavy foil. We can't just stuff them in our pockets." "That's why we're using the oversized strollers," Molly explained, gesturing to two modified baby carriages filled with velvet lining. "We push them out like we're just tired parents. Nobody questions a tired parent." Bill paused. "Wait, who is the target gift? Is there a big prize?" "The Limited Edition Galactic Defender set," Molly said, her eyes gleaming. "Retail value two hundred dollars, but the resale on the dark web is astronomical." Bill nodded slowly. "And the cake? We aren't taking a slice?" Molly glared at him. "This is a heist, Bill, not a potluck." 
"Just a cupcake?" Bill pleaded. "Focus! Now, let's review the timeline. We enter at 2:00 PM, exactly when the piñata breaks. The noise will mask our footsteps." "What if the piñata is empty?" Bill asked. "Then we improvise," Molly replied. "I've got a handful of confetti and a smoke grenade in my purse." Bill stared at her. "A smoke grenade? For a backyard in the suburbs?" "Better to over-prepare than to be captured by a group of first-graders," Molly insisted. "Are you in or out?" Bill sighed and pulled on his yellow hood. "I'm in. But I'm taking a cupcake."
created_at: "2026-05-10T13:39:15.306Z" (5/10/2026, 1:39:15 PM)
model: "google/gemma-4-31b-it-20260402"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 153
moderation_latency: (null)
generation_time: 18653
tokens_prompt: 77
tokens_completion: 683
native_tokens_prompt: 79
native_tokens_completion: 655
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.00025917
router: (null)
provider_responses:
  0:
    endpoint_id: "576fca5b-f09c-4bda-b560-9989c39ac3ab"
    is_byok: false
    latency: 10078
    model_permaslug: "google/gemma-4-31b-it-20260402"
    provider_name: "Together"
    status: 504
  1:
    endpoint_id: "4149ab03-3b69-45a0-b94c-6c253b8641a1"
    id: "chatcmpl-RFzSNy9x7TYrnkvV0PUWxwzn"
    is_byok: false
    latency: 91
    model_permaslug: "google/gemma-4-31b-it-20260402"
    provider_name: "DeepInfra"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1778420355-fchnSoi7ABm0Cnfbf5jg"
session_id: (null)
api_type: "completions"
id: "gen-1778420355-vEykEuAlOpRqAUgalz58"
upstream_id: "chatcmpl-RFzSNy9x7TYrnkvV0PUWxwzn"
total_cost: 0.00025917
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepInfra"
response_cache_source_id: (null)
Evaluation details
Matches word count: 0.00%
  Details: n/a
  Meta Data: neededClean: false, words: 430
Dialogue to Total Word Ratio: 0.00%
  Details: Ratio: 73.64%, Deviation: 23.64%
  Meta Data: neededClean: false, wordsTotal: 440, wordsDialogue: 324
Overall: 0.0000%
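The reported ratio and deviation follow from the two word counts in the metadata: 324 dialogue words out of 440 total, measured against the 50% target in the prompt. The sketch below reproduces that arithmetic. The function names, the quote-matching rule, and the whitespace word split are assumptions made for illustration; the page does not show how NC Bench actually tokenizes words or detects dialogue, nor how the deviation is mapped to the 0.00% score.

```python
import re


def ratio_from_counts(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio, deviation) for a dialogue word count against a target share."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


def dialogue_stats(text: str, target: float = 0.5) -> dict:
    """Hypothetical counter: treats text inside straight double quotes as dialogue.

    Words are split on whitespace. The real evaluator may count differently.
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target),
    }


# Counts taken from the evaluation metadata on this page.
ratio, deviation = ratio_from_counts(324, 440)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 73.64%, Deviation: 23.64%"
```

Run on the reported counts, this matches the Details line for the ratio evaluator. The response overshoots the dialogue target by about 24 percentage points.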