NC Bench
Run: 2026-05-10-01KR91A4EY66DAZY890A3JKMQX
Run Details
Model: Gemma 4 31B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 1m 7s
Cost: $0.000315
Provider: Together
Evaluation summary
Score | Evaluator | Details Link
0.00% | Matches word count | (details)
0.00% | Dialogue to Total Word Ratio | (details)
0.0000% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread a hand-drawn map of the suburban backyard across the kitchen table. She pointed a pencil at a small drawing of a bouncy castle. "The inflatable fortress is the primary obstacle, Bill," Molly whispered. Bill squinted at the map, chewing on a piece of licorice. "Why is it a fortress? It is a bounce house." "It is a tactical diversion," she snapped. "The children will be contained within the bounce zone while the cake is served. That is our window." Bill leaned back, crossing his arms. "And the presents? Where are they?" "The dining room table," Molly replied. "Piled high. We are talking high-end Legos, remote-control drones, the works." Bill frowned. "Is this really a heist or are we just robbing a six-year-old?" "It is a redistribution of plastic, Bill," she insisted. "Besides, the parents are spending four hundred dollars on an entertainer who dresses as a depressed pirate. It is practically a crime not to steal." Bill sighed and looked at his gear. "I have the stealth suits." Molly looked at the pile of neon-colored pajamas on the chair. "Why are they neon?" "They are camouflage," he argued. "If we blend in with the party favors, we are invisible." "We look like oversized highlighter pens," Molly groaned. "It is a bold choice," Bill countered. "What about the extraction?" Molly tapped the map. "The hedge maze. I have timed the loop. If we exit through the azaleas, we hit the getaway van in ninety seconds." Bill scratched his chin. "What if a kid sees us?" "Do we have the bribes?" she asked. Bill patted his pocket. "Three bags of organic fruit snacks and a handful of dinosaur stickers." "Perfect," Molly said. "Will you handle the distraction?" "I can do a very convincing fake sneeze," Bill offered. "It creates a vacuum of attention." "Just don't trip over the balloon arch this time," she warned. "That arch was structurally unsound," Bill grumbled. Molly stood up, her eyes gleaming. 
"Remember, we move fast, we take the boxed sets, and we leave the plushies." "Why leave the plushies?" Bill asked. "Too much bulk, and they might be bugged," she whispered. Bill stared at her for a long moment. "You think a teddy bear is bugged?" "In this neighborhood? Absolutely," Molly replied. "Now, put on your neon pajamas. We have a birthday to ruin." Bill sighed and stepped into the suit. "I really miss robbing banks." "Banks don't have fruit snacks, Bill," Molly reminded him. "Now move."
created_at
"2026-05-10T13:30:31.27Z"
5/10/2026, 1:30:31 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3764
moderation_latency
(null)
generation_time
66524
tokens_prompt
77
tokens_completion
654
native_tokens_prompt
79
native_tokens_completion
598
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0003148
router
(null)
provider_responses
0
endpoint_id
"576fca5b-f09c-4bda-b560-9989c39ac3ab"
id
"oiXsVgR-zfFMX-9f994c8997a7d7ab"
is_byok
false
latency
3764
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Together"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778419831-iL1bzbbWAZjjSHkedYdY"
session_id
(null)
api_type
"completions"
id
"gen-1778419831-u7OpXerZnaKExQkSEdB7"
upstream_id
"oiXsVgR-zfFMX-9f994c8997a7d7ab"
total_cost
0.0003148
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
response_cache_source_id
(null)
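The metadata above can be cross-checked against the run summary. The following is a minimal sketch, assuming that the displayed Duration is generation_time (in milliseconds) rounded to whole seconds and that throughput is measured against native_tokens_completion; neither assumption is confirmed by the page, and both function names are hypothetical.

```python
def format_duration(ms: int) -> str:
    """Format a millisecond duration as 'Xm Ys', rounding to whole seconds."""
    total_seconds = round(ms / 1000)
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


def tokens_per_second(tokens: int, generation_ms: int) -> float:
    """Completion throughput in tokens per second (assumed definition)."""
    return tokens / (generation_ms / 1000)


# Values copied from the metadata above.
generation_time_ms = 66524
native_tokens_completion = 598

print(format_duration(generation_time_ms))
print(round(tokens_per_second(native_tokens_completion, generation_time_ms), 2))
```

Under these assumptions the formatted duration agrees with the "1m 7s" shown in the run details, and the throughput works out to roughly 9 tokens per second.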
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 413
0.00% | Dialogue to Total Word Ratio | Ratio: 68.97%, Deviation: 18.97% | neededClean: false, wordsTotal: 419, wordsDialogue: 289
0.0000%
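The reported ratio and deviation for the dialogue evaluator are consistent with dividing wordsDialogue by wordsTotal and taking the absolute distance from the 50% target in the scenario. The sketch below reproduces that arithmetic. The counting method (whitespace-separated words inside straight double quotes) is an assumption, as are the function names; the page does not show how the evaluator actually tokenizes the text or how a deviation maps to the 0.00% score.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio).

    Assumption: dialogue is any text between straight double quotes,
    and words are whitespace-separated tokens.
    """
    total = len(text.split())
    spoken = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(segment.split()) for segment in spoken)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


def deviation(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Counts reported by the evaluator for this run.
words_dialogue, words_total = 289, 419
reported_ratio = words_dialogue / words_total

print(f"Ratio: {reported_ratio:.2%}, Deviation: {deviation(reported_ratio):.2%}")
```

With the reported counts this yields a ratio of 68.97% and a deviation of 18.97%, matching the details shown for this run. The two evaluators report slightly different totals (413 and 419 words), which suggests they count words differently.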