NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR92HMSWYZKK649GZX1506JH
Run Details
Model:
Gemma 4 31B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
30.1s
Cost:
$0.000163
Provider:
Together
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
45.2419%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill spread a glittery map across the table. "We hit the bounce house at noon," he whispered. Molly scoffed, crossing her arms. "Why the bounce house, Bill? It's a tactical nightmare." "High ground, Molly! Plus, the kids are distracted by the sugar rush." "I'm more worried about the clowns," she countered. "Clowns are unpredictable. They have those long balloons." Bill shrugged. "I've got a plan for the clowns. I'll distract them with a fake juggling act." "You can't juggle, Bill." "I can pretend to fail miserably! It's a classic misdirection." Molly sighed, staring at the blueprint of the living room. "And the loot? The presents are piled near the cake." "Exactly. We swoop in during the Pin the Tail on the Donkey game," Bill explained. "While they're blindfolded, we clear the table." "What about the noise? Wrapping paper is loud." "I've already factored that in," Bill said, producing two pairs of oversized fuzzy slippers. "Stealth mode, Molly." "Fuzzy slippers? This is the worst heist in history." "But we'll be the most comfortable criminals in the neighborhood." Molly looked at the slippers and sighed. "Fine. But I get the LEGO sets."
created_at
"2026-05-10T13:52:05.958Z"
5/10/2026, 1:52:05 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
354
moderation_latency
(null)
generation_time
19729
tokens_prompt
77
tokens_completion
312
native_tokens_prompt
79
native_tokens_completion
294
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0001628
router
(null)
provider_responses
0
endpoint_id
"4149ab03-3b69-45a0-b94c-6c253b8641a1"
is_byok
false
latency
55
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"DeepInfra"
status
429
1
endpoint_id
"29b94af5-d8f8-419a-9cf1-ae1f2831142b"
is_byok
false
latency
116
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Venice"
status
429
2
endpoint_id
"9d4644e9-a5e5-41f4-b810-d75f484d2c24"
is_byok
false
latency
18
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Parasail"
status
429
3
endpoint_id
"69279254-6a96-4e9d-abc6-ea76e6bf8e97"
is_byok
false
latency
10078
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Novita"
status
504
4
endpoint_id
"576fca5b-f09c-4bda-b560-9989c39ac3ab"
id
"oiXzBan-zfFMX-9f996c6591a80910"
is_byok
false
latency
354
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Together"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778421125-JkLlvEVqmYoBgRvyR6fH"
session_id
(null)
api_type
"completions"
id
"gen-1778421125-UCsnGSNpw9JkCtXXn4L8"
upstream_id
"oiXzBan-zfFMX-9f996c6591a80910"
total_cost
0.0001628
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
190
0.00%
Dialogue to Total Word Ratio
Ratio: 75.26%, Deviation: 25.26%
neededClean
false
wordsTotal
190
wordsDialogue
143
45.2419%