NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYAY91ZK29PRV3BW3T9234A
Run Details
Model:
Gemma 3 4B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
4.2s
Cost:
$0.000027
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
1.11%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of cinnamon and warm bread hung thick in the air, a comforting blanket against the drizzle outside. Flour dusted the countertop like a fresh snowfall, and the rhythmic thump of the mixer punctuated the quiet hum of conversation. “He really wouldn't have just *left* the money, would he?” “Not Mr. Abernathy,”
a voice replied
, low and thoughtful. “Not after all these years. He was meticulous. Almost…obsessive.” A hand sifted through a stack of perfectly formed croissants, arranging them with deliberate care. “That much cash just sitting there, unattended? It defies logic.” “Unless someone wanted to make it disappear,”
the other voice suggested
, a hint of wry amusement in the tone. “And Mr. Abernathy had plenty of people who might consider a little disappearing act.” “You’re thinking about Silas,” the first
voice stated
, the movement of the hand ceasing. “He was the last one to see him, wasn’t he? Complaining about the price of eggs, remember?” “Silas always complained about something,” a sigh was audible. “But letting a substantial sum of money vanish? That’s a new level of grumbling.” A careful dusting of powdered sugar settled on a tray of gingerbread men. “Let's just say, this town has a surprising number of secrets, and a whole lot of delicious pastries.”
created_at
"2026-02-20T20:12:58.823Z"
2/20/2026, 8:12:58 PM
model
"google/gemma-3-4b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
299
moderation_latency
(null)
generation_time
4110
tokens_prompt
98
tokens_completion
350
native_tokens_prompt
76
native_tokens_completion
294
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00002656
router
(null)
provider_responses
0
endpoint_id
"d3de3bd4-81bc-48fb-924f-2a87b2a36e75"
id
"chatcmpl-R6DVYuAGRqQVAWPgXRTgSIGZ"
is_byok
false
latency
116
model_permaslug
"google/gemma-3-4b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771618378-pMtVhsinpvkpoSuY9ujm"
upstream_id
"chatcmpl-R6DVYuAGRqQVAWPgXRTgSIGZ"
total_cost
0.00002656
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
1.11%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"a voice replied"
1
"the other voice suggested"
2
"voice stated"