NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-20-01KS25S4GGRNNT2TRJ7EVK2QPF
Run Details
Model:
Gemini 3.5 Flash (Reasoning, Minimal)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
2.6s
Cost:
$0.002460
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The brass bell above the bakery door chimed, lost beneath the warm, yeasty hum of the industrial ovens. Behind the glass counter, a row of powdered jelly donuts sat like plump, sugar-dusted cushions. "You’re staring at the cinnamon swirls again. That always means trouble." "I am merely contemplating the structural integrity of the glaze. And whether a certain retired librarian was seen behind the apothecary last night." "Mabel? At midnight? She claims she was home nursing her sciatica." "Then why did I find a lavender-scented handkerchief snagged on the rose trellis behind the shop? One with her initials embroidered in yellow silk?" "Oh, dear. That is rather damning. Especially since she swore she hasn’t walked three blocks in a month." "Exactly. Now, pass me one of those swirls. Investigating perjury requires a significant amount of butter." "Only if you promise not to drop crumbs on the crime scene photos this time. Chief Higgins was furious about the grease smudges." "That was lard from the peach pie, and it actually helped highlight the bootprints." "Just take the pastry and keep your head down. Mayor Thornton is coming in for his morning brioche, and he already looks suspicious."
created_at
"2026-05-20T07:49:37.941Z"
5/20/2026, 7:49:37 AM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
767
moderation_latency
(null)
generation_time
2459
tokens_prompt
81
tokens_completion
318
native_tokens_prompt
68
native_tokens_completion
262
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00246
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"kmcNaofRBrKgjrEPqamNoQs"
is_byok
false
latency
767
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263377-OkkreRYd2JPm5Fsv1YjU"
session_id
(null)
api_type
"completions"
id
"gen-1779263377-AeNtNzteQ3R87OaxNAIl"
upstream_id
"kmcNaofRBrKgjrEPqamNoQs"
total_cost
0.00246
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)