NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8XXKEK6JA38S88FF98MA0D
Run Details
Model:
Gemini 3.1 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
1.7s
Cost:
$0.000418
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of caramelized sugar and yeast hung heavy in the air, masking the metallic tang of the morning rain. Flour dust danced in the sliver of sunlight piercing the front window, settling on the cooling racks of cinnamon rolls. "You’re early. The first batch of sourdough isn't even out of the oven yet." "I figured the sirens would wake you anyway. Did you see the flashlights near the library?" "I saw enough. That side door has been temperamental since the mayor’s office moved in, but it’s never been left wide open." "It wasn't just left open, Arthur. The lock wasn't picked—it was shattered." "Keep your voice down. Mrs. Gable is already eyeing us from the register, and she’s had enough fuel for her gossip mill to last until Sunday." "Let her watch. If the rumors about the rare folios are true, the whole town is going to be whispering by noon." "Take a seat in the back booth. I’ll make a fresh pot of coffee, and you can tell me exactly what you found before the sheriff realizes you were trespassing." "I wasn't trespassing. I was investigating." "In this town, there’s rarely a difference."
created_at
"2026-05-10T12:31:14.906Z"
5/10/2026, 12:31:14 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
534
moderation_latency
(null)
generation_time
1570
tokens_prompt
81
tokens_completion
298
native_tokens_prompt
68
native_tokens_completion
267
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0004175
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"k3oAauixBbrjqtsPzZyQQQ"
is_byok
false
latency
534
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778416274-lM4L4h7nSuvnxjmGJ01I"
session_id
(null)
api_type
"completions"
id
"gen-1778416274-yNkhRIjc5VMP1BXh2513"
upstream_id
"k3oAauixBbrjqtsPzZyQQQ"
total_cost
0.0004175
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)