NC Bench
Run: 2026-05-10-01KR9146J3EQFZWWTTGY2KBQ5P
Run Details
Model: Gemini 3.1 Flash Lite
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3.8s
Cost: $0.000873
Provider: Google AI Studio
Evaluation summary
Score | Evaluator | Details Link
0.00% | Matches word count | (details)
0.00% | Dialogue to Total Word Ratio | (details)
0.0001% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly peered through the high-powered binoculars at the sprawling suburban backyard. Sunlight glinted off a mountain of foil-wrapped loot piled squarely on a mahogany table. Bill shifted behind her, his knees cracking like dry kindling. See that, Bill? A literal feast of birthday spoils. Do you see the one with the blue ribbon? That is almost certainly a limited edition gaming console. Or a very heavy toaster. Either way, we are eating like kings tonight. Bill squinted past her shoulder, rubbing his temple. It is a party for seven-year-olds, Molly. There will be clowns. I have a deeply irrational fear of clowns. They have too many teeth and their shoes are scientifically impossible. Focus, Bill. We are not here for the clowns. We are here for the spoils. The perimeter is guarded by an inflatable castle and a very distracted golden retriever. If we hit the fence, drop in, and scoop the presents, we could be out before the cake even hits the table. What if they catch us? These parents look like they belong to a neighborhood watch association that practices tactical maneuvers on the weekends. I saw a man in a beige sweater measuring the lawn earlier. That is suspicious behavior. That is just suburban malaise, not a tactical threat. We go in at the signal. When the clown starts the balloon animal segment, you snag the pile. I will handle the distraction. What kind of distraction? Please tell me it does not involve you wearing a costume. I have a smoke pellet and a very loud whistle. A whistle? That is your big heist-grade distraction? You are going to blow a whistle at a bunch of hyperactive children? We will be swarmed. I will be tackled by a toddler in a superhero cape. I cannot be taken down by a boy named Hunter. My reputation will never recover. Your reputation is already in the toilet, Bill. You once tried to crack a safe using nothing but a stick of gum and a stern look. Just follow the plan. Do you have the duffel bags ready? I have the bags. 
But if I get bitten by a hamster or sprayed with juice boxes, I am taking an extra ten percent of the cut. Ten percent? You are lucky I am not charging you for the privilege of working with a professional. Pull your mask up. The clown is finishing the giraffe. It is go time. Wait, are you sure that is a giraffe? It looks more like a deformed noodle. It is a giraffe, Bill. Now shut up and climb. If I break an ankle on this trampoline, I am telling the ambulance that you pushed me. If you break an ankle, I am leaving you as a gift for the birthday boy. Jump.
created_at: "2026-05-10T13:27:16.813Z" (5/10/2026, 1:27:16 PM)
model: "google/gemini-3.1-flash-lite-20260507"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 586
moderation_latency: (null)
generation_time: 3728
tokens_prompt: 77
tokens_completion: 662
native_tokens_prompt: 67
native_tokens_completion: 571
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
service_tier: "standard"
usage: 0.00087325
router: (null)
provider_responses:
  0:
    endpoint_id: "237af107-d2e9-46c7-b146-c74d75980dee"
    id: "tYcAav01_4bPsg-Yv_mICg"
    is_byok: false
    latency: 584
    model_permaslug: "google/gemini-3.1-flash-lite-20260507"
    provider_name: "Google AI Studio"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1778419636-PVK8PJVJiKzmShJQe0xO"
session_id: (null)
api_type: "completions"
id: "gen-1778419636-XbSzDDgivWrkV86isSkg"
upstream_id: "tYcAav01_4bPsg-Yv_mICg"
total_cost: 0.00087325
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
response_cache_source_id: (null)
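As a rough cross-check of the cost fields, the sketch below recomputes the total from the native token counts. The per-million-token rates (0.25 USD input, 1.50 USD output) are assumptions chosen because they reproduce the reported figure; they are not stated anywhere on this page, and `estimate_cost` is a hypothetical helper.

```python
# Figures copied from the run metadata.
NATIVE_TOKENS_PROMPT = 67
NATIVE_TOKENS_COMPLETION = 571
REPORTED_TOTAL_COST = 0.00087325  # USD

# ASSUMPTION: per-million-token rates in USD, not stated on this page.
ASSUMED_INPUT_RATE = 0.25
ASSUMED_OUTPUT_RATE = 1.50


def estimate_cost(prompt_tokens, completion_tokens, input_rate, output_rate):
    """Return an estimated cost in USD for rates given per million tokens."""
    return (prompt_tokens * input_rate + completion_tokens * output_rate) / 1_000_000


estimated = estimate_cost(
    NATIVE_TOKENS_PROMPT,
    NATIVE_TOKENS_COMPLETION,
    ASSUMED_INPUT_RATE,
    ASSUMED_OUTPUT_RATE,
)
print(f"estimated: {estimated:.8f}  reported: {REPORTED_TOTAL_COST:.8f}")
```

Under these assumed rates the estimate agrees with `total_cost`, which suggests billing is based on the native token counts (67 and 571) rather than the normalized ones (77 and 662). Treat that as an inference, not a documented rule.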
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 466
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 70.00% | neededClean: false, wordsTotal: 471, wordsDialogue: 0
0.0001% | Overall | |
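The model response contains no quotation marks, which may explain a dialogue word count of 0. The following is a minimal sketch of how such a ratio and deviation could be computed, assuming the evaluator counts only words enclosed in double quotes. The actual NC Bench evaluator is not shown on this page; `dialogue_ratio`, `deviation`, and the regular expression are hypothetical.

```python
import re

# ASSUMPTION: dialogue means text inside straight or curly double quotes.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')

TARGET_RATIO = 0.70  # from the scenario: 70% dialogue


def count_words(text):
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_ratio(text):
    """Return the fraction of words that fall inside double quotes."""
    total = count_words(text)
    if total == 0:
        return 0.0
    spoken = sum(count_words(a or b) for a, b in QUOTED.findall(text))
    return spoken / total


def deviation(ratio, target=TARGET_RATIO):
    """Return the absolute distance between the measured ratio and the target."""
    return abs(ratio - target)


unquoted = "See that, Bill? A literal feast of birthday spoils."
quoted = '"See that, Bill?" Molly said.'

print(dialogue_ratio(unquoted), deviation(dialogue_ratio(unquoted)))
print(dialogue_ratio(quoted), deviation(dialogue_ratio(quoted)))
```

With a quote-based counter like this, unquoted speech yields a ratio of 0 and therefore the full 70-point deviation, matching the figures reported for this run.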