NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR90VJGZFHSQ5XGYZ8JD53VD
Run Details
Model:
Gemini 3.1 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
3.4s
Cost:
$0.001001
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
3.62%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
1.8117%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon green tutu, the scratchy tulle scraping against her tactical vest. She peered through the binoculars at the suburban lawn decorated with saggy balloon arches. Bill, are you sure about this? It looks like a high-security daycare facility. Bill crouched behind a manicured hedge, clutching a bag of glitter bombs like a grenade launcher. It is a birthday party, Molly. The security is four underpaid teenagers and an inflatable castle. Those teenagers look like they have combat training. And the castle is massive. What if it is a trap? It’s not a trap. It is a five-year-old’s dreamscape. The primary target, a mountain of gift-wrapped loot, is situated near the punch bowl. We hit, we grab, we vanish. What about the clown? I heard there is a clown. There is always a clown, Molly. Just ignore the clown. I cannot ignore a man with oversized shoes and a penchant for balloon animals. Clowns are unpredictable agents of chaos. Focus. Do you have the exit route mapped? I do. We retreat through the vegetable garden, hop the fence, and meet the getaway minivan behind the shed. But Bill, the presents. What if they are just socks? Nobody gives a five-year-old socks. We are looking at premium plastic, digital consoles, and high-end figurines. This is a goldmine. What about the cake? Leave the cake. It is a liability. Stains, crumbs, sticky evidence. We are professionals. I just feel like we should grab at least one cupcake. For morale. We are not here for morale. We are here for the haul. Are you wearing the shoe covers? Yes, they are neon pink slip-ons. I look like a disgruntled ballerina. You look like insurance against footprints. Now, look at that perimeter. The target is unguarded. The parents are busy with the piñata. That piñata is a paper-mache donkey of death. It is swinging wildly. Exactly. It is a distraction. The chaos is our cover. I still think we should have brought smoke grenades instead of glitter. Glitter is psychological warfare, Molly. Nobody wants to deal with it for the next three years. It is the perfect signature. You are a monster. I am an entrepreneur. On my signal. Wait, Bill, the clown is looking at us. He is not looking at us. He is looking at his balloon poodle. Don't look at the clown. He is definitely looking at us. He is waving. Oh god, he is coming over. Abort! Run for the minivan! But the presents! Forget the presents! He is holding a sword! That is a balloon sword, Bill! It is still a sword, Molly! Go, go, go! They scrambled backward, tangled in their own tutus, as a squeaky voice drifted across the lawn. The minivan engine sputtered, coughed, and finally roared to life, leaving nothing behind but a trail of sequins and shattered dignity.
created_at
"2026-05-10T13:22:34.151Z"
5/10/2026, 1:22:34 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
612
moderation_latency
(null)
generation_time
3286
tokens_prompt
77
tokens_completion
739
native_tokens_prompt
67
native_tokens_completion
656
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00100075
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"moYAataNFe-hz7IPsczGmAk"
is_byok
false
latency
612
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778419354-ZrQ7jozRrvHkdv977zej"
session_id
(null)
api_type
"completions"
id
"gen-1778419354-RtAeRsAYq7NcqyS8aXLA"
upstream_id
"moYAataNFe-hz7IPsczGmAk"
total_cost
0.00100075
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
3.62%
Matches word count
n/a
neededClean
false
words
476
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
485
wordsDialogue
0
1.8117%