NC Bench
Run: 2026-05-10-01KR8Y074J246MKXNQQQ7AXA8Y
Run Details
Model: Gemini 3.1 Flash Lite
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3.6s
Cost: $0.000935
Provider: Google AI Studio
Evaluation summary
Score      Evaluator
6.09%      Matches word count
0.00%      Dialogue to Total Word Ratio
3.0453%    Overall
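The Overall figure is consistent with an unweighted mean of the two evaluator scores: (6.09 + 0.00) / 2 is about 3.045. The sketch below illustrates that reading. The averaging rule and the function name `overall_score` are assumptions, not something the report states, and the displayed 3.0453% suggests the word-count score is stored with more precision than the 6.09% shown.

```python
def overall_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (assumed aggregation rule)."""
    if not scores:
        raise ValueError("at least one evaluator score is required")
    return sum(scores) / len(scores)


# Scores as displayed in the evaluation summary, in percent.
evaluator_scores = [6.09, 0.00]
overall = overall_score(evaluator_scores)
print(f"Overall: {overall:.3f}%")
```

Under this reading, a failure on one evaluator caps the overall score at half of the other evaluator's score.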
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon pink party hat, the elastic digging into her chin with the sting of a thousand bee stings. She stared across the suburban cul-de-sac at the fortress of joy, a sprawling lawn overtaken by a massive, deflating bounce castle shaped like a deranged dragon. Bill stood beside her, hunched over in an oversized tuxedo that made him look like a penguin suffering an identity crisis. He clutched a clipboard as if it contained blueprints to the Louvre rather than a list of loot. Do you have the exit route mapped, Bill, or are we planning to be dragged away by a mob of sugar-crazed toddlers? I have optimized the path, Molly. We approach via the hedge, wait for the inevitable chaotic collapse of the piñata, and strike during the golden hour of cake consumption. The piñata is a distraction, not a tactical window. You know how fast these monsters move once they smell frosting. I have factored in the sugar rush. Their reaction times will be sluggish due to internal caloric overload. We take the pile, we retreat to the getaway minivan. You mean the minivan that currently has a toddler’s sticker collection covering the windshield? It provides excellent camouflage, Molly. Who suspects a getaway vehicle adorned with glittery unicorns and cartoon llamas? I suspect it. Every time I look at it, I feel my dignity eroding. Why are we stealing birthday gifts? This feels like a low point, even for us. These are high-end presents. We are literally talking about the latest generation of gaming consoles, designer building blocks, and stuffed bears that cost more than my last three rent payments. This is an untapped market, Molly. It is a morality vacuum. What if we get caught by a clown? I hate clowns. They have those soulless, painted eyes that see right through your criminal intent. If a clown interferes, you use the aerosol glitter spray I packed. It will blind him long enough for us to make our escape. You packed glitter spray? It is effective and festive. 
Stay focused. The clown is currently busy balloon-dogging. This is our moment. Are those laser tripwires or just strands of cheap party streamers? Those are streamers, Molly. Control your nerves. I am nervous because you told me the vault was a reinforced safe, but it turns out to be a cardboard fortress labeled Jimmy’s Big Day in glitter glue. It is a fortress of potential profit. Grab the duffel bag and stay low. If a six-year-old initiates a hostile negotiation, offer them a juice box and run. You are a menace, Bill. If we get arrested, I am telling the police you were the mastermind behind the entire kindergarten caper. They would never believe you. Now move, and try not to trip over the inflatable dragon tail. It is a lawsuit waiting to happen.
created_at
"2026-05-10T12:32:40.812Z"
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
562
moderation_latency
(null)
generation_time
3228
tokens_prompt
77
tokens_completion
724
native_tokens_prompt
67
native_tokens_completion
612
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00093475
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"6XoAasm2AdbqqtsPo7OrmQw"
is_byok
false
latency
561
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778416360-2xFt43SeAUlVT4gpaSOh"
session_id
(null)
api_type
"completions"
id
"gen-1778416360-Ff1pA2bhcNoQFw1bclQW"
upstream_id
"6XoAasm2AdbqqtsPo7OrmQw"
total_cost
0.00093475
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
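Two derived figures can be read off the Meta fields: output throughput and a sanity check on the cost. The sketch below is only illustrative. The per-million-token prices are assumptions chosen because they reproduce the reported `total_cost` from the native token counts; the report itself does not state any rates, and the helper names are hypothetical.

```python
# Values copied from the Meta fields of this run.
native_tokens_prompt = 67
native_tokens_completion = 612
generation_time_ms = 3228
reported_total_cost = 0.00093475

# ASSUMED prices in USD per million tokens; not stated anywhere in the report.
ASSUMED_INPUT_PRICE_PER_M = 0.25
ASSUMED_OUTPUT_PRICE_PER_M = 1.50


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Completion tokens generated per second of generation time."""
    return tokens / (elapsed_ms / 1000)


def estimated_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Cost in USD under the assumed per-million-token prices."""
    weighted = (prompt_tokens * ASSUMED_INPUT_PRICE_PER_M
                + completion_tokens * ASSUMED_OUTPUT_PRICE_PER_M)
    return weighted / 1_000_000


throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
cost = estimated_cost(native_tokens_prompt, native_tokens_completion)
print(f"throughput: {throughput:.1f} tokens/s")
print(f"estimated cost: ${cost:.8f} (reported: ${reported_total_cost:.8f})")
```

The native counts (67 and 612) are used here rather than `tokens_prompt` and `tokens_completion` (77 and 724), since only the native counts line up with the reported cost under these assumed rates.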
Evaluation details

Result: 6.09%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean = false, words = 477

Result: 0.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 0.00%, Deviation: 70.00%
Meta Data: neededClean = false, wordsTotal = 484, wordsDialogue = 0

Overall: 3.0453%
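A ratio of 0.00% with `wordsDialogue` at 0 is what a quote-based counter would report for this response, because the model wrote the dialogue without quotation marks. The sketch below assumes the evaluator counts only words enclosed in double quotes (straight or curly). That rule, the regular expression, and the function names are assumptions for illustration; NC Bench's actual tokenization may differ, as its two evaluators report 477 and 484 total words for the same text.

```python
import re

# Text enclosed in straight or curly double quotes (assumed delimiter set).
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def count_words(text: str) -> int:
    """Whitespace-delimited word count."""
    return len(text.split())


def dialogue_ratio(text: str) -> float:
    """Fraction of words that sit inside double quotes."""
    total = count_words(text)
    if total == 0:
        return 0.0
    # findall yields (straight, curly) tuples; exactly one side is non-empty.
    dialogue = sum(count_words(a or b) for a, b in QUOTED.findall(text))
    return dialogue / total


def deviation(ratio: float, target: float = 0.70) -> float:
    """Absolute distance from the requested dialogue share."""
    return abs(ratio - target)


quoted = 'Molly sighed. "Do you have the route?" Bill nodded.'
unquoted = "Do you have the exit route mapped, Bill? I have optimized the path, Molly."

for sample in (quoted, unquoted):
    ratio = dialogue_ratio(sample)
    print(f"ratio={ratio:.2%} deviation={deviation(ratio):.2%}")
```

With a counter like this, unquoted speech scores zero dialogue words no matter how much of the scene is spoken, which matches the 70.00% deviation reported above.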