NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8ZQVEYGXDJ67Y25JQ8W04C
Run Details
Model:
Gemini 3.1 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4.4s
Cost:
$0.000957
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
48.8138%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon green tutu, the scratchy tulle digging into her waist, and checked the perimeter of the backyard. Bill stood beside her, trying unsuccessfully to force his massive frame into a plastic dinosaur costume that kept ripping at the seams. He looked like an aggressive lizard having a mid-life crisis. The sound of children screeching in the distance pierced the quiet suburban evening, vibrating against their eardrums like a swarm of angry bees. Are you sure this is the target, Bill? The invitation said five years old, but those kids sound like they have the bloodlust of a wolf pack. Stop complaining about the acoustics and focus on the loot, Molly. It is a goldmine. The parents invited half the soccer league, which means the gift table is groaning under the weight of at least thirty LEGO sets and a motorized scooter. And you plan on carrying a scooter while wearing that tail? You look like Godzilla got stuck in a blender. It is a stegosaurus, damn it, and it provides tactical bulk. Look at the perimeter guard. Molly squinted toward the fence. A golden retriever was currently asleep on a trampoline, its rhythmic snoring punctuated by the occasional leg twitch. The beast is neutralized, but what about the clown? I heard the agency booked a professional named Bubbles who specializes in balloon animals and extreme surveillance. Bubbles is a hack who hides pocket lint in his pockets. He will be distracted by the communal cake. While they sing the birthday anthem, we descend like shadows. Shadows do not wear polka-dot wings, Bill. I feel ridiculous. Confidence is key. If anyone asks, we are the eccentric cousins from the side of the family that ignores social norms. The side that steals presents? The side that values efficiency over sentimentality. Now, check the bag. Did you bring the industrial-strength garbage bags for the haul? They were under the passenger seat. I thought they were for the tires. Why would I touch your supplies after the debacle at the wedding? That was one time, and it was a misunderstanding regarding a gift registry. Just tell me you have the lock-picking kit tucked into the dinosaur pouch. It is nestled right next to my tail. We move on the count of three. What happens if the power goes out? Then we work by the light of the glow sticks, which I also liberated from the snack table while you were busy debating the merits of the tutu. You are a menace, Bill. I am a professional. Watch the fence. Molly took a breath, smoothed her wings, and braced herself against the cedar planks. The backyard lights flickered, casting long, jumping shadows across the bouncy castle. She signaled with a sharp nod. They scrambled over the wood, landing silently on the manicured turf as the opening notes of a high-pitched birthday song began to drift toward them, signaling the start of the chaotic final act.
created_at
"2026-05-10T13:03:03.654Z"
5/10/2026, 1:03:03 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
668
moderation_latency
(null)
generation_time
4233
tokens_prompt
77
tokens_completion
756
native_tokens_prompt
67
native_tokens_completion
627
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00095725
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"B4IAapWMNeKsqtsPvc-6gAE"
is_byok
false
latency
668
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778418183-j55SloPntG6ljmSvPJaw"
session_id
(null)
api_type
"completions"
id
"gen-1778418183-fY2t8vJ9MxGvwCQwytnO"
upstream_id
"B4IAapWMNeKsqtsPvc-6gAE"
total_cost
0.00095725
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
493
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
498
wordsDialogue
0
48.8138%