NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8Z7RG1AEZXPR5WQ54GPA5E
Run Details
Model:
Gemini 3.1 Flash Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4.1s
Cost:
$0.000932
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
27.17%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
13.5829%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon green balloon hat, the latex squeaking against her forehead like a dying rodent. Across the cramped van, Bill struggled to force his hulking shoulders into a glittery pink fairy vest that threatened to disintegrate at any moment. The target was the Miller estate, a suburban fortress currently besieged by forty hyperactive toddlers and enough sugar to power a small industrial district. Are you absolutely certain we have to wear these ridiculous costumes, Bill asked, his voice strained as he yanked a stray sequin from his eye. Look, Molly whispered, brandishing a plastic wand with alarming intensity, we are infiltration experts. If we show up in dark tactical gear, the nanny will call the police before I even reach the gift table. If we show up as the Great Zamboni and Princess Sparkle-Dust, we are invisible. Invisible or a public health hazard, Bill muttered, testing the integrity of his velcro-strapped fairy wings. If you sneeze, the glitter alone will blind the perimeter guard, so pipe down. Do you have the decoy loot bags? Bill patted a cavernous sack tucked under his seat. They are filled with literal trash and broken Rubik's cubes. The parents will be too distracted by the clown to notice you swapping the haul. Perfect, Molly said, checking her watch. The piƱata drop is in T-minus five minutes. That is our window. Once the wood bats come out, chaos reigns, and we liberate the plunder. What about the birthday boy, Bill asked, squinting through a pair of heart-shaped sunglasses. He is six and possesses the observational skills of a bloodhound. If the kid gets in your way, just tell him you are a magical forest deity and offer him a fruit snack, Molly replied, opening the van door to a cacophony of shrieking children. I keep three packs of fruit leather in my pouch for exactly this purpose. Bill sighed, a long, rattling sound that deflated his fairy wings slightly. I hate this. I hate children, I hate glitter, and I especially hate that I am an accomplice to grand larceny at a juice box bistro. It is high-stakes art, Bill. Think of the adrenaline. Think of the sheer audacity, Molly said, stepping onto the manicured lawn with a flourish of her oversized cape. I am thinking about my dignity, which died somewhere between the face paint station and this pastel tutu, Bill groaned, following her past a decorative picket fence. Dignity does not pay the rent, darling. Now, put on your most whimsical smile and try not to look like you are plotting a felony. Remember, we are here to celebrate. We are here to rob a toddler, Bill corrected, adjusting his wand nervously. To celebrate, Molly hissed, flashing a grin as they approached the front porch, where a swarm of children lunged for a jumping castle like starving piranhas. Let the festivities begin.
created_at
"2026-05-10T12:54:16.329Z"
5/10/2026, 12:54:16 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
558
moderation_latency
(null)
generation_time
4037
tokens_prompt
77
tokens_completion
729
native_tokens_prompt
67
native_tokens_completion
610
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00093175
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"-H8AapGrH_-Gz7IPmL_5iAo"
is_byok
false
latency
558
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778417656-T03h0818PQSYqPUiveHT"
session_id
(null)
api_type
"completions"
id
"gen-1778417656-QfKEyogqas10CnkA8zG3"
upstream_id
"-H8AapGrH_-Gz7IPmL_5iAo"
total_cost
0.00093175
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
27.17%
Matches word count
n/a
neededClean
false
words
481
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
485
wordsDialogue
0
13.5829%