NC Bench
Run: 2026-03-03-01KJTZVD8EM1GFZPZ96TTVWAR0
Run Details
Model: Gemini 3.1 Flash Lite (Preview)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 3.3s
Cost: $0.000911
Provider: Google AI Studio
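The cost and throughput of this run can be sanity-checked from the native token counts and generation time recorded in the run metadata. The sketch below is illustrative only: the per-million-token rates are assumptions (they are not stated on this page), chosen because they reproduce the reported cost, and `generation_time` is assumed to be in milliseconds. The function names are hypothetical.

```python
def estimate_cost(prompt_tokens, completion_tokens, input_usd_per_m, output_usd_per_m):
    """Estimate request cost in USD from token counts and per-million-token rates."""
    return (prompt_tokens * input_usd_per_m
            + completion_tokens * output_usd_per_m) / 1_000_000


def tokens_per_second(completion_tokens, generation_time_ms):
    """Completion throughput, assuming the generation time is given in milliseconds."""
    return completion_tokens / (generation_time_ms / 1000)


# ASSUMED rates in USD per million tokens; not taken from this page.
ASSUMED_INPUT_USD_PER_M = 0.25
ASSUMED_OUTPUT_USD_PER_M = 1.50

# Values from the run metadata: native_tokens_prompt, native_tokens_completion,
# and generation_time.
cost = estimate_cost(67, 596, ASSUMED_INPUT_USD_PER_M, ASSUMED_OUTPUT_USD_PER_M)
rate = tokens_per_second(596, 3232)

print(f"estimated cost: ${cost:.8f}")
print(f"throughput: {rate:.1f} tokens/s")
```

Under these assumed rates the estimate agrees with the reported `total_cost` of 0.00091075, which suggests billing is based on the native token counts (67 and 596) rather than the normalized ones (77 and 709).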
Evaluation summary
Score | Evaluator
0.49% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.2460% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon pink wig as she hunkered behind the suburban hydrangea bush. Beside her, Bill was trying to cram his six-foot frame into a tutu that was straining at every seam. The backyard beyond the fence was currently a chaos of shrieking toddlers and sugar-fueled mayhem. Look at that pile of loot, Bill whispered, tapping his binoculars against the fence. It is a goldmine of artisanal wooden trains and imported plush unicorns. Bill, pull your skirt down, you look like a disgruntled ballerina, Molly hissed, swatting his hand away. Do you have the tranquilizer darts for the golden retriever, or are we playing tag with a beast the size of a pony? I have the darts, but they are strawberry scented, Bill muttered, fumbling in his tactical fanny pack. Do you think the dog will appreciate the bouquet, or will he just think it is a gourmet treat? It is a golden retriever, Bill, it will probably thank you for the flavor profile before it passes out, Molly said, checking her watch. We move when the clown goes for his refill of cheap gin. He is currently juggling poorly and terrifying the toddler in the front row. Bill squinted through the slats. The clown is dropping a plate, this is our moment of absolute confusion. No, wait for the cake, Molly countered, gripping her satchel. The cake is the ultimate distraction. When the mother starts lighting those ridiculous sparkler candles, the parents will be distracted by the fire hazard. That is when we breach the gift perimeter. Do you remember which bag is which? Bill asked, his voice trembling slightly. We need the one with the remote-control car, not the one that just contains socks from the grieving grandmother. Bill, I have handled heists at gala auctions where the jewelry was guarded by ex-special forces, Molly sighed. I think a pile of birthday tributes guarded by a bunch of exhausted parents and a sugar-crashing circus act is well within my wheelhouse. Just focus on the diversion. Can you still juggle? 
I can juggle three rubber chickens, Bill said, pulling the poultry from his oversized pockets. Perfect, Molly whispered, creeping toward the open fence gate. Throw them into the bouncy castle. It will look like a comedic assault on the inflatables. Bill sighed, dusting off his tutu. I always wanted to be a professional gymnast, but this will have to suffice. Are we synchronized for the getaway? The getaway is a tricycle sprint down the Cul-de-sac, Molly signaled. Try not to run over the gnome, he is a classic suburban staple and I am not leaving him behind. Bill nodded, his eyes gleaming with the manic intensity of a man who was about to steal presents from a five-year-old. For the haul, Molly. For the haul, Bill. Now, go make a scene.
created_at: "2026-03-03T23:17:09.019Z" (3/3/2026, 11:17:09 PM)
model: "google/gemini-3.1-flash-lite-preview-20260303"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 720
moderation_latency: (null)
generation_time: 3232
tokens_prompt: 77
tokens_completion: 709
native_tokens_prompt: 67
native_tokens_completion: 596
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.00091075
router: (null)
provider_responses:
  0:
    endpoint_id: "e004ce39-eb0c-4474-93d5-55072e725970"
    is_byok: false
    latency: 720
    model_permaslug: "google/gemini-3.1-flash-lite-preview-20260303"
    provider_name: "Google AI Studio"
    status: 200
api_type: "completions"
id: "gen-1772579829-EowMWKlXziw7qu2et9KR"
upstream_id: (null)
total_cost: 0.00091075
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Result | Evaluator | Details | Meta Data
0.49% | Matches word count | n/a | neededClean: false, words: 473
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 50.00% | neededClean: false, wordsTotal: 481, wordsDialogue: 0
0.2460% | Overall
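The 0.00% dialogue ratio follows from wordsDialogue being 0. That is consistent with the response marking speech only through tags such as "Bill whispered" and containing no quotation marks. The benchmark's actual evaluator is not shown on this page; the sketch below assumes a hypothetical approach that treats only words inside straight or curly double quotation marks as dialogue, then reports the ratio and its deviation from the 50% target. All function names are illustrative.

```python
import re

# Matches text inside straight ("...") or curly (“...”) double quotation marks.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def count_words(text):
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_ratio(text):
    """Fraction of words that fall inside double quotation marks."""
    total = count_words(text)
    if total == 0:
        return 0.0
    spoken = sum(count_words(a or b) for a, b in QUOTED.findall(text))
    return spoken / total


def deviation(ratio, target=0.5):
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


unquoted = "Look at that pile of loot, Bill whispered."
quoted = '"Look at that pile of loot," Bill whispered.'

print(dialogue_ratio(unquoted), deviation(dialogue_ratio(unquoted)))
print(dialogue_ratio(quoted), deviation(dialogue_ratio(quoted)))
```

Under this assumption, a sentence taken from the response scores a ratio of zero as written, while the same sentence with quotation marks around the spoken part registers as mostly dialogue. A quote-based counter of this kind would therefore report a 50% deviation for the whole response regardless of how much speech it contains.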