NC Bench
Run: 2026-03-03-01KJV0Q05PRF92DANTVSVNTXVA
Run Details
Model: Gemini 3.1 Flash Lite (Preview)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3.8s
Cost: $0.000986
Provider: Google AI Studio
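The reported cost can be cross-checked against the native token counts recorded in this run's metadata (67 prompt tokens, 646 completion tokens). The sketch below is illustrative only: the per-million-token rates are assumptions chosen because they reproduce the recorded figure of 0.00098575, not prices stated anywhere on this page, and `request_cost` is a hypothetical helper.

```python
from decimal import Decimal

# ASSUMPTION: USD per one million tokens. These rates are not given on the
# page; they are picked because they reproduce the recorded cost.
INPUT_RATE = Decimal("0.25")
OUTPUT_RATE = Decimal("1.50")


def request_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Return the request cost in USD for the given native token counts."""
    million = Decimal(1_000_000)
    return (prompt_tokens * INPUT_RATE + completion_tokens * OUTPUT_RATE) / million


# Native token counts taken from the run metadata.
cost = request_cost(67, 646)
print(cost)
```

Decimal arithmetic is used so that the comparison against the recorded value is exact rather than subject to floating-point rounding.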
Evaluation summary
Score      Evaluator
51.93%     Matches word count
0.00%      Dialogue to Total Word Ratio
25.9628%   Overall
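The Overall figure is consistent with an unweighted mean of the two evaluator scores. The page does not state the aggregation formula, so the sketch below assumes a simple average; `overall_score` is a hypothetical name. Because the displayed 51.93% is itself rounded, the result agrees with 25.9628% only to within rounding.

```python
def overall_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (ASSUMED aggregation rule)."""
    return sum(scores) / len(scores)


# Evaluator scores as displayed on the page, in percent.
scores = [51.93, 0.00]
print(f"{overall_score(scores):.3f}%")
```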
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped her fingers against the steering wheel, her eyes fixed on the suburban sprawl of a cul-de-sac. Bill was currently struggling to pull a neon yellow clown wig over his balding head. Bill, why are we doing this at a clown-themed birthday party? It feels like overkill. It is about blending in, Molly. We are undercover specialists. If we look like we belong, nobody questions the guy carrying a sack near the gift table. You look like a nightmare that crawled out of a sewer. The kids are going to scream. That is fine. If they are screaming, they are distracted. Chaos is our best friend in this business. Check the blueprint again. I have memorized the floor plan. We enter through the side gate, bypass the bounce house, grab the haul, and exit through the garage. Easy. What about the parents? Someone is bound to notice two adults hovering over a pile of wrapped toys. I will tell them I am the hired entertainment. I am going to juggle. You do not know how to juggle, Bill. Details, Molly. I will just throw things in the air and hope gravity handles the rest. What is the status on the security guard? You mean the golden retriever tied to the fence? He is currently busy being pet by a four-year-old. He is not a threat. Excellent. Now, remind me, why are we stealing presents? Because the last mansion we hit was a bust, and I know for a fact these kids are getting high-end Lego sets. I want the Star Destroyer. You are going to rob a toddlers party for a plastic spaceship? It is a collector hobby, Molly. Plus, the snacks are usually top-tier. I am hoping for those little pigs in a blanket. Focus. We go in at three. The clown show starts at three-fifteen. That gives us a fifteen-minute window before the cake comes out. And if we get caught? I drop the loot, you blame the wig, and we run for the car. I am not wearing the wig. You have to. It matches the vest. I hate this partnership. You love the thrill of the heist. I love the profit. There is a difference. Whatever. 
Do you have the duffel bags? They are in the trunk, hidden under the yoga mats. Perfect. Put on your nose. I refuse to wear the red nose, Bill. Put it on, or I tell everyone you are the one who stole the punch bowl at the last gig. Molly snatched the rubber nose from the dashboard, jammed it onto her face with a defiant scowl, and checked her watch. One minute to showtime. Ready to make some children cry? Born ready. Let us go secure those Legos. Try not to trip on your oversized shoes. I make no promises. Check your pockets for the lockpicks. Already got them. Let us move out.
created_at: "2026-03-03T23:32:13.125Z" (3/3/2026, 11:32:13 PM)
model: "google/gemini-3.1-flash-lite-preview-20260303"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 718
moderation_latency: (null)
generation_time: 3671
tokens_prompt: 77
tokens_completion: 709
native_tokens_prompt: 67
native_tokens_completion: 646
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.00098575
router: (null)
provider_responses:
  0:
    endpoint_id: "e004ce39-eb0c-4474-93d5-55072e725970"
    is_byok: false
    latency: 718
    model_permaslug: "google/gemini-3.1-flash-lite-preview-20260303"
    provider_name: "Google AI Studio"
    status: 200
api_type: "completions"
id: "gen-1772580733-FJOd777sKGaLUzJ94SI8"
upstream_id: (null)
total_cost: 0.00098575
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Evaluator: Matches word count
Result: 51.93%
Details: n/a
Meta Data: neededClean: false, words: 484

Evaluator: Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 0.00%, Deviation: 70.00%
Meta Data: neededClean: false, wordsTotal: 493, wordsDialogue: 0

Overall: 25.9628%
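The zero dialogue count is what one would expect if the evaluator identifies dialogue by quotation marks, since the model response contains none. The evaluator's actual implementation is not shown on this page, so the following is only a sketch under that assumption; `dialogue_ratio` and `TARGET_RATIO` are hypothetical names, and the word-splitting rule here is not guaranteed to match the evaluator's counts.

```python
import re

# Target share of dialogue words, taken from the scenario (70%).
TARGET_RATIO = 0.70


def dialogue_ratio(text: str) -> tuple[float, int, int]:
    """Return (ratio, dialogue_words, total_words).

    ASSUMPTION: dialogue is any text enclosed in straight or curly
    double quotes; words are whitespace-separated tokens.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'["“]([^"”]*)["”]', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue / total if total else 0.0
    return ratio, dialogue, total


quoted = 'Molly sighed. "I hate this partnership." Bill grinned. "You love it."'
unquoted = "Molly sighed. I hate this partnership. Bill grinned. You love it."

for label, sample in (("quoted", quoted), ("unquoted", unquoted)):
    ratio, dialogue, total = dialogue_ratio(sample)
    deviation = abs(ratio - TARGET_RATIO)
    print(f"{label}: {dialogue}/{total} words, deviation {deviation:.2%}")
```

Under this reading, the same lines score very differently depending only on whether they are wrapped in quotation marks, which matches the reported deviation of 70.00% for a response with no quoted text.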