NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTYVR8MN8Q3SSEYGJEQ7AXC
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
3.2s
Cost:
$0.000891
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
27.17%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
13.5829%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon pink tutu and squinted at the blueprints spread across the dashboard of their stolen minivan. The glitter covering her face made her itch, but she tapped the map with a manicured nail. Bill sat in the driver seat, wearing a giant mascot head that looked vaguely like a sentient potato, his muffled sigh echoing inside the cramped space. We hit the bouncy house at exactly two fifteen, Molly whispered, checking her watch. The sugar crash should be in full effect by then. If the clowns see us, Bill grunted, his voice vibrating against the foam interior, I am tackling them. I do not care if they are professional balloon artists, I am taking them down. Molly rolled her eyes and checked her lockpicking kit hidden inside a pastel butterfly backpack. Nobody is tackling a clown, Bill. We are stealthy, we are shadows, we are the parents of the birthday boy. Nobody recognizes anybody at these sensory nightmares. I just hate the smell of toddler breath, Bill muttered, shifting the mascot head to alleviate a cramp. Are you sure the vault is behind the gift table? It is a pile of glittery wrapping paper, Bill, not a bank vault, Molly snapped. You just grab the armfuls and run toward the slide. What if the kids form a defensive perimeter? They are four years old, Bill. Use the distraction. Which distraction? The one where you accidentally spill the entire cooler of organic juice boxes everywhere. Oh, that is cruel, Bill said, though he sounded impressed. It is a heist, not a bake sale. Are you in or are you going to keep complaining about the ventilation in that potato suit? I am in, Bill said, patting a duffel bag stuffed with empty velvet sacks. I just want to make it clear that if a piñata hits me, I am retaliating. I will not have my reputation ruined by a papier-mâché donkey. Nobody is going to hit you with a piñata if you move fast enough. What is the signal for a hasty retreat? Throw a handful of glitter and scream about a hidden puppet show, Molly instructed, leaning over to lock the doors. If we walk out with the presents, we don't look back. Even if a toddler starts crying? Especially if a toddler starts crying. That is how you know you have succeeded. Molly pulled her wig straight and checked her reflection in the rearview mirror. You look like a deranged fairy godmother, Bill said, finally lifting his mask to take a deep breath of non-potato air. And you look like a vegetable that has lost its way, Molly countered, kicking the door open. Let us go steal some joy. They stepped out into the bright afternoon sun, two grown criminals masquerading as chaos agents, marching determinedly toward the distant sound of screeching children and aggressive nursery rhymes.
created_at
"2026-03-03T22:59:51.712Z"
3/3/2026, 10:59:51 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
607
moderation_latency
(null)
generation_time
3110
tokens_prompt
77
tokens_completion
685
native_tokens_prompt
67
native_tokens_completion
583
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00089125
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
607
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772578791-elwx7TWPnpaJXHDa0ve9"
upstream_id
(null)
total_cost
0.00089125
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
27.17%
Matches word count
n/a
neededClean
false
words
481
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
483
wordsDialogue
0
13.5829%