NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJV0Q6QGK07BV1SE45MPEM98
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
3.8s
Cost:
$0.000929
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
27.17%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
13.5829%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The suburban cul-de-sac was bathed in the sickly yellow glow of motion-sensor porch lights. Molly crouched behind a manicured hedge, adjusting her tactical turtleneck, while Bill struggled to untangle his festive party hat from a low-hanging pine branch. They were targeting the Henderson estate, a fortress of suburban excess currently hosting a six-year-old’s birthday bash. Through the bay window, a clown with a painted-on smile made balloon swords for a throng of toddlers, his movements frantic and desperate. Inside the living room, a mountain of glitter-wrapped loot sat unguarded near the coat rack, a siren song of plastic and cardboard. Molly peered through her binoculars, tracking the perimeter. She cursed under her breath as a golden retriever barked at a squirrel. Bill, are you clear on the objective? Molly whispered into her headset, her eyes fixed on the target. The pile near the window is primarily remote-controlled cars and high-end LEGO sets. If we hit the kitchen door now, we have a three-minute window before the clown finishes his magic routine. Bill yanked his hat free, the cardboard ripping at the seam. Did you pack the reinforced duffel bags? Because if those sets are the modular space stations, they will not fit in anything less than industrial grade equipment. Are we sure this is worth the risk, Molly? I still feel terrible about ruining little Timmy’s big day. Molly rolled her eyes, shifting her weight as the grass dampened her chinos. Timmy is an heir to a fortune. He will get another fleet of toys by Tuesday. Our goal is the inventory. Keep your eyes on the balloons. I just think we could have picked an easier mark, Bill muttered, checking his pocket watch. The clown just pulled a live pigeon out of a top hat. Any second now, the chaos index is going to climb to eighty percent. Should I deploy the smoke pellets? Absolutely not, Molly hissed. We are shadows, not fireworks displays. We enter, we extract, we disappear before the cake-cutting ceremony. If you trip over a juice box again, I am leaving you behind to face the PTA moms alone. Bill adjusted his belt, his face hardening in the dim light. I did not trip, that was a calculated tactical stumble. Let us do this, then. On your mark, we crawl toward the sliding glass door. Molly checked her watch, her pulse steady. Three, two, one. Move. They surged forward, silhouettes dancing across the manicured lawn. The scent of vanilla frosting and cheap juice boxes drifted through the night air. With the precision of seasoned pros, they hit the porch edge just as a sudden eruption of toddler screams echoed from inside, signaling the start of a chaotic game of musical chairs. It was the perfect diversion. They slid along the siding, reaching for the latch, ready to liberate the mountain of cardboard from its pampered existence.
created_at
"2026-03-03T23:32:19.836Z"
3/3/2026, 11:32:19 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
641
moderation_latency
(null)
generation_time
3680
tokens_prompt
77
tokens_completion
728
native_tokens_prompt
67
native_tokens_completion
608
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00092875
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
641
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772580739-DkbAoSJIxIo6GYOG0OnS"
upstream_id
(null)
total_cost
0.00092875
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
27.17%
Matches word count
n/a
neededClean
false
words
481
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
493
wordsDialogue
0
13.5829%