NC Bench
Run: 2026-05-10-01KR8VMS58NGC2MNDNT5AR8J4R
Run Details
Model:
Gemini 3.1 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
3.2s
Cost:
$0.000948
Provider:
Google AI Studio
Evaluation summary
Evaluator: Score
Matches word count: 99.38%
Dialogue to Total Word Ratio: 0.00%
Overall: 49.6885%
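The page does not state how the Overall figure is derived, but the numbers are consistent with an unweighted mean of the two evaluator scores. The following is a minimal sketch under that assumption; the function name `overall_score` is hypothetical and the aggregation rule is inferred from the displayed values, not confirmed by NC Bench.

```python
def overall_score(evaluator_scores):
    """Unweighted mean of per-evaluator scores, in percent (assumed aggregation)."""
    if not evaluator_scores:
        raise ValueError("at least one evaluator score is required")
    return sum(evaluator_scores) / len(evaluator_scores)


# Scores as displayed in the summary (already rounded to two decimals).
displayed = {
    "Matches word count": 99.38,
    "Dialogue to Total Word Ratio": 0.00,
}

overall = overall_score(list(displayed.values()))
print(f"{overall:.2f}%")
```

With the rounded inputs this gives 49.69%, which agrees with the displayed 49.6885% to within the rounding of the word count score.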
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The sun beat down on the suburban cul-de-sac, turning the manicured lawns into shimmering mirrors of suburban mediocrity. Molly adjusted her floral sun hat, its wide brim doing little to hide the intense glitter of her lockpick set tucked into her waistband. Beside her, Bill was sweating through a ridiculously bright yellow mascot costume that smelled faintly of mothballs and despair. He looked significantly less like a beloved party entertainer and significantly more like a giant, neon warnings sign for local law enforcement. They crouched behind a decorative hedge, watching a clown struggle with a tangled knot of balloon animals near the front door. Are you sure about this, Molly? It feels like we are losing our touch. Molly rolled her eyes, checking her watch. We are not losing our touch, Bill. We are diversifying our portfolio. There is a mountain of high-end loot behind that door, and most of it is still in the boxes. It is the perfect crime. It is a birthday party for a six-year-old. Someone is definitely going to cry. Let them cry, Bill. Tears are just the soundtrack of a successful acquisition. Do you have the grab-bags ready? I have two extra-large contractor bags and a fake mustache I found in the pocket of this suit. I do not think I have ever felt this ridiculous in my entire criminal career. Focus. The clown will be moving to the backyard in three minutes. That gives us a window to breach the foyer, secure the table, and exit through the side gate. Keep your head down and try not to trip over your oversized shoes. I am in a bright yellow bird suit, Molly. Subtle is not really on the menu today. A loud, screeching laugh erupted from the yard as a group of children stampeded toward the fountain. Molly signaled for Bill to follow as she crept along the perimeter of the house, her boots silent on the neatly kept mulch. They reached the side door, which had been left slightly ajar to let the summer breeze displace the smell of cheap frosting. 
Molly peeked inside. The living room was a pristine gallery of wrapped gifts, bows, and expensive toys piled high on a mahogany table. It was beautiful, in a tacky, glittery sort of way. See? It is practically gift-wrapped for us. Get the bags out. I still feel like a monster. What if there is a remote control car in there? Then we sell it on the secondary market and buy ourselves a nice dinner. Stop being sentimental. Grab the bags and move. They slipped inside, the air-conditioned foyer a sudden relief against the stifling afternoon heat. Molly began sweeping the presents into the bags with practiced precision, her movements fluid and rhythmic. Bill stood guard, looking like a deranged canary, nervously swaying his oversized head toward the door every time a lawnmower hummed in the distance. The haul was heavier than expected, but the soft thud of boxes meeting bag was the only music Molly needed.
created_at
"2026-05-10T11:51:28.689Z"
5/10/2026, 11:51:28 AM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
549
moderation_latency
(null)
generation_time
3068
tokens_prompt
77
tokens_completion
746
native_tokens_prompt
67
native_tokens_completion
621
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00094825
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"QHEAas2QNtbqqtsPo7OrmQw"
is_byok
false
latency
549
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778413888-CxWX6pOxfXn2HCm9JaX7"
session_id
(null)
api_type
"completions"
id
"gen-1778413888-HTZbL1tAAdx6s3RHi1A2"
upstream_id
"QHEAas2QNtbqqtsPo7OrmQw"
total_cost
0.00094825
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Matches word count
Result: 99.38%
Details: n/a
Meta Data: neededClean: false, words: 505
Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 0.00%, Deviation: 30.00%
Meta Data: neededClean: false, wordsTotal: 513, wordsDialogue: 0
Overall
Result: 49.6885%
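The wordsDialogue value of 0 is worth a closer look: the response contains spoken lines, but none of them are wrapped in quotation marks. A plausible explanation is that the evaluator counts only quoted text as dialogue. The sketch below illustrates that idea; it is an assumption about how such an evaluator could work, not NC Bench's actual implementation. The names `dialogue_ratio` and `deviation` are hypothetical, and whitespace splitting is a simplification (the page itself reports two different totals, 505 and 513, so the real word counting evidently differs between evaluators).

```python
import re

# Assumption: dialogue is any text enclosed in straight or curly double quotes.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (a simplification)."""
    return len(text.split())


def dialogue_ratio(text: str) -> float:
    """Fraction of words that fall inside quotation marks."""
    total = count_words(text)
    if total == 0:
        return 0.0
    spoken = sum(count_words(a or b) for a, b in QUOTED.findall(text))
    return spoken / total


def deviation(text: str, target: float = 0.30) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(dialogue_ratio(text) - target)


unquoted = "Are you sure about this, Molly? Molly rolled her eyes."
quoted = '"Are you sure about this, Molly?" Molly rolled her eyes.'

print(dialogue_ratio(unquoted), deviation(unquoted))
print(dialogue_ratio(quoted))
```

Under this assumption, the unquoted sample yields a ratio of 0 and a deviation equal to the full 30% target, which matches the Ratio: 0.00%, Deviation: 30.00% reported above. The same sentence with quotation marks yields a non-zero ratio.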