NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8YCQRJK1C1JM9DA7NVBKSN
Run Details
Model:
Gemini 3.1 Flash Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
17.5s
Cost:
$0.000915
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.21%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.1070%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The backyard was a suburban nightmare of neon plastic and sugar-fueled chaos. Molly crouched behind a giant inflatable penguin, her tactical vest snagging on a rogue sequin. Beside her, Bill adjusted his oversized clown mask, the rubber squeaking against his sweat-slicked forehead. They were professionals, or at least they had been before the bank job went sideways and left them desperate enough to target a seven-year-old’s birthday bash. Do you have the perimeter clear, Molly whispered, squinting through the slats of the penguin’s belly. Bill checked his watch, which lacked hands but still looked vaguely official. The bouncy castle is unguarded, but the parents have formed a phalanx of lawn chairs near the gift pile. It is a tactical fortress guarded by Chardonnay and aggressive social standing. We do not care about the adults, Molly hissed, adjusting her dark gloves. Focus on the main objective. The haul is tucked away under that mountain of wrapping paper shaped like race cars. If we grab the duffel and scramble over the fence by the slide, we are gone before the cake cutting. What if the clown sees us, Bill asked, gesturing toward a man in a polyester jumpsuit currently juggling unevenly. He looks like he has seen things. That man is three martinis deep and clearly contemplating his life choices, Molly said. He is not a threat. Just grab the largest boxes first and keep your head low. If a kid tries to intercept you, you are legally allowed to use the glitter bomb as a distraction. Bill sighed, the mask muffling his frustration. I really thought we would be doing art heists or diamond smash-and-grabs by now, not skulking behind inflatable woodland creatures at a backyard gala for toddlers. Focus, Bill. We are stealing toys, not the crown jewels. But those limited edition action figures are worth a fortune on the black market, so do not let your professional pride cloud your judgment. They watched as a child swung a piñata with the murderous intensity of a medieval executioner. The candy rained down, causing a momentary lapse in security. Molly grabbed Bill’s elbow. This is our moment. The distraction is perfect. Move now, stay in the shadows of the slide, and keep your hands off the streamers. Bill nodded, his clown mask slipping slightly. If I trip over a bicycle, I am not turning back for the wrapping paper. Just move, you absolute buffoon, she retorted. They lunged forward, a pair of dark shapes darting through the shimmering haze of heat and impending birthday carnage. They moved with the grace of high-stakes thieves, dodging a stray foam dart and a rogue golden retriever. The gift pile loomed ahead, a glittering mound of plastic possibilities. Molly reached the target first, her fingers brushing against the crinkle of metallic paper. The heist was officially underway.
created_at
"2026-05-10T12:39:30.841Z"
5/10/2026, 12:39:30 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
560
moderation_latency
(null)
generation_time
17345
tokens_prompt
77
tokens_completion
732
native_tokens_prompt
67
native_tokens_completion
599
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00091525
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"g3wAapbzAcyhz7IPyNmpyQY"
is_byok
false
latency
560
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778416770-HSIJeRrC41j5JrkYjgTs"
session_id
(null)
api_type
"completions"
id
"gen-1778416770-kqXCHYOfKMFCT4tC4to1"
upstream_id
"g3wAapbzAcyhz7IPyNmpyQY"
total_cost
0.00091525
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.21%
Matches word count
n/a
neededClean
false
words
472
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
479
wordsDialogue
0
0.1070%