NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8ZF8A5N032Q3M15G40X568
Run Details
Model:
Gemini 3.1 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4.2s
Cost:
$0.000933
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
9.61%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
4.8041%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly peered through the high-powered binoculars at the sprawling backyard where a dozen five-year-olds were currently engaged in a high-stakes game of musical chairs. Every child screamed with the ferocity of a wild beast, their energy vibrating through the fence. Bill leaned against the brick wall of the neighbor’s garage, checking his wristwatch for the tenth time. He adjusted his velvet mask, which kept sliding over his eyebrows. Are we doing this or just watching them play ring-around-the-rosie until they reach retirement, Molly asked. There is a rhythm to the madness, Bill whispered, pointing at the clown who was currently failing to twist a balloon into the shape of a poodle. Look at the perimeter. The clown is the primary threat. The clown is an amateur, Molly scoffed while she checked her lockpicking kit. He is already three margaritas deep into the juice box stash behind the shed. If he is compromised, that changes the extraction route, Bill noted. We cannot risk a scene involving a disgruntled birthday boy wielding a glitter-covered plastic sword. I have zero fear of a preschooler, Molly said, tightening her ponytail. We go in through the bouncy castle during the cake-cutting distraction. The bouncy castle is a tactical nightmare, Bill countered, gesturing toward the inflatable fortress which looked like a sweat-slicked dungeon. You know the structural integrity of those things is held together by hope and heavy gusts of wind. That is why we use the secret entrance, Molly muttered, tapping the map she had drawn on a cocktail napkin. We slip under the rear flap, grab the pile of loot, and exit via the sandbox. And the presents are definitely high-value? Bill asked, his voice thick with uncharacteristic greed. I heard a rumor about a new gaming console in a shiny red box. It is in the pile, Molly confirmed. Wrapped in glittery dinosaur paper. I saw the mother drop it there with the care one usually reserves for a Faberge egg. What about the golden retriever, though? Bill asked, his eyes darting to the corner of the yard where a furry beast was currently chewing on a discarded party hat. I have a pocket full of organic, gluten-free liver treats that would distract a guard dog at the gates of hell, Molly said, patting her jacket. You really thought of everything, didn't you? Bill sighed, shaking his head. I grew up as a middle child, Molly said, pulling a pair of goggles over her eyes. We are the masters of stealing things that aren't ours without being noticed. Bill nodded, his confidence returning as he checked his pockets for his bag of black-market marbles. Ready to ruin some childhood memories. Let us get rich, Molly grinned, vaulting the fence with the grace of a cat. See you on the other side, Bill whispered, following her into the chaos of sugar-fueled disaster.
created_at
"2026-05-10T12:58:21.9Z"
5/10/2026, 12:58:21 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
586
moderation_latency
(null)
generation_time
4107
tokens_prompt
77
tokens_completion
736
native_tokens_prompt
67
native_tokens_completion
611
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00093325
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"7oAAarjqBMrjqtsPrv_MUQ"
is_byok
false
latency
586
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778417901-6LERd8IcmNTZ9SSlrVSK"
session_id
(null)
api_type
"completions"
id
"gen-1778417901-2w8IYDauUyf7NVmks9D0"
upstream_id
"7oAAarjqBMrjqtsPrv_MUQ"
total_cost
0.00093325
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
9.61%
Matches word count
n/a
neededClean
false
words
478
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
492
wordsDialogue
0
4.8041%