NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-19-01KS0TPDAMKFNF5NCP7GNWJCR3
Run Details
Model:
Gemini 3.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 31s
Cost:
$0.218503
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
100.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his tight yellow jumpsuit and glared intensely at the blue diagram of the bouncy castle. Molly slammed her heavy coffee mug onto the folding table, splashing black lukewarm caffeine over their messy blueprints of the leafy suburban backyard. "We only have a short window before the clown arrives and completely ruins our tactical advantage." "But what about the ice cream cake? It will melt if we wait too long to eat it." "Focus, Bill! Sugar is a distraction." "I am focusing on the major threat of forty sugary toddlers." She dragged her rough finger across the mapping paper, completely ignoring his loud whining. The heist target was a massive mountain of brightly wrapped present boxes sitting in the back sunroom, clearly visible through a large, unguarded sliding glass window. "The perimeter is guarded by three hyperactive golden retrievers." "I brought peanut butter to jam their jaws." "Excellent. Once we bypass them, we breach the sliding glass door." "Is it locked?" "No. It is a party, not Fort Knox." Bill carefully scribbled a silly stick figure on the blueprint plan, giving it a tiny paper party hat. Molly rolled her dark eyes dramatically and angrily kicked a heap of plush, green dinosaur suits across the dusty wooden van floor. "Why must we wear these ridiculous outfits?" "It is camouflage for this dinosaur party." "I look ridiculous in spandex." "You will look rich when we resell those Lego sets." "Are we really stealing from kids?" "We are liberating consumer goods." She patted her custom tactical belt, which was heavily laden with organic apple juice boxes, aerosol silly string, and a yellow plastic spade. Bill slowly picked up his helmet, which closely resembled a Tyrannosaurus head with a squeaking, moving jaw. "What is our escape plan?" "We load all the loot into the wagon." "The fire engine wagon?" "Yes, it is currently hidden near the bushes." "And if the mother spots us?" "Deploy your silly string and run." "That is unprofessional." Molly uncomfortably adjusted her giant green foam tail, accidentally knocking over a huge, full cardboard box of sweet juice pouches. Bill strapped his heavy, hot helmet on, his loud breathing echoing loudly inside the fake, suffocating silly plastic dinosaur snout. "My cheap plastic snout is fogging up." "Suffer for your art, Bill." "I cannot see the giant bouncy castle." "We are not here to bounce!" "Can I jump in it for just ten minutes?" "No. We strike at exactly three." He sighed heavily, his plush tail swishing sadly against the messy floor of their rusted stakeout minivan. Molly checked her digital stopwatch. She threw open the double back doors, exposing them both to the sweet, dangerous scent of freshly baked birthday cupcakes and impending, wild, chaotic, roaring suburban party madness. "Check all of your tactical gear right now." "Is your juice box ready?" "It is fully armed." "Is the silly string loaded?" "Ready for massive deployment." "Let us finally secure our great plastic fortune." "For the rare vintage Lego sets."
created_at
"2026-05-19T19:16:40.392Z"
5/19/2026, 7:16:40 PM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1483
moderation_latency
(null)
generation_time
90138
tokens_prompt
77
tokens_completion
7215
native_tokens_prompt
67
native_tokens_completion
24267
native_tokens_completion_images
0
native_tokens_reasoning
23596
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.2185035
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"GLcMauGXIvzE-sAPiJLgsQk"
is_byok
false
latency
1483
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779218200-owxQBq4NyTDg65mwP5dP"
session_id
(null)
api_type
"completions"
id
"gen-1779218200-Fphfk3NOIl84wR4zrwfh"
upstream_id
"GLcMauGXIvzE-sAPiJLgsQk"
total_cost
0.2185035
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
100.00%
Dialogue to Total Word Ratio
Ratio: 50.00%, Deviation: 0.00%
neededClean
false
wordsTotal
500
wordsDialogue
250
100.0000%