NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-20-01KS25SEXGBVYBTQP1DQMDDKMM
Run Details
Model:
Gemini 3.5 Flash (Reasoning, Minimal)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4.2s
Cost:
$0.004808
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The helium is the critical vulnerability in their perimeter, Molly whispered, her nose pressed against the damp glass of the surveillance van. Bill didn't look up from his checklist. He was busy trying to squeeze his defensive lineman shoulders into a primary-colored clown suit. This zipper is an instrument of torture, Molly. I feel like a sausage being forced back into its casing. Why am I the entertainment? Because you have the balloon animal training from your youth group days, and I have the steady hands required for pocket-picking six-year-olds, Molly said, turning to face him. She held up a small plastic device. I’ve calibrated the frequency. This baby will jam their baby monitors within a fifty-yard radius. No parental feedback loops. No crying-induced alerts. Are we seriously rolling a bouncy castle? Bill asked, finally conquering the zipper with a triumphant gasp. He adjusted a giant red foam nose. I have a master's degree in architectural engineering, Molly. I bypassed the security lasers at the Louvre. Now I’m stealing a pile of Lego sets and a motorized miniature Vespa. That motorized Vespa retails for four hundred dollars, Bill, and custody-battle grandmothers will pay double that on the black market before Christmas, Molly countered. She checked her watch. Zero hour. The cake cutting is scheduled for exactly fourteen hundred hours. Sugar highs will peak fifteen minutes later. That is our window of maximum chaos. And the getaway? Bill asked, adjusting his rainbow wig. We stage a piñata accident, Molly explained, her eyes gleaming with tactical fervor. I’ve packed a decoy piñata stuffed with high-grade strobe lights and itching powder. While the toddlers are disoriented and weeping, we bag the gift table. It’s diabolical, Bill admitted, a slow smile spreading across his face. But what about the birthday boy? Toby. He’s a biter. The brief said he has three confirmed VIP extractions from previous playgroups. I have neutralized the threat, Molly said, tapping a pocket. I brought sugar-free gummy bears. They act fast on the digestive tract. He’ll be occupied. You’re a monster, Bill whispered in genuine admiration. I’m a professional, she corrected, sliding her lockpicks into a pocket disguised as a giant flower. Now, grab the balloon pump and put on your game face. Honk honk, Bill sighed, squeezing his red nose. Let’s go rob some babies.
created_at
"2026-05-20T07:49:48.597Z"
5/20/2026, 7:49:48 AM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
722
moderation_latency
(null)
generation_time
4053
tokens_prompt
77
tokens_completion
626
native_tokens_prompt
67
native_tokens_completion
523
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0048075
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"nGcNavDyLuKs1MkP3abv4As"
is_byok
false
latency
722
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263388-W8o5zKTQuxrcgn4O5QOh"
session_id
(null)
api_type
"completions"
id
"gen-1779263388-hnNHWWks4SAM0QbEXtoB"
upstream_id
"nGcNavDyLuKs1MkP3abv4As"
total_cost
0.0048075
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
386
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
395
wordsDialogue
0
0.0000%