NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-20-01KS2611WTAZK3R0QJKDEMW9MA
Run Details
Model:
Gemini 3.5 Flash (Reasoning, Minimal)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4.5s
Cost:
$0.005536
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted through her binoculars, her brow furrowing with intense concentration. "Explain to me again why we are dressed as a pair of oversized, non-copyright-compliant farm animals?" Bill adjusted his giant, padded yellow duck head, his voice muffled by three inches of cheap foam. "Because, Molly, a six-foot mallard and a purple cow can walk right into a seven-year-old’s birthday bash without raising a single eyebrow. It is the perfect camouflage." Molly looked down at her purple velour hooves. "I feel ridiculous. My udders are uneven." "Nobody is looking at your udders, Molly. They will be looking at the cake. Which brings us to phase one of the operation: the distraction." "Let me guess," Molly said, sighing as she adjusted her snout. "You are going to trip over the clown." "I am going to perform a high-stakes, slapstick collision with the bouncy castle," Bill corrected, sniffing with dignity. "While the parents rush to rescue the suffocating toddlers, you slip into the sunroom." "Where the loot is kept." Molly’s eyes gleamed behind her mesh eyeholes. "I saw the target inventory on the mother’s Pinterest board. We are talking high-end Lego sets, three different iPads, and a motorized miniature sports car." "The Holy Grail of childhood consumerism," Bill whispered. "We will be rich." "Or severely beaten by a mob of sugar-crazed second graders," Molly countered, checking her watch. "The magician is scheduled for two o'clock. That is our window." "Magicians are amateurs," Bill scoffed. "They do sleight of hand. We do sleight of van." "Is the getaway vehicle ready?" Molly asked, her hooves tapping anxiously on the pavement. "Parked by the fire hydrant. I left the engine idling under the guise of it being a highly realistic tractor noise." "Bill, our getaway car is a 2008 Honda Civic." "Exactly. Nobody suspects a Civic." "A Civic that sounds like a John Deere is suspicious, Bill." "Just trust the process," Bill snapped, his duck beak swaying. "Do you have the diversionary candy?" Molly patted her side pocket, which jingled softly. "Two pounds of sour worms. If any child gets too close, I will deploy the sugar." "Brilliant. They will eat themselves into a coma." Bill peered over the hedges. "Look! The bouncy castle is fully inflated. It is go time." "For the love of gold, do not mess up the waddle," Molly warned, nudging him forward. "I am a natural mallard, Molly," Bill puffed, waddling with exaggerated hip swings. "Let’s go steal some childhood memories."
created_at
"2026-05-20T07:53:57.407Z"
5/20/2026, 7:53:57 AM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
689
moderation_latency
(null)
generation_time
4378
tokens_prompt
77
tokens_completion
663
native_tokens_prompt
67
native_tokens_completion
604
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0055365
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"lWgNarWaIYHrjMcPxreJwQM"
is_byok
false
latency
689
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263637-ikqSSSwhYdG6ekGeUOBi"
session_id
(null)
api_type
"completions"
id
"gen-1779263637-ZytEg5GfplSF69g60kQs"
upstream_id
"lWgNarWaIYHrjMcPxreJwQM"
total_cost
0.0055365
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
411
0.00%
Dialogue to Total Word Ratio
Ratio: 74.94%, Deviation: 24.94%
neededClean
false
wordsTotal
419
wordsDialogue
314
0.0000%