NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-20-01KS25ZV6441BF3EM2TG98KN0T
Run Details
Model:
Gemini 3.5 Flash (Reasoning, Minimal)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4.7s
Cost:
$0.005744
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.01%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0049%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars, her eyes tracking a clown who was currently struggling to inflate a yellow balloon shaped vaguely like a sausage dog. They were parked in a rented minivan three houses down from the target location, where a giant helium-filled number seven floated over the backyard fence. "The bouncy castle is the perimeter wall," she murmured, her voice tight with professional focus. "If we breach from the west, we bypass the face-painting station entirely. It is a hotbed of civilian casualties, Bill. Sticky fingers everywhere." Bill sighed, his massive frame squeezed into a khaki vest that he believed made him look like a safari guide but actually made him look like an oversized Boy Scout. He was currently busy sorting through a tactical duffel bag filled with neon noisemakers and juice boxes. "I still think we should have dressed as magicians," he said, holding up a plastic top hat with a false bottom. "I practiced the rabbit trick for three hours last night, Molly. My technique is flawless." "Magic is too high-profile," Molly snapped, snatching the top hat and tossing it into the back seat. "People watch the magician. No one watches the caterer's assistants. We blend in, we locate the designated gift table, we sweep the haul into the insulated laundry hampers, and we exit through the side gate before they cut the cake. It is a classic grab-and-go." "But what about the target?" Bill asked, looking genuinely concerned as he squinted at his map of the backyard. "The birthday boy. Young Lucas. I hear he has a wicked arm with a juice pouch." "Lucas is seven today," Molly said, her tone icy. "His reaction times are sluggish from a steady diet of high-fructose corn syrup and excitement. He is not a threat. The real danger is his mother, Beatrice. She has the hawkish gaze of a casino pit boss and a megaphone." "And the presents?" Bill’s eyes gleamed with greed. "Are we sure the payload is worth the risk?" "My inside source says he’s getting three custom gaming consoles, a miniature electric quad bike, and at least twenty envelopes containing crisp fifty-dollar bills from wealthy grandparents who couldn't be bothered to shop," Molly said, a slow smirk spreading across her face. "It’s a goldmine wrapped in Spider-Man paper." "Brilliant," Bill whispered, strapping a bubble-gun holster to his thigh. "I'll handle the distraction. If anyone questions us, I’ll start a game of duck-duck-goose. It paralyzes them every time." "Just stick to the plan," Molly warned, checking her watch. "The clown is wrapping up his set. He’s losing the crowd. When those kids realize there is no more balloon art, anarchy will reign. That is our window of opportunity." "Understood," Bill said, pulled a pair of aviator sunglasses over his eyes. "Let's go steal some childhood memories."
created_at
"2026-05-20T07:53:17.769Z"
5/20/2026, 7:53:17 AM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
788
moderation_latency
(null)
generation_time
4620
tokens_prompt
77
tokens_completion
724
native_tokens_prompt
67
native_tokens_completion
627
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0057435
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"bWgNatftN6yd1MkP7oHj2Ao"
is_byok
false
latency
788
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263597-Nxy2BTQvYu2dPm5LvMwR"
session_id
(null)
api_type
"completions"
id
"gen-1779263597-vtfTDepKHpGe2lLryTlP"
upstream_id
"bWgNatftN6yd1MkP7oHj2Ao"
total_cost
0.0057435
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.01%
Matches word count
n/a
neededClean
false
words
469
0.00%
Dialogue to Total Word Ratio
Ratio: 60.62%, Deviation: 30.62%
neededClean
false
wordsTotal
480
wordsDialogue
291
0.0049%