NC Bench
Run: 2026-05-20-01KS25Z7QB3YXP9TBSEPE2A2RB
Run Details
Model: Gemini 3.5 Flash (Reasoning, Minimal)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 4.7s
Cost: $0.005807
Provider: Google AI Studio
Evaluation summary
Matches word count: 75.16%
Dialogue to Total Word Ratio: 0.00%
Overall: 37.5778%
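The overall figure is consistent with an unweighted mean of the two evaluator scores: (75.16 + 0.00) / 2 = 37.58, and the displayed 37.5778% suggests the word-count score is about 75.1556% before rounding. The report does not state how the benchmark aggregates scores, so the sketch below is an assumption inferred from the numbers, and `overall_score` is a hypothetical name.

```python
def overall_score(evaluator_scores: list[float]) -> float:
    """Unweighted mean of per-evaluator scores, in percent.

    Assumption: the aggregation rule is inferred from the reported
    numbers, not taken from the benchmark's own code.
    """
    if not evaluator_scores:
        raise ValueError("at least one evaluator score is required")
    return sum(evaluator_scores) / len(evaluator_scores)


# Rounded scores as displayed in the evaluation summary.
print(round(overall_score([75.16, 0.00]), 2))
```

Under this assumption, a complete miss on one evaluator halves the overall score no matter how well the other evaluator scores.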
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars, training her sights on the bouncy castle inflating in the backyard across the street. It was a monstrosity shaped like a smiling dragon. We have exactly forty-five minutes before the cake is cut, Molly said, her voice a sharp whisper as she checked her watch. That is our golden window. Once sugar enters their systems, we are dealing with thirty pocket-sized agents of chaos. Bill patted his tactical turtleneck, looking nervous. What about the clown? I saw him carrying a balloon pump that looked suspiciously like a weapon. Pierre is a mercenary, Molly replied, not breaking her gaze. He balloon-models for the highest bidder. I bribed him with forty bucks and a promise not to reveal his real mouth is under that fake beard. He will create a distraction at the south perimeter with a barrage of balloon giraffes. But what if a kid sees us? Bill asked. He pulled a juice box from his pocket, eagerly sucking down the apple nectar. They are highly perceptive, Molly. Last week, my nephew pointed at my chin and asked why I had a nest. You are wearing a disguise, Bill, Molly sighed, turning to look at him with deep impatience. It is foolproof. He looked down at his bright yellow, oversized foam suit. I feel like a giant canary. You are a dancing pineapple, Molly corrected. The party theme is tropical paradise. I am the mascot of the local smoothie chain, hired to hand out coupons. You are my sidekick. And why do I have to be the pineapple? Because the coconut suit is currently holding our duffel bags. Bill shifted, his foam leaves rustling loudly. This still feels risky. Stealing from a nine-year-old? It feels like we are dropping to a new depth of depravity. Molly finally lowered her binoculars and gripped Bill by his foam shoulders. Listen to me. Little Timmy is getting a vintage, mint-condition retro gaming console from his rich grandma. It is worth five grand on the black market. He wants to play mobile games. 
We are performing a public service by rehoming it. What about the other boxes? Bill asked, his eyes gleaming. The ones wrapped in shiny silver foil? Collateral damage, Molly said with a sly grin. Plus, rumor has it there is a titanium mountain bike behind the tool shed. The walkie-talkie on Molly’s belt crackled to life, static buzzing through the quiet garage. The dragon is inflated, a French-accented voice squeaked. I repeat, the beast is ready. Send the fruit. That is Pierre, Molly whispered, her eyes widening with adrenaline. It is go time. Remember, no eye contact with the toddlers. They smell weakness. And if we get cornered by the birthday boy’s mother? Bill asked, adjusting his giant green leaf hat. Offer her a sugar-free smoothie coupon and run like the wind, Molly said, grabbing her clipboard. Now, waddle like your life depends on it.
created_at
"2026-05-20T07:52:57.84Z"
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
640
moderation_latency
(null)
generation_time
4344
tokens_prompt
77
tokens_completion
739
native_tokens_prompt
67
native_tokens_completion
634
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0058065
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"WWgNapKfO6mwjMcP6veA2AQ"
is_byok
false
latency
640
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263577-dPJz5xmbQtQ3bP2qIen5"
session_id
(null)
api_type
"completions"
id
"gen-1779263577-Xvwn8VlPXOZAq8fKm09Q"
upstream_id
"WWgNapKfO6mwjMcP6veA2AQ"
total_cost
0.0058065
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result: 75.16%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean = false, words = 487

Result: 0.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 0.00%, Deviation: 30.00%
Meta Data: neededClean = false, wordsTotal = 496, wordsDialogue = 0

Overall: 37.5778%
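A count of zero dialogue words is consistent with the model response containing no quotation marks: the speech is written inline without quotes. The evaluator's implementation is not shown in this report, so the following is only a minimal sketch of how such a check might work, assuming dialogue means whitespace-delimited words inside straight or curly double quotes. The names `dialogue_ratio` and `deviation` are hypothetical.

```python
import re

# Assumption: dialogue is any span enclosed in straight (") or curly (“ ”) double quotes.
QUOTED_SPAN = re.compile(r'["\u201c]([^"\u201d]*)["\u201d]')


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue words, total words, dialogue / total)."""
    words_dialogue = sum(len(span.split()) for span in QUOTED_SPAN.findall(text))
    words_total = len(text.split())
    ratio = words_dialogue / words_total if words_total else 0.0
    return words_dialogue, words_total, ratio


def deviation(ratio: float, target: float = 0.30) -> float:
    """Absolute distance from the target dialogue share (30% in this scenario)."""
    return abs(ratio - target)


quoted = 'Molly said, "Go now." Bill nodded.'
unquoted = "It is go time, Molly whispered."

print(dialogue_ratio(quoted))    # 2 dialogue words out of 6
print(dialogue_ratio(unquoted))  # no quotes, so 0 dialogue words
print(deviation(dialogue_ratio(unquoted)[2]))
```

With no quoted spans the ratio is 0 and the deviation equals the full 30% target, which matches the reported "Ratio: 0.00%, Deviation: 30.00%". How the benchmark maps that deviation to a score is not stated here.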