NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-20-01KS26078EZH69VQ6S012JYV0P
Run Details
Model:
Gemini 3.5 Flash (Reasoning, Minimal)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
5.3s
Cost:
$0.006535
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
81.27%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
40.6364%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The heavy scent of damp cardboard and mothballs filled the back of Molly’s van. On the wooden crate between them sat a hand-drawn blueprint of a suburban backyard, complete with a crudely sketched bounce house and a large circle labeled "THE loot" in sparkly pink gel pen. "I still say we go in through the dog door," Bill whispered, adjusting his black ski mask. It was eighty degrees outside, and he was already sweating profusely. "You’re six-foot-four, Bill," Molly said, not looking up from her binoculars. "The dog is a miniature schnauzer. Do the math." "I could compress my spine. I’ve been doing yoga." "You’re not compressing your spine. We’re using the Trojan Horse protocol." Molly pointed a manicured finger at the blueprint. "The entertainment arrives at two. Captain Barnaby’s Magic Spectacular. We intercept him, take the costume, and you go in as the clown." "Why do I have to be the clown?" Bill whined. "Clowns are creepy. Parents watch them too closely. Let me be the guy who operates the pony ride." "There is no pony, Bill. It’s a seven-year-old’s birthday, not a medieval tournament. Besides, the clown has direct access to the gift table. It’s positioned right next to the cake station." "What kind of cake?" "Focus!" Molly slapped his thigh with her clipboard. "According to my inside source, the target is a pristine, limited-edition, unopened Lego Death Star. It’s worth a grand on the black market. Along with three iPads and a pile of designer clothes those spoiled brats won’t appreciate anyway." "I appreciate designer clothes," Bill muttered, pulling at his itchy collar. "Do you think they have any of those little mini quiches?" "If you eat the party food, I will leave you there to face the wrath of fifteen sugar-fueled second graders." "They’re just kids, Molly. What are they going to do, juice-box me to death?" "They are feral, Bill. They have no concept of mercy or remorse. If they realize you aren’t a real magician, they will tear you apart like piranhas." "I can do magic!" Bill protested, pulling a crumpled red foam ball from his pocket. "Look. Behind your ear. Bam." Molly stared at the foam ball in his hand with cold, unblinking eyes. "That was in your nose thirty seconds ago." "It establishes a rapport," Bill said, stuffing it back into his pocket. "So, what’s the getaway plan?" "We load the gifts into the giant trick-box. While they're waiting for you to pull a rabbit out of it, I drive away. You escape through the hedge maze." "That’s a garden patch of hydrangeas, Molly. It's knee-high." "Then you’d better crawl fast," Molly said, tossing him a multicolored wig. "Put it on. We’ve got five minutes before Captain Barnaby pulls up." Bill sighed, pulling the neon-green afro over his ski mask. "I hate birthdays." "You’ll love this one," Molly grinned, sliding her sunglasses down. "Now, let’s go steal some childhood memories."
created_at
"2026-05-20T07:53:30.132Z"
5/20/2026, 7:53:30 AM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
783
moderation_latency
(null)
generation_time
5207
tokens_prompt
77
tokens_completion
771
native_tokens_prompt
67
native_tokens_completion
715
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.0065355
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"emgNavrMEb-tjrEP0-KaiQc"
is_byok
false
latency
783
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779263610-NHQ88PF1BdJ2wUuJFzlq"
session_id
(null)
api_type
"completions"
id
"gen-1779263610-fDFJ0YBum6DO2kalaQJo"
upstream_id
"emgNavrMEb-tjrEP0-KaiQc"
total_cost
0.0065355
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
81.27%
Matches word count
n/a
neededClean
false
words
488
0.00%
Dialogue to Total Word Ratio
Ratio: 69.54%, Deviation: 19.54%
neededClean
false
wordsTotal
499
wordsDialogue
347
40.6364%