NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ951ERMR08MD5G03709YCZP
Run Details
Model:
Gemini 3 Flash (Preview, Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
50.4s
Cost:
$0.026665
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The air inside the 2004 Honda Odyssey smelled faintly of damp dog and desperation. Molly pressed her face against the glass of the passenger window, her binoculars fogging up every time she exhaled. Across the street, an inflatable bounce house shaped like a lopsided castle groaned under the weight of twelve sugar-crazed second graders. It was a chaotic scene of suburban revelry that masked a bounty of untended riches. She adjusted the focus dial, zeroing in on a gift table overflowing with primary-colored wrapping paper and oversized bows. It was a treasure trove of plastic dreams, high-end gadgets, and generic gift cards that she and Bill intended to liquidate by sundown. Check the perimeter again, Bill. We are on a clock, Molly said, her voice a tight whisper. Bill sighed and reached into a greasy paper bag. It is a cul-de-sac, Molly. The only perimeter is a picket fence and a very judgmental mailbox. Want a nugget? Focus, Bill. This is the big one. We are talking high-yield electronics and LEGO sets, she snapped. She returned her gaze to the target house. To any casual bystander, the suburban split-level was a temple of quiet domesticity, but to Molly, it was a high-value fortress protected by a Ring doorbell and a mother who had reached her psychological limit. Molly consulted her map, which was actually a frantic series of scribbles on a pepperoni-stained pizza flyer. She pointed toward the side gate where an aging golden retriever lay pancaked in the grass, his tail occasionally thumping the dirt in a rhythm of pure laziness. He was the only guard on duty. I still think the clown outfit is overkill. I look like a nightmare had a mid-life crisis, and I am not even sure these shoes are legal for driving, Bill remarked while wiping mustard off his ruffled collar. The greasepaint on Bill’s face was already beginning to slide into his thick beard, giving him the appearance of a man who had survived a colorful explosion in a bakery. Molly ignored his discomfort. She was busy checking her cheap digital watch, syncing her internal clock to the inevitable moment when the centerpiece cake would emerge from the kitchen. The cake cutting is the diversion. Once the target begins his assault on the buttercream, the parents will be distracted. That is our window, Molly explained. Bill shook his head. Terminology aside, Molly, that dog looked at me with very suspicious eyes. He is a suburban dog, Bill. He can be bought with a piece of lunch meat. Did you bring the distraction protein? Bill held up a slice of ham. If he has a refined palate, we are golden. If not, I am going to lose a finger. A sudden roar of high-pitched singing erupted from the backyard. The rendition of Happy Birthday had begun. Molly gripped the door handle. She looked at Bill, who was struggling to straighten his oversized red nose while his floppy shoes caught on the floor mats. Remember, no eye contact with the toddlers. They can smell fear. Fine. But if I am forced to play freeze-dance, I am definitely out of here. Molly checked the street one last time before sliding the side door open with a metallic screech. Showtime, she said.
created_at
"2026-02-25T01:01:30.269Z"
2/25/2026, 1:01:30 AM
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1907
moderation_latency
(null)
generation_time
50364
tokens_prompt
77
tokens_completion
3541
native_tokens_prompt
67
native_tokens_completion
8877
native_tokens_completion_images
0
native_tokens_reasoning
8191
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0266645
router
(null)
provider_responses
0
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok
false
latency
1907
model_permaslug
"google/gemini-3-flash-preview-20251217"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981290-cSXfrCbiW8pWyxwRJ4kJ"
upstream_id
(null)
total_cost
0.0266645
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
545
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
557
wordsDialogue
0
0.0000%