NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-19-01KS0TQZHPVYPD70VKCHFRQB1R
Run Details
Model:
Gemini 3.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
1m 23s
Cost:
$0.204734
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
78.51%
Dialogue to Total Word Ratio
(details)
88.0666%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slammed a dry-erase marker onto the greasy laminate table of their temporary basement headquarters. The room smelled of stale laundry and cheap energy drinks. She adjusted her wire-rimmed glasses and pointed at a scribbled crayon blueprint, which was taped over a dusty dartboard. "We enter through the doggy door," Molly said. "I won't fit, Molly. I had three burritos." Bill sighed, patting his round stomach. "You’re the distraction. Wear the tactical containment suit." He glanced toward the corner, where a garish purple dinosaur costume slumped like a dead mascot, complete with green felt spots and a stained tail. "The Barney suit?" "It’s a legally distinct violet reptile, Bill. You waddle into the backyard at precisely two o'clock. The sugar-crazed toddlers will swarm you." "Like piranhas. They smell of apple juice and malice." "Draw them away from the gift table. That’s your only operational goal." "What about the target's mother? She looks like a retired prison warden." "Brenda? She’ll be crying over the artisanal cupcakes. We have a four-minute window before the clown arrives." "Is the clown in on it?" "No, Bill. If he sees you, we have a turf war on our hands. Dinosaurs and clowns don't mix." Bill stared at a sticky note labeled 'Escape Route' and rubbed his chin in deep, confused thought. "Right. Turf wars are bad for business. But the bouncy castle is blocking our exit." "We grab the loot, leap onto the dome, and bounce over the fence." "Are you insane? That's a seventy-foot bounce." "It’s twelve feet, Bill. Do you want the LEGO Star Destroyer or not?" "I want the Millenium Falcon. It has seven thousand pieces." "Focus! The target is the entire gift pile. We bag it, throw it in the clown's decorated van, and drive away." "Wait, we’re stealing the clown’s van too?" "It's perfect. Nobody suspects a vehicle with a giant plastic daisy on the grill." "What if there's a guard dog?" Molly smirked, digging into her leather jacket to produce a single, limp, uncooked hotdog wrapped in a piece of greasy tinfoil. "They have a poodle. I've prepared a countermeasure." "One hotdog?" "He's tiny. If we give him more, he'll choke. I'm not a monster." "Fair enough. But what if the presents are wrapped?" "Of course they’re wrapped. It’s a birthday party." "What if I grab a box and it's just socks?" "You shake them. Heavy and rattling is Lego. Soft is clothing. Noisy is cheap plastic. We only want high-tier loot." Bill nodded solemnly, adjusting the tight collar of his faded superhero shirt with a heavy sigh. "Understood. Heavy and rattling. Like my skull when I put on that dinosaur head." "Just remember. No juice boxes, no cake, and absolutely no petting the poodle. We are professionals." "Can I at least have a balloon animal if the clown makes me one?" "No, Bill. No balloon animals." Molly rolled up the map and prepared for the greatest helium-inflated heist of the century.
created_at
"2026-05-19T19:17:31.33Z"
5/19/2026, 7:17:31 PM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1426
moderation_latency
(null)
generation_time
83073
tokens_prompt
77
tokens_completion
7160
native_tokens_prompt
67
native_tokens_completion
22737
native_tokens_completion_images
0
native_tokens_reasoning
21993
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.2047335
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"S7cMapD4HZPijMcPip6D4Ag"
is_byok
false
latency
1426
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779218251-OmG5gKbuBu6vy58rxj6q"
session_id
(null)
api_type
"completions"
id
"gen-1779218251-ToVUzMxPudf3EO4QlIOY"
upstream_id
"S7cMapD4HZPijMcPip6D4Ag"
total_cost
0.2047335
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
493
78.51%
Dialogue to Total Word Ratio
Ratio: 72.20%, Deviation: 2.20%
neededClean
false
wordsTotal
500
wordsDialogue
361
88.0666%