NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR91P0KVQ6DEEEG09TACX6PB
Run Details
Model:
Gemini 3.1 Flash Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4.5s
Cost:
$0.001004
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
48.8138%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The suburban lawn was a carpet of fake emeralds, manicured to within an inch of its life. Molly leveled her binoculars at the sprawling Victorian house, squinting through the morning fog. Inside, the muffled sound of a sugar-addled mob of seven-year-olds suggested the drop-off time was imminent. Bill adjusted his velcro-strapped utility belt, which currently held a ball of industrial twine and a collapsible hook modeled after a garden trowel. He looked like a budget superhero having a midlife crisis. Molly muttered, keeping her eyes fixed on the target window. Bill, the perimeter is clear, but the inflatable castle is acting as an unplanned sentry. We have a thirty-second window once the clown enters the kitchen for a bathroom break. Bill groaned, shifting his weight. That clown looks like he has seen things, Molly. I do not like his eyes. Are we certain the haul consists of high-end loot? I cannot risk a felony for a pile of plastic building blocks and half-eaten cupcakes. Molly sighed, lowering the glass. The intel states there is a remote-controlled drone in that stack and at least three limited-edition designer watches gifted by a grandmother who clearly does not understand child psychology. It is a goldmine, Bill. Focus. Did you bring the distraction? Bill patted his vest, pulling out a small, quivering box. It contains a highly caffeinated squirrel I caught in the park this morning. Once I release it near the buffet, the carnage will be total. No child will look at a gift pile while that rodent is loose among the fruit punch. That is reckless, Molly said, checking her pocket watch. But I suppose it lacks the subtle elegance we usually strive for. Fine. We move on my signal. Stay low and avoid the bouncy castle. That thing is a tactical nightmare. The clown exited the front door, looking mournful, and headed toward the back garden. Molly tapped her comms. Go, go, go. Move like you are avoiding a tax audit. They sprinted across the dew-covered grass, ducking behind a row of oversized hydrangea bushes. Bill stumbled, his utility belt clattering like a box of dry silverware, but he recovered with a frantic, ungraceful roll that landed him inches from the basement bulkhead. The chaos erupted inside as the squirrel breached the perimeter, followed immediately by a high-pitched, collective shriek from forty children. Bill breathed, wiping sweat from his forehead. That worked better than I anticipated. Should I be concerned about the squirrel? Do not anthropomorphize the getaway animal, Bill, Molly hissed, prying the bulkhead open with a professional shim. Grab the gold-wrapped bundles first. Leave the stuffed animals. They are bulky, trackable, and carry too much sentimental baggage. Bill scrambled inside, his eyes widening at the mountain of paper-clad riches. We are going to be legends, Molly. Forget the watches. Look at the size of this toy truck box. Just grab the loot, you amateur, Molly growled, sliding into the darkness after him. We have three minutes before they realize cake is not a viable defense strategy.
created_at
"2026-05-10T13:37:00.547Z"
5/10/2026, 1:37:00 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
590
moderation_latency
(null)
generation_time
4388
tokens_prompt
77
tokens_completion
787
native_tokens_prompt
67
native_tokens_completion
658
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00100375
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"_IkAatu6LOmHqtsPgeqI0AE"
is_byok
false
latency
590
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778420220-eYXUSYfRC4AOKoBYUztI"
session_id
(null)
api_type
"completions"
id
"gen-1778420220-QXNHWRzMLICXjDn3ts2c"
upstream_id
"_IkAatu6LOmHqtsPgeqI0AE"
total_cost
0.00100375
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
507
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
521
wordsDialogue
0
48.8138%