NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8WJ0Y2YJ5WWAKXQMHY1A1W
Run Details
Model:
Gemini 3.1 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
3.5s
Cost:
$0.000986
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
43.1902%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon plastic slide of the Fun-Time Jungle gym loomed over them like a fever dream of primary colors. Molly adjusted her oversized clown mask, the rubber squeaking against her forehead. Bill was hunched behind a sensory table filled with kinetic sand, clutching a walkie-talkie that smelled faintly of grape juice. Are you sure the haul is worth the potential for a sticky encounter with a toddler, Bill? It is custom LEGO sets, Molly. Specialized, discontinued, rare. There is enough profit to retire to a place where juice boxes are outlawed. I feel ridiculous. I have glitter in my eyebrows and a balloon animal tucked in my waistband. Focus. The mark is a seven-year-old named Timmy. The presents are currently unguarded in the foyer. The parents are occupied with the cake lighting ceremony. What about the clown? He is doing balloon swords. He looks suspicious. He is a freelance contractor making minimum wage. He cannot stop us. Wait for the glitter cannon. That is our cue. I hate glitter. It is the herpes of the craft world. Stop whining and check the perimeter. Do you see the bouncy castle? I see it. It is vibrating with the collective energy of thirty sugar-crazed demons. That is the distraction. Once the cake comes out, the pack will descend upon the sugar. We move toward the pile. And if we trip over a stray rubber ducky? We recover with dignity and grab the haul. Do you have the heavy-duty duffel bag? It is hidden in the ball pit. I had to shove aside a very confused toddler to bury it. Professional. I appreciate that. Remember, the gift pile is under the banner that says Happy Birthday in neon green. I see it. The pile is massive. We are looking for the box with the gold ribbon, correct? Correct. That is the limited-edition Star Destroyer series. Everything else is secondary, but if we can carry the wrapping paper, we take it all. Are we really going to rob a child of his birthday joy? We are liberating high-value assets from a demographic that lacks the cognitive capacity to appreciate their investment value. It is basically a charity act. Your moral compass is pointed directly toward the nearest pawn shop, Bill. It is a functional compass. Check your watch. The singing is starting. They are doing the birthday song. Oh god, the off-key screeching is unbearable. This is a tactical nightmare. Sync your breathing. Deep breaths, Molly. Wait, someone is looking at us. Is that a parent? No, that is a inflatable mascot of a dancing bear. Its eyes are lifeless voids. Ignore it. It just waved at me. I feel like it knows. It is polyester and stuffing. It knows nothing. Now, go. The candles are blowing out. Move, move, move! I am moving! I am moving! My glitter shoes are squeaking on the linoleum. For heaven's sake, Molly, stop squeaking! Just run!
created_at
"2026-05-10T12:07:26.922Z"
5/10/2026, 12:07:26 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
596
moderation_latency
(null)
generation_time
3348
tokens_prompt
77
tokens_completion
747
native_tokens_prompt
67
native_tokens_completion
646
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00098575
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"_3QAas-YCP-Gz7IPmb_5iAo"
is_byok
false
latency
596
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414846-GNpnWD0ZNYVZn7jSSB2b"
session_id
(null)
api_type
"completions"
id
"gen-1778414846-RL6gNpOtRuMg1HyjBT8a"
upstream_id
"_3QAas-YCP-Gz7IPmb_5iAo"
total_cost
0.00098575
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
489
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
498
wordsDialogue
0
43.1902%