NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTYKKSAHTJS25TCTS63FT95
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4.5s
Cost:
$0.000939
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
48.8138%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon green tutu, the scratchy tulle rasping against her thighs. She checked her reflection in the darkened window of the community center. Look at me, Bill. Do I look like a disgruntled kindergarten teacher or a criminal mastermind? You look like a giant lime, Molly. Just keep your head down and stay away from the bouncy castle. The blueprints were plastered to the hood of their van, stained with coffee rings and chocolate frosting. Molly drummed her fingers against the cold steel. The security is just a nineteen-year-old in a polo shirt named Tyler. I can outrun him in these sneakers. Tyler has a pressurized glitter cannon, Molly. One blast and we are permanently marked for identification. Why are we stealing from six-year-olds again? Because they have no situational awareness and the gifts are barely protected. There is a limited edition space rover in that pile, Bill. A space rover that you intend to trade for half a tank of gas and a breakfast burrito. Is that the plan? It is a very good burrito. Just hand me the decoy clown nose. We have three minutes until the pinata ceremony starts. I am not wearing the nose. I have my dignity. Wear the nose, or you are on lookout duty by the juice fountain. Bill sighed, snatching the red foam ball from the dashboard. He jammed it onto his nose, his goggles sliding down his forehead. This is the lowest point of my career. I once burgled a royal vault. Now I am a clown. You are a thief with a goal. Focus. Are the exit routes clear? The back door is propped open with a stack of juice boxes. If we hit the gift table during the cake cutting, we can slide the loot out through the delivery hatch. Perfect. Did you bring the magnetic grabber for the wrapped presents? It is in the bottom of the clown bag. I feel ridiculous holding a balloon animal kit. Stop complaining. It is the perfect cover. Nobody suspects a clown of grand larceny. They will when we leave with forty-two boxes of plastic garbage. What if a kid catches us? Give them a balloon dog and run. It is standard operating procedure, Bill. I hate standard operating procedure. Let us just get in, grab the rover, and get out before the clown college reunion happens. Molly checked her watch, the ticking sound muffled by the party music starting to thump behind the brick walls. She pushed off the hood and adjusted her oversized glasses. Sync watches. We move on my signal. My watch is a digital one that only tells me the date. I am not synced with anything. Then just look for the giant pile of ribbons and follow my lead. Try to act festive for someone who is planning to commit a felony. I will try to keep the scowling to a minimum. Let us rob these children, Molly.
created_at
"2026-03-03T22:55:24.984Z"
3/3/2026, 10:55:24 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
701
moderation_latency
(null)
generation_time
4387
tokens_prompt
77
tokens_completion
724
native_tokens_prompt
67
native_tokens_completion
615
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00093925
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
701
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772578524-JqNmljFqgCB88FiejnZ5"
upstream_id
(null)
total_cost
0.00093925
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
493
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
498
wordsDialogue
0
48.8138%