NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8VYJ9MVJMJEEBEF8T33DNF
Run Details
Model:
Gemini 3.1 Flash Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4.0s
Cost:
$0.001031
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
47.9934%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled like wet cardboard and desperate ambition. Molly adjusted her oversized clown mask, the latex squeaking against her forehead. Bill checked his watch, his knuckles white against the steering wheel. Do you have the floor plan? Molly whispered. I have a crayon drawing I swiped off the fridge. It says the pile of loot is behind the bouncy castle, Bill muttered. A bouncy castle? You told me this was a high security gala. It is a high security toddler birthday in a gated HOA community, Molly. The perimeter is guarded by three golden retrievers and a nanny named Tiffany. Is Tiffany armed? She has a whistle and a lukewarm latte. We are pathetic, Bill. I used to rob banks. Now I am casing a seven-year-old named Brayden. It is about the volume, Molly. There are forty-two guests. That is forty-two toys, high-end electronics, and probably a gift card or two. We are talking professional grade haul here. My mask is suffocating me. Why do I have to be the clown? Because you are the one who insisted on the diversion. I insisted on a smoke bomb, not a balloon animal kit. I do not even know how to make a dog. A snake is fine. They will not know the difference. Just get in there, hand out the prizes, and navigate the pile toward the mudroom. And then what? I just walk out with a giant robot in a sack? They will be distracted by the sugar crash. It is all about timing. The moment the cake starts getting sliced, we strike. What if the dog barks? The dog is a Pomeranian named Princess. Step over it. You are cold, Bill. Truly heartless. I am a businessman. Now, check your internal comms. Can you hear me? Only the sound of your heavy breathing. That is focus. Are you ready? For a room full of sugar-crazed children? No. Are the zip-ties ready? For the presents or the kids? The presents, Molly. For God's sake. You never know. Sometimes these kids are like piranhas. I might have to secure the perimeter. Just get the bags filled. I will have the van idling near the hedge. If I get glitter in my eye, I am going to have to rethink our partnership. If you get glitter in your eye, just steal the glitter. It is craft gold. You are the worst mentor I have ever had. And you are the only one who can fit through the window, so put on the oversized shoes and get moving. Fine. But I am keeping the laser tag set for myself. Don't push it. Molly climbed out of the van, her giant feet tripping over the curb. She looked back one last time, her expression hidden behind the manic painted grin of the mask. Bill gave her an impatient thumbs up and pulled his balaclava tight. She turned toward the dark suburban house, a thief in the night, ready to ruin a birthday party for the sake of a few plastic toys.
created_at
"2026-05-10T11:56:49.339Z"
5/10/2026, 11:56:49 AM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
532
moderation_latency
(null)
generation_time
3897
tokens_prompt
77
tokens_completion
741
native_tokens_prompt
67
native_tokens_completion
676
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00103075
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"gXIAapL9IKyez7IPyLygoQg"
is_byok
false
latency
532
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414209-b2dWXIVEZOiByC4FPQw7"
session_id
(null)
api_type
"completions"
id
"gen-1778414209-BYPVM11YCKT4PB5fZ2D1"
upstream_id
"gXIAapL9IKyez7IPyLygoQg"
total_cost
0.00103075
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
508
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
515
wordsDialogue
0
47.9934%