NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR90N8PS1XJA7337JC0M4MJP
Run Details
Model:
Gemini 3.1 Flash Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
3.9s
Cost:
$0.000888
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
3.62%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
1.8117%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The abandoned gazebo smelled strongly of damp pine needles and impending failure. Molly adjusted the oversized foam clown nose on her face, wincing as the elastic snapped against her cheek. Across from her, Bill was struggling to zip up a bright yellow bird suit that seemed designed for a very thin pigeon rather than a grown man of his particular girth. The fabric strained visibly at the seams, threatening a catastrophic explosion of polyester at the slightest provocation. They sat on a pile of folding chairs, surrounded by crinkled maps of the suburban community center and a stash of glittery glue. Look, Bill, we operate on precision, not frantic flapping, Molly said, tapping a finger on the nursery floor plan. Remember the signal. When I release the streamers, you dive for the pile of wrapped toys near the cake table. No dawdling near the snack buffet. Bill grunted, forcing his oversized beak to stay attached. The kid has a giant inflatable castle standing right between us and the loot, he muttered. If I get trapped in there with a bunch of sugar-crazed toddlers, I am not coming out without a fight. Those little monsters wield juice boxes like projectiles. Molly sighed and checked her watch. We are not there for the children, Bill. We are there for the haul. Specifically, that vintage remote-controlled car the parents bought. Sell that on the secondary market and we are golden for the summer. Just keep your eyes on the shimmering gift wrap. Do not let yourself get distracted by the magician. He is not a real magician, Bill snorted, tugging at his wings. I saw him stuffing a deck of cards into his sock earlier. He is a fraud, just like that velvet vest implies. If he tries to pull a coin from my ear, I am going to lose my professionalism. I have a very low tolerance for sleight of hand when I am trying to commit a felony. Focus, Bill. We move when the clown starts the balloon animal segment. That is our cover. While he is busy twisting rubber into distorted poodles, we secure the bag. We exit through the back fire door, dump the suits in the alley, and vanish before the parents realize the presents did not just walk away. Bill stood up, his bird legs wobbling alarmingly. My feathers are itchy and this beak is blocking my peripheral vision. Are you sure they are not checking IDs at the door? It says here it is an invitation-only gala. Molly grinned, revealing a gold tooth. It is a four-year-old birthday party, Bill. Nobody is checking badges. Just waddle confidently and try not to look like you are plotting a heist. Smile like you love cake. I hate cake, Bill whispered, adjusting his belt. But I love larceny. Let us go catch this mythical bird.
created_at
"2026-05-10T13:19:07.487Z"
5/10/2026, 1:19:07 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
606
moderation_latency
(null)
generation_time
3775
tokens_prompt
77
tokens_completion
687
native_tokens_prompt
67
native_tokens_completion
581
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00088825
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"y4UAaoarK-Wez7IP7NGbgQE"
is_byok
false
latency
606
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778419147-nv5bXVNQw3EpgtRZ3B5x"
session_id
(null)
api_type
"completions"
id
"gen-1778419147-VsnFLKJT8NnTtFDWjSwA"
upstream_id
"y4UAaoarK-Wez7IP7NGbgQE"
total_cost
0.00088825
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
3.62%
Matches word count
n/a
neededClean
false
words
476
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
481
wordsDialogue
0
1.8117%