NC Bench
Run: 2026-05-10-01KR8Z3Y96D51VAHS9KQ3WP4FG
Run Details
Model: Gemini 3.1 Flash Lite
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 3.9s
Cost: $0.000950
Provider: Google AI Studio
Evaluation summary
Score      Evaluator
43.38%     Matches word count
0.00%      Dialogue to Total Word Ratio
21.6892%   Overall
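The overall figure is consistent with an unweighted mean of the two evaluator scores. That is an inference from the numbers shown here, not something the page states, and the function name `mean_score` is hypothetical. A minimal sketch of the arithmetic:

```python
def mean_score(scores: dict[str, float]) -> float:
    """Unweighted mean of evaluator scores (assumed aggregation, not confirmed)."""
    return sum(scores.values()) / len(scores)


# Scores as displayed in the summary, in percent (rounded to two decimals).
scores = {
    "Matches word count": 43.38,
    "Dialogue to Total Word Ratio": 0.00,
}

overall = mean_score(scores)
print(f"{overall:.2f}%")
```

The reported 21.6892% is more precise than the rounded inputs allow, so the sketch only reproduces it to two decimal places.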
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The suburban cul-de-sac was silent except for the rhythmic, aggressive chirping of crickets that sounded suspiciously like judgment. Molly crouched behind a manicured hedge of hydrangeas, her tactical turtleneck itching against her neck. Beside her, Bill adjusted his oversized novelty glasses, which were clearly meant for a very small toddler. The invitation they had intercepted was printed on heavy, glitter-dusted cardstock depicting a troop of dancing penguins. Inside the high-fenced perimeter of the backyard, a bouncy castle shaped like a deranged dragon loomed against the moonlight. Are you sure about the ventilation shafts, Bill? Molly whispered, tapping her earpiece. The blueprints are from a coloring book. It is a state of the art playhouse, Bill hissed back, adjusting his belt of stolen juice boxes. The ventilation design is top-tier for preventing heatstroke in rugrats. We go in through the chimney. Molly stared at the structure. It was made of reinforced plastic and had no chimney. It was a playhouse for four-year-olds. We are literally breaking into a toddler’s birthday because you have a vendetta against the local bouncy castle rental industry. It is not just the castle, Bill defended, his voice rising in indignation. It is the principle of the thing. They have a gift pile taller than my childhood hopes and dreams and most of it is artisan-crafted wooden blocks. Those are liquid gold on the black market. Molly sighed, checking the perimeter. A single golden retriever was sleeping near the patio doors, looking entirely unbothered by the potential heist of the century. Is the plan still to distract the parents with the remote-controlled clown car? Yes, Bill said, pulling an intricate, colorful vehicle from his tactical backpack. This bad boy is equipped with a high-decibel siren and a bubble-blowing nozzle. It will create enough confusion that we can swoop in, bag the loot, and be out before the clown realizes he is out of a job. 
You realize we are talking about stealing toys from children whose primary occupation is napping, right? Molly asked, already moving toward the side gate. They do not deserve high-quality craftsmanship, Bill muttered, following closely behind. They are going to draw on them with permanent markers by Tuesday. We are essentially performing a humanitarian service by redistributing these assets to people who actually appreciate the joinery. Molly reached the gate and picked the lock with the professional grace of a woman who had spent too much time in locksmithing forums. The latch clicked open. As they slid into the backyard, a waft of stale cake and balloons hit them. Ahead, the mountain of presents shimmered under the porch light like a plastic treasure hoard. Target acquired, Molly whispered, her heart hammering against her ribs. Let us liberate them, Bill replied, already reaching for the first glitter-wrapped box. I have always wanted a rocking horse, even if it is built for someone who is thirty-six inches tall.
created_at
"2026-05-10T12:52:11.183Z"
5/10/2026, 12:52:11 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
529
moderation_latency
(null)
generation_time
3845
tokens_prompt
77
tokens_completion
771
native_tokens_prompt
67
native_tokens_completion
622
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00094975
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"e38AaoT6FdfVz7IP1-eH-Ag"
is_byok
false
latency
529
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778417531-gL0ncQT8UCHnfnu9lASK"
session_id
(null)
api_type
"completions"
id
"gen-1778417531-96Naeq7r0QL0YhVJVmWj"
upstream_id
"e38AaoT6FdfVz7IP1-eH-Ag"
total_cost
0.00094975
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result     Evaluator                      Details                           Meta Data
43.38%     Matches word count             n/a                               neededClean: false, words: 483
0.00%      Dialogue to Total Word Ratio   Ratio: 0.00%, Deviation: 30.00%   neededClean: false, wordsTotal: 497, wordsDialogue: 0
21.6892%   Overall
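The model response contains no quotation marks, which would explain a `wordsDialogue` of 0 if the evaluator identifies dialogue by quoted spans. That mechanism is an assumption; the page does not describe how the evaluator works. The two evaluators also report different totals (483 and 497), so the whitespace tokenization below is a stand-in and not the benchmark's own word counter. The names `dialogue_ratio` and `deviation` are hypothetical. A rough sketch under those assumptions:

```python
import re

# Matches text between straight or curly double quotes (assumed dialogue marker).
QUOTED = re.compile(r'["\u201c]([^"\u201d]*)["\u201d]')


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) using whitespace splitting."""
    total = len(text.split())
    dialogue = sum(len(span.split()) for span in QUOTED.findall(text))
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


def deviation(ratio: float, target: float = 0.30) -> float:
    """Absolute distance from the target ratio (30% in this scenario)."""
    return abs(ratio - target)


# Unquoted speech, as in the response above: no dialogue words are detected.
unquoted = "Are you sure, Bill? Molly whispered."
# The same line with quotation marks around the spoken part.
quoted = '"Are you sure, Bill?" Molly whispered.'

for sample in (unquoted, quoted):
    words_dialogue, words_total, ratio = dialogue_ratio(sample)
    print(words_dialogue, words_total, f"{ratio:.2%}", f"{deviation(ratio):.2%}")
```

With a ratio of 0 and a 30% target, the absolute deviation is 30 percentage points, which matches the "Deviation: 30.00%" shown in the details.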