NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTYZWNQ8KYVKK97PHS1FS48
Run Details
Model:
Gemini 3.1 Flash Lite (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4.3s
Cost:
$0.000933
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
43.1902%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against the cool brick of the community center, her tactical turtleneck itching in the humid evening air. Beside her, Bill was frantically adjusting the straps of a backpack specifically designed to hold a haul of stuffed unicorns and remote-controlled trucks. A muffled cacophony of screeching children and aggressive balloon-popping music bled through the thin walls, acting as the perfect sonar for their impending infiltration. Molly checked her watch, a cheap plastic thing she had stolen from a vending machine, and sighed. The itinerary for Operation Party-Pooper was hanging by a thread because Bill refused to leave his glitter-glue trap kit in the getaway sedan. Are we doing this or are you planning to decorate the loot? Molly hissed, tapping her combat boot against the pavement. Bill looked affronted, smoothing his neon-colored party hat. This is professional grade adhesive, Molly, he whispered back loudly. If a parent tries to intervene, I need something to neutralize the situation without resorting to violence. It is about finesse, not just brute force. Finesse is not sticking a toddler to a wall with craft supplies, she retorted, shoving him toward the side entrance. We hit the gift table, we grab the mountain of gold wrapping paper, and we vanish before the clown finishes his balloon animal demonstration. Do you remember the signal? Two sharp whistles followed by a frantic duck-walk, Bill confirmed, his eyes wide behind his oversized sunglasses. But what happens if the birthday boy catches us? He looks like a biter, Molly. I saw him earlier and he had the ferocity of a caged honey badger. If he bites you, you handle it, Molly said, checking the perimeter one last time. You are the distraction specialist. I am the procurement expert. Keep the honey badger occupied while I liberate the loot. The side door creaked as Molly nudged it open with the toe of her boot. A blast of sugary air and the smell of cheap cake flooded the alleyway. Inside, they could see the vibrant chaos of a dozen children vibrating with a sugar-induced frenzy. A pile of glittery packages sat unguarded near the coat rack, a mountain of plunder waiting for their arrival. Remember, Bill whispered as they crouched behind a decorative cardboard castle, no witnesses. We are stealing presents from a seven-year-old, not robbing the federal reserve, Molly muttered, bracing herself for the sprint. Just keep your head down and stay away from the cupcake tower. They moved in tandem, shadows shifting across the linoleum floor. Bill took point, his glitter glue ready in his pocket, while Molly scanned for the largest, most expensive-looking boxes. The mission was absurd, arguably the lowest point in their illustrious career of petty larceny, but the potential resale value of pristine unopened toys was too tempting to resist. As they neared the table, Molly felt the familiar rush of a heist taking shape. Tonight, they would be kings of the playroom.
created_at
"2026-03-03T23:02:07.3Z"
3/3/2026, 11:02:07 PM
model
"google/gemini-3.1-flash-lite-preview-20260303"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
642
moderation_latency
(null)
generation_time
4169
tokens_prompt
77
tokens_completion
761
native_tokens_prompt
67
native_tokens_completion
611
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.00093325
router
(null)
provider_responses
0
endpoint_id
"e004ce39-eb0c-4474-93d5-55072e725970"
is_byok
false
latency
642
model_permaslug
"google/gemini-3.1-flash-lite-preview-20260303"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1772578927-QtoTY6VlrljoEtbhnFX0"
upstream_id
(null)
total_cost
0.00093325
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
489
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
499
wordsDialogue
0
43.1902%