NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR90ZCPVAXTSN8Y3HF9B82G3
Run Details
Model:
Gemini 3.1 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4.2s
Cost:
$0.000995
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.3562%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her neon pink wig and checked her reflection in a puddle of spilled apple juice. She looked ridiculous. Bill, however, was already dressed in a full-body plush hippo costume that smelled faintly of despair and cheap polyester. The birthday party was in full swing inside the suburban gymnasium, a cacophony of shrieking children and aggressive balloon-animal crafting. Are you absolutely certain we have to wear the heads? Bill asked, his voice muffled by the thick foam snout of his disguise. I feel like I am suffocating in a sauna of my own regret. If we do not wear the masks, the parents will identify us within seconds, Molly whispered, keeping her eyes fixed on the pile of brightly wrapped loot in the corner. Keep your tail tucked, Bill. You are drawing attention to your backside. I am a hippo, Molly. Hippos have prominent rears. It is part of the aesthetic. Is the getaway vehicle secure? The tricycle is behind the bushes. I have reinforced the basket with industrial grade duct tape. It will hold the haul, provided you stop lumbering around like a drunken tapir. I am not lumbering. I am stealthy. This is stealth-hippo movement. Your stealth-hippo movement is causing the floorboards to creak like a tomb. Look, the clown is turning toward the gift table. This is our window. Bill hesitated, adjusting his oversized mittens. Do we really need to take the ones wrapped in cartoon dogs? Those look like they contain craft kits. I do not want a craft kit. I want the electronics. Take everything, Bill. We are not here for personal enrichment; we are here for the loot quota. If you leave a single bow behind, the Fence will dock our pay again. Even the glitter-covered ones? Especially the glitter-covered ones. Those usually contain high-end gift cards. Now, initiate the distraction. Bill sighed, a long, wheezing sound that rattled the foam head. The distraction is me acting like a mascot, right? Exactly. Go out there and do the limbo. Make them scream for a hippo dance. When they surround you, I grab the pile and sprint for the tricycle. What if they try to touch the belly? I am very ticklish in the belly area. Then you bite them, metaphorically. Stay in character. Do not let them see your watch, and for heaven's sake, do not mention that you have a felony warrant in three counties. I knew this was a bad idea when we bought these costumes from the guy behind the dry cleaners. The ears are uneven, Molly. People will notice the ears. Nobody is looking at the ears, Bill. They are looking at the cake. Now move, you giant purple nuisance. The pinata is descending, which means we have exactly three minutes before the chaos reaches peak levels. If I don't see those presents in the basket by the time the candy showers, you are walking home. Fine. But if I lose a toe to a toddler, you are paying my medical bills. Move it, hippo.
created_at
"2026-05-10T13:24:39.266Z"
5/10/2026, 1:24:39 PM
model
"google/gemini-3.1-flash-lite-20260507"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
596
moderation_latency
(null)
generation_time
4083
tokens_prompt
77
tokens_completion
750
native_tokens_prompt
67
native_tokens_completion
652
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.00099475
router
(null)
provider_responses
0
endpoint_id
"237af107-d2e9-46c7-b146-c74d75980dee"
id
"F4cAaqKzHPH9qtsP3euq2QE"
is_byok
false
latency
596
model_permaslug
"google/gemini-3.1-flash-lite-20260507"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778419479-PChEC68aNo47ZttXUOfV"
session_id
(null)
api_type
"completions"
id
"gen-1778419479-AIH2w3ekOQjkCGL2pLdR"
upstream_id
"F4cAaqKzHPH9qtsP3euq2QE"
total_cost
0.00099475
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
506
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
513
wordsDialogue
0
49.3562%