NC Bench
Overview
About
Tests
Models
Model
Gemini 2.5 Flash
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-11-18-01KACP8W8J304S4VBKGR7KHMAR
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Alright, Bill, spill it. What’s the mark this time?” “Molly, my dearest, most discerning partner in… acquiring. We’re going deep. Into the heart of pure, innocent joy.” “Oh, God. Not another charity gala. My sequin phobia is acting up just thinking about it.” “Worse. Better. Depends on your perspective. A kid’s birthday party.” Molly choked on air. “A… a what? Bill, are you feeling alright? Did you get a concussion from that last antique shop job?” “Perfectly fine! And perfectly brilliant, if I do say so myself. Think of the haul, Molly. Brand new, factory-sealed goodies! Unplayed-with remote control cars, pristine dollhouses, enough Lego bricks to build a small nation!” “We’re not stealing from children, Bill! We have standards! Shady, morally ambiguous standards, but standards nonetheless!” “Relax! We’re not stealing their joy. We’re just… rehoming the gifts. Liberating them from the tyranny of excessive wrapping paper. Think of it as a pre-emptive regifting service.” “Regifting for profit, you mean. And how exactly do we waltz into a six-year-old’s sugar-fueled nightmare and pilfer their loot?” “Easy. You, my friend, are going as the entertainer. A clown, perhaps? Or a very enthusiastic balloon artist?” “You want me, Molly ‘Shadowcat’ O’Malley, to wear oversized shoes and paint my face? Bill, I will cut you.” “Or… a very convincing party planner looking for a lost party favor shipment. And I… I’ll be the concerned, slightly flustered father, wondering why little Timmy’s laser tag set hasn’t arrived yet.” “Timmy doesn’t have a laser tag set, Bill. He has a pile of presents with his name on them that we are *not* stealing!” “Yet. He doesn’t have one *yet*.”
created_at
"2025-11-18T23:54:51.962879+00:00"
Tuesday, November 18th, 2025 at 11:54:51 PM GMT+00:00
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
369
moderation_latency
(null)
generation_time
3191
tokens_prompt
68
tokens_completion
412
native_tokens_prompt
67
native_tokens_completion
439
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0011176
api_type
"completions"
id
"gen-1763510087-rghg9xfcytuTWhZ75UoK"
upstream_id
(null)
total_cost
0.0011176
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
278
58.4388%
Dialogue to Total Word Ratio
Ratio: 98.56%, Deviation: 8.56%
neededClean
false
wordsTotal
278
wordsDialogue
274
29.2194%