NC Bench
Run: 2026-02-25-01KJ9554W0SETYNSFTDT97679F
Run Details
Model: Gemini 2.5 Flash (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 1m 5s
Cost: $0.022745
Provider: Google AI Studio
Evaluation summary
Score | Evaluator
0.00% | Matches word count
87.76% | Dialogue to Total Word Ratio
43.8802% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly tapped a pen on a crudely drawn map of a suburban house, spread across their cramped kitchen table. Empty coffee cups and biscuit crumbs were pushed to the side. "Tiffany's party. You’re sure of this, Bill?" Bill hummed, meticulously polishing a small, ornamental crowbar with a handkerchief. His eyes were sharp as he stared at the map. "Positive. Every bouncy castle, fairy-floss stand, every parent. Full recon." "Parental unit? It's a seven-year-old's birthday, not a military target." Molly's tone bordered on disbelief. "Mrs. Henderson. Ex-military. Rumour has it she disarms toddlers with a glare. And the prize table? Tripwires." Molly blinked. The pen paused mid-air. "Lasers around presents? Good lord." "High-end stuff, Molly. Furbies, Timmy's new VR headset. Serious loot," Bill argued, stroking the crowbar. "Plastic garbage. We're not going into active combat for a Furby." Molly rubbed her temples. "Ah, Princess Fluffybutt. Chihuahua of terror. Loud bark. Tripped a mailman once." Molly picked up her cold coffee. "So, your plan for the mom, lasers, motion sensor, and fluffy menace?" "Simple. Distraction, infiltration, exfiltration. We need a clown." Bill puffed out his chest, a flicker of pride. Molly laid the pen down with a sigh. "A clown. For petty larceny, really?" "Squeaky Pete. He's got charisma. He draws kids, we grab loot." Bill gestured. "Melting into the suburban night sounds elegant for Paw Patrol merchandise and a VR headset," Molly muttered, picturing the indignity. "Details, Molly. Timing. Mrs. Henderson steps away for her 'special' iced tea, Pete unleashes the balloon tornado, we go." "And the lasers?" "Broomstick trick. Again." Bill announced, beaming. "It nearly got us electrocuted last time, Bill. Flamingo incident?" "Minor oversight. This time, industrial-strength duct tape for sensors. Foolproof." Bill brandished a roll. Molly massaged her temples. "Duct tape and a clown for this kiddie plastic heist?" "Precisely. 
Getaway vehicle: the ice-cream truck." "The one that still plays 'Pop Goes the Weasel' at ear-splitting volume?" "Adds to chaos. Molly, nobody suspects the ice-cream man." Bill winked. "Nobody suspects the *competent* ice-cream man. You driving that thing, dead giveaway." "I've practiced my jingle: 'Ice Cream, Ice Cream, stolen at high speed!'" Bill hummed off-key. "No. Absolutely not. We are not broadcasting our crimes. Get presents, get out. Quietly. No balloon animals in our hideout." Molly pointed a stern finger. Bill chuckled, undeterred. "Spoilsport. Imagine Timmy’s face when his Furby’s gone." Molly just sighed, staring blankly at the map. "Broomstick. Clown. Ice cream truck. Let's pray these presents are worth this utter humiliation."
created_at: "2026-02-25T01:03:31.206Z" (2/25/2026, 1:03:31 AM)
model: "google/gemini-2.5-flash"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1743
moderation_latency: (null)
generation_time: 65181
tokens_prompt: 77
tokens_completion: 15058
native_tokens_prompt: 67
native_tokens_completion: 9090
native_tokens_completion_images: 0
native_tokens_reasoning: 8411
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0227451
router: (null)
provider_responses
0
endpoint_id: "16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok: false
latency: 1743
model_permaslug: "google/gemini-2.5-flash"
provider_name: "Google AI Studio"
status: 200
api_type: "completions"
id: "gen-1771981411-dYUzyTrBZOnn20cgm5Cn"
upstream_id: (null)
total_cost: 0.0227451
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
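The metadata above can be cross-checked against the run details with a little arithmetic. The sketch below is illustrative only: it assumes that generation_time is reported in milliseconds and that native_tokens_completion includes the reasoning tokens, neither of which is stated on this page. The variable names are made up for the example.

```python
# Values copied from the metadata above.
generation_time_ms = 65181        # assumption: milliseconds
native_tokens_completion = 9090   # assumption: includes reasoning tokens
native_tokens_reasoning = 8411

# Convert the generation time to the "Xm Ys" form used in the run details.
minutes, seconds = divmod(generation_time_ms // 1000, 60)
duration = f"{minutes}m {seconds}s"

# Tokens left for the visible response once reasoning tokens are subtracted.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Share of completion tokens spent on reasoning, in percent.
reasoning_share_pct = 100 * native_tokens_reasoning / native_tokens_completion

print(duration)
print(visible_tokens)
print(round(reasoning_share_pct, 1))
```

Under those assumptions the computed duration agrees with the "1m 5s" shown in the run details, and most of the completion tokens went to reasoning rather than to the visible scene.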
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 415
87.76% | Dialogue to Total Word Ratio | Ratio: 68.38%, Deviation: 1.62% | neededClean: false, wordsTotal: 427, wordsDialogue: 292
43.8802% | Overall
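The ratio, deviation, and overall figures can be reproduced from the numbers reported above. This is a minimal sketch, not the benchmark's own code: it assumes the 70% target comes from the scenario text and that the overall score is the unweighted mean of the two evaluator scores. How the 1.62% deviation maps to the 87.76% score is not shown on this page, so that step is left out.

```python
# Values copied from the evaluation details above.
words_dialogue = 292
words_total = 427
target_ratio_pct = 70.0  # assumption: taken from "Write 500 words with 70% dialogue"

# Share of words inside dialogue, in percent.
ratio_pct = 100 * words_dialogue / words_total

# Absolute distance from the target, in percentage points.
deviation_pct = abs(target_ratio_pct - ratio_pct)

# Assumption: the overall score is the plain mean of the evaluator scores.
evaluator_scores = [0.00, 87.76]
overall_pct = sum(evaluator_scores) / len(evaluator_scores)

print(round(ratio_pct, 2))
print(round(deviation_pct, 2))
print(round(overall_pct, 2))
```

The rounded results agree with the reported ratio and deviation, and the mean matches the reported overall score to two decimal places; the extra digits in 43.8802% suggest the page averages unrounded scores.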