Run: 2026-02-25-01KJ95VVEPH367J4FVYEFA6DAX

Run Details

Duration: 29.0s
Cost: $0.016827
Provider: Google AI Studio

Evaluation summary

Score      Evaluator
99.38%     Matches word count
99.66%     Dialogue to Total Word Ratio
99.5181%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
"Did you pack the smoke bombs?" Bill whispered, adjusting his foam tail. "They're bubble machines, Bill. We aren't raiding a cartel." "Bubbles are tactical. They sting the eyes." "It’s a six-year-old’s birthday. We just need to blend in." "I'm wearing a sequined dinosaur suit. Is this blending?" "You look like a professional mascot. Hand me the blueprints." "It’s a crayon drawing of a bouncy castle, Molly." "Precision matters. The loot is in the Southwest quadrant, behind the organic juice boxes." "What if there’s a clown?" "We neutralize him with the high-velocity Silly String." "I don't like his baggy pants. He could be hiding a piece." "A piece of what? Pound cake? Focus. The haul is massive. There is a vintage-edition Lego set." "And the remote-control helicopter?" "That's mine. You get the neon Play-Doh." "Deal. Do I deploy the glitter grenades on entry?" "Only if the toddlers get unruly." "They’re always unruly," Bill muttered, checking his watch. "They’re caffeinated and emotionally unstable." "That's why we brought the juice bribes. Check your perimeter." "Perimeter is clear of parents. Let’s do this. For the plastic gold." "Masks on, Bill." "This dinosaur snout is incredibly itchy." "Work through the pain," Molly said, opening the gate. "Think of the Legos."
created_at: "2026-02-25T01:15:55.229Z" (2/25/2026, 1:15:55 AM)
model: "google/gemini-3-flash-preview-20251217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1773
moderation_latency: (null)
generation_time: 28997
tokens_prompt: 77
tokens_completion: 1932
native_tokens_prompt: 67
native_tokens_completion: 5598
native_tokens_completion_images: 0
native_tokens_reasoning: 5252
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0168275
router: (null)
provider_responses:
0:
endpoint_id: "6e82f652-2693-4b60-bbe0-7362d6e345f0"
is_byok: false
latency: 1773
model_permaslug: "google/gemini-3-flash-preview-20251217"
provider_name: "Google AI Studio"
status: 200
api_type: "completions"
id: "gen-1771982155-x1JUOLRA5EzZJCsAMrdY"
upstream_id: (null)
total_cost: 0.0168275
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"

Evaluation details

Result: 99.38%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 205

Result: 99.66%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 92.42%, Deviation: 2.42%
Meta Data: neededClean: false, wordsTotal: 211, wordsDialogue: 195

Overall: 99.5181%
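
The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator can be reproduced from the two word counts in its meta data. The sketch below is a minimal illustration, not the benchmark's own code: the function names are hypothetical, the 90% target is taken from the prompt, and the deviation is assumed to be the absolute difference in percentage points. How the evaluator converts that deviation into its 99.66% score is not shown on this page, so the sketch does not attempt it.

```python
def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the scene, as a percentage."""
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float) -> float:
    """Absolute gap between the measured ratio and the target (assumed definition)."""
    return abs(ratio_pct - target_pct)


# Values from the evaluator meta data: wordsDialogue = 195, wordsTotal = 211.
ratio = dialogue_ratio_pct(195, 211)
# Target of 90% comes from the prompt ("90% words of dialogue").
deviation = deviation_pct(ratio, 90.0)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 92.42%, Deviation: 2.42%"
```

Both printed values match the Details column for this evaluator. Note that the two evaluators report different totals for the same response (205 words versus 211), which suggests they tokenize the text differently.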