Run: 2026-02-20-01KHYB0S48GHPC3QJNMTT23JPJ

Run Details

Duration:
5.7s
Cost:
$0.000037
Provider:
DeepInfra

Evaluation summary

Score Evaluator
Details Link
20.19% Matches word count (details)
0.00% Dialogue to Total Word Ratio (details)
10.0954% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The community hall smelled overwhelmingly of sugar and desperation – desperation, in Molly's case, for a decent payday. Streamers drooped like forgotten dreams from the ceiling, a pathetic confetti graveyard. Bill, ever the pragmatist, was meticulously sketching the room’s layout on a napkin pilfered from a nearby refreshment table. Tiny pastel chairs sat arranged in neat rows, facing a stage adorned with a frankly alarming number of balloons. He circled a particularly large pile of brightly wrapped boxes with a thick red pen. “That’s the jackpot, I reckon,” he mumbled, more to the napkin than to her. Molly kicked at a rogue bouncy castle, the air whooshing out with a mournful sigh. “Kids,” she muttered. Just miniature, sugar-fueled chaos agents. Bill tapped the napkin. “Security's…minimal. One volunteer manning the door, looks like he moonlighted as a scarecrow.” “Right, because terrorizing five-year-olds is a rigorous training ground for security professionals,” Molly deadpanned. “So, inflatable diversion?” Bill nodded, pointing at the partially deflated bouncy castle. “Distraction, then swift retrieval. Present count is…significant.” “Let's just hope none of them have bite.”
created_at"2026-02-20T20:14:20.82Z" 2/20/2026, 8:14:20 PM
model"google/gemma-3-12b-it"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency379
moderation_latency(null)
generation_time5521
tokens_prompt93
tokens_completion321
native_tokens_prompt75
native_tokens_completion264
native_tokens_completion_images(null)
native_tokens_reasoning0
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00003732
router(null)
provider_responses
0
endpoint_id"5aef2a85-3d1c-4659-a522-98bbb5272c6a"
is_byokfalse
latency35
model_permaslug"google/gemma-3-12b-it"
provider_name"Crusoe"
status429
1
endpoint_id"eb06dc92-5a16-47ec-a776-6ef956457c47"
id"chatcmpl-RDne3483kfZDYWgThcsIdxJX"
is_byokfalse
latency50
model_permaslug"google/gemma-3-12b-it"
provider_name"DeepInfra"
status200
api_type"completions"
id"gen-1771618460-aE0GYsmEEFRGyPVk97if"
upstream_id"chatcmpl-RDne3483kfZDYWgThcsIdxJX"
total_cost0.00003732
cache_discount(null)
upstream_inference_cost0
provider_name"DeepInfra"

Evaluation details

Result Evaluator Details Meta Data
20.19% Matches word count n/a
neededCleanfalse
words180
0.00% Dialogue to Total Word Ratio
Ratio: 28.33%, Deviation: 18.33%
neededCleanfalse
wordsTotal180
wordsDialogue51
10.0954%