Run: 2026-03-11-01KKFJQG06WJ021JD9T6BXM8XX

Run Details

Duration:
2m 32s
Cost:
$0.001714
Provider:
Together

Evaluation summary

Score Evaluator
Details Link
43.38% Matches word count (details)
100.00% Dialogue to Total Word Ratio (details)
71.6870% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The sliding door groaned under Bill’s hand. Dust motes danced in the shaft of light piercing through the backyard. Molly crouched behind the hydrangea bush, buttoning her shirt with shaking fingers. Dragon fruit patterns marred her skin like bruises that refused to fade. Beside them, the party lurked behind frosted plastic curtains. Music pulsed out in rhythmic thumps, signaling chaos and sugar. The scent of cotton candy was heavy enough to choke upon inside the room. Bill adjusted his black mask. “Ready?” Molly nodded tight-lipped. “No.” “The carrots are heavy?” Bill ignored the accent. “Monsters watch the shadows.” They crept forward like shadows on the carpet. Year-old stains revealed the floor. The table groaned with Cadbury wrappers stacked like treasure chests. Leo’s teddy bear sat uninvited near the punch bowl. Molly sniffed loudly. “Balloons are sticky.” Bill grinned. “Wrap them up.” They approached the gift rack. Boxes piled high in a tight grid. Blue bows snagged the plastic. Molly counted them silently before reaching out. They took the center piece. The wall hummed with electricity. “Wait.” “No.” “Take it.” Suddenly the lights flickered.
created_at"2026-03-11T23:11:52.338Z" 3/11/2026, 11:11:52 PM
model"qwen/qwen3.5-9b-20260310"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency142
moderation_latency(null)
generation_time152192
tokens_prompt77
tokens_completion8473
native_tokens_prompt76
native_tokens_completion11373
native_tokens_completion_images(null)
native_tokens_reasoning8166
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00171355
router(null)
provider_responses
0
endpoint_id"00d506d9-a5e9-4382-a2b9-d54732365594"
id"oagruyT-6Ng1vN-9dae3da0341a2c4e"
is_byokfalse
latency142
model_permaslug"qwen/qwen3.5-9b-20260310"
provider_name"Together"
status200
user_agent"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer(null)
api_type"completions"
id"gen-1773270712-PLX8xlIiNOjpfnZ9uhH5"
upstream_id"oagruyT-6Ng1vN-9dae3da0341a2c4e"
total_cost0.00171355
cache_discount(null)
upstream_inference_cost0
provider_name"Together"

Evaluation details

Result Evaluator Details Meta Data
43.38% Matches word count n/a
neededCleanfalse
words183
100.00% Dialogue to Total Word Ratio
Ratio: 10.81%, Deviation: 0.81%
neededCleanfalse
wordsTotal185
wordsDialogue20
71.6870%