NC Bench
Run: 2026-04-24-01KQ04GAMF4CDMJK4A2FB68P5N
Run Details
Model: GPT-OSS 120B
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 1m 46s
Cost: $0.004417
Provider: Together
Evaluation summary
Score      Evaluator
68.10%     Matches word count
98.08%     Dialogue to Total Word Ratio
83.0915%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The night before the Saturday celebration, Molly and Bill crouched behind a stack of folding chairs in the dimly lit community hall. The smell of fresh paint clung to the walls, and the faint hum of the air‑conditioner was the only soundtrack to their whispered schematics. Bill rolled a tiny, battered suitcase onto the floor, its lid creaking like a confession. Inside lay a collection of oddities: a cheap fog machine, a set of oversized bunny onesies, and a plastic clown shoe that squeaked on command. Molly adjusted the strap of her own backpack, the one that normally held grocery lists and cat toys, now bulging with a sack of empty gift boxes she’d borrowed from a thrift shop. She glanced at the calendar pinned to the wall: 9 P.M., twenty‑four kids, a cake shaped like a dinosaur, and a mountain of wrapped presents waiting under a glittering arch. The plan was simple in theory: slip in during the chaos, grab the piles, and disappear before the parents even noticed the missing ribbons. In reality, the only thing they were sure of was that children have a way of turning any mishap into a circus. Molly: "Bill, the party's at the community center at nine; the cake's chocolate and the presents sit by the door." Bill: "Got it. Kids will be busy with balloons, so we slip behind the clowns." Molly: "We need a distraction—maybe release the clown's squeaky shoes and make a racket." Bill: "I'll bring a fog machine; kids will think it's magic." Molly: "Remember, the teacher's watchful eye is on the piñata, not the gift table." Bill: "We’ll wear those oversized bunny onesies from the costume shop; they’ll blend in perfectly." Molly: "And we have to time the heist with the twenty‑minute song play, otherwise the adults will notice." Bill: "I’ll set the alarm clock to go off at exactly ten minutes after the cake cutting." Molly: "If a kid spots us, we’ll hide in the ball pit and act like a dolphin." Bill: "Sounds ridiculous, but it's our cover." 
Molly: "Alright, we meet at the loading dock at eight forty‑five. No mistakes." She slipped the walkie‑talkie from her belt and whispered a gruff affirmation. Bill nodded, his eyes flickering to the hallway where a line of parents would soon file in, each dragging a tote of folded party favors. The clock on the wall ticked toward seven, each second a metronome for their impending caper. Outside, faint echo of a children's choir practiced for the upcoming song—a lullaby that would become the perfect soundtrack for their heist. As the hall lights dimmed, a spotlight flickered above the stage, casting long shadows that stretched like silent accomplices across the polished floor. The duo tightened their grip on the suitcase, feeling the weight of both their plan and the absurdity of stealing birthday joy from a room full of giggling innocence. In that moment, the line between comedy and crime blurred, and the thing they could hear over the distant hum is the echo of their own ridiculous, ambitious whispers.
created_at: "2026-04-24T16:17:28.217Z" (4/24/2026, 4:17:28 PM)
model: "openai/gpt-oss-120b"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 193
moderation_latency: (null)
generation_time: 105878
tokens_prompt: 77
tokens_completion: 7567
native_tokens_prompt: 127
native_tokens_completion: 7330
native_tokens_completion_images: (null)
native_tokens_reasoning: 6790
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00441705
router: (null)
provider_responses:
  0:
    endpoint_id: "c7cc0654-cfe3-4531-bd8a-68cd162ba020"
    id: "ogSbfyq-z1gNr-9f166b1781a61216"
    is_byok: false
    latency: 179
    model_permaslug: "openai/gpt-oss-120b"
    provider_name: "Together"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777047448-bpBZbQQxHVSSmsTDRAtq"
session_id: (null)
api_type: "completions"
id: "gen-1777047448-y0pnEqYgAaTVHXz6fo38"
upstream_id: "ogSbfyq-z1gNr-9f166b1781a61216"
total_cost: 0.00441705
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Together"
response_cache_source_id: (null)
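The token and timing fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated on this page: that generation_time is in milliseconds (105878 ms is about 105.9 s, which agrees with the reported duration of 1m 46s), and that native_tokens_reasoning is counted inside native_tokens_completion. Variable names mirror the metadata keys.

```python
# Figures copied from the run metadata. Units and the relationship between
# the two token counts are assumptions, as noted in the text above.
native_tokens_completion = 7330
native_tokens_reasoning = 6790
generation_time_ms = 105878  # assumed milliseconds (matches the 1m 46s duration)

# Tokens left for the visible scene, assuming reasoning tokens are part of
# the completion count.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Fraction of the completion spent on reasoning.
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Average throughput over the whole generation.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(visible_tokens)               # 540
print(f"{reasoning_share:.1%}")     # 92.6%
print(f"{tokens_per_second:.1f}")   # 69.2
```

Under those assumptions, roughly 540 tokens carried the 514-word scene, and most of the completion budget went to reasoning.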
Evaluation details
Result     Evaluator                      Details                            Meta Data
68.10%     Matches word count             n/a                                neededClean: false, words: 514
98.08%     Dialogue to Total Word Ratio   Ratio: 29.38%, Deviation: 0.62%    neededClean: false, wordsTotal: 514, wordsDialogue: 151
83.0915%   Overall
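The ratio and deviation figures follow directly from the reported word counts. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. The function `dialogue_stats` is hypothetical: the evaluator's actual tokenization, its quote handling, and the formula that turns a deviation into a percentage score are not shown on this page, so none of that is reproduced here. Treating the overall score as the arithmetic mean of the two evaluator scores is also an assumption; it agrees with the reported 83.0915% to two decimals.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.30) -> dict:
    """Hypothetical counter: whitespace-split words, with dialogue taken as
    the words inside straight or curly double quotes."""
    total = len(text.split())
    quoted = re.findall(r'["“]([^"”]*)["”]', text)
    dialogue = sum(len(span.split()) for span in quoted)
    ratio = dialogue / total if total else 0.0
    return {
        "wordsTotal": total,
        "wordsDialogue": dialogue,
        "ratio": ratio,
        "deviation": abs(target_ratio - ratio),
    }


# Arithmetic on the counts reported in the table above.
reported_ratio = 151 / 514
reported_deviation = abs(0.30 - reported_ratio)
print(f"{reported_ratio:.2%}")      # 29.38%
print(f"{reported_deviation:.2%}")  # 0.62%

# Assumption: the overall score is the mean of the two evaluator scores.
overall = (68.10 + 98.08) / 2
print(f"{overall:.2f}%")            # 83.09%
```

For example, `dialogue_stats('Molly said, "Go now." Bill nodded.')` counts 6 words in total and 2 words of dialogue.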