NC Bench
Run: 2026-04-24-01KQ0AV1QQWPTKPB82VZ99F969
Run Details
Model: GPT-OSS 120B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 1m 34s
Cost: $0.001159
Provider: Novita
Evaluation summary
Matches word count: 100.00%
Dialogue to Total Word Ratio: 14.07%
Overall: 57.0366%
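The page does not show how the evaluator separates dialogue from narration. As a rough illustration only, the sketch below counts words inside double quotes against all whitespace-separated words. The function name `dialogue_ratio`, the quote-matching rule, and the word-splitting rule are all assumptions, not the benchmark's implementation.

```python
import re

def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a prose passage.

    Assumptions (not taken from the benchmark): dialogue is any span between
    straight or curly double quotes, and a word is any whitespace-separated token.
    """
    quoted_spans = re.findall(r'["“]([^"”]*)["”]', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    total_words = len(text.split())
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio

sample = '"We need a distraction," Bill muttered. "Leave that to me," Molly said.'
dialogue_words, total_words, ratio = dialogue_ratio(sample)
print(dialogue_words, total_words, f"{ratio:.2%}")
```

A counter like this would miss speech set in single quotes, so its totals need not match the evaluator's figures for a given response.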
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the straw wall of the backyard, peeking at the brightly colored balloons bobbing in the summer breeze. The smell of frosting and fresh cut grass mingled, making the whole operation smell like a birthday cake heist. Bill adjusted his oversized ski mask, his fingers twitching with excitement. "Molly, you really think we can swipe all those presents without getting caught?" Bill whispered, his voice a conspiratorial hiss. "We've got a plan, Bill, and a map of the cookie table," Molly replied, tapping an imaginary blueprint on her wrist. "Just keep your eyes on the clown." "The clown? He's the real security," Bill muttered, glancing at the rotund man juggling rubber ducks. "We need a distraction." "Leave that to me," Molly said, pulling a squeaky toy from her pocket. She pressed it against the grass and let it squeak loudly. A few children giggled, turning their heads. "Perfect!" Bill whispered, slipping on a pair of glittery slippers he had stolen from a previous gig. "Now we slip in under the 'magic' tent." "Magic? More like a blanket of chaos," Molly snorted. "Remember, the party's kid-to-kid gift exchange starts in ten minutes." "I've got the perfect entry point," Bill whispered, pointing to a side door left ajar. "We crawl through that vent, grab the presents, and vanish like a stolen cupcake." "The cupcakes are safe with us," Molly giggled, her eyes sparkling like confetti. "Just don't forget the fake Santa sack." "Fake Santa sack? I thought you said you had a sack!" "I have one, Bill. It's stuffed with meltable chicks for drama," Molly said, shaking her head. "Drama is our middle name," Bill grinned. "But seriously, how many presents are we talking about?" "A hundred, maybe more. Parents love to overcompensate," Molly replied, counting the gift-wrapped boxes in her mind. "That's a lot of loot, but we can handle it." "We need a timeline. Five minutes for the entry, ten for the grab, and fifteen to exit before the clowns notice." 
"Molly, the kids are gonna notice if we leave the cake untouched," Bill pointed out, tapping the giant chocolate ganache tower. "Then we eat the cake first," Molly declared. "A quick sugar boost will give us the energy we need." "You're right," Bill agreed. "I also brought a rope." "A rope? For what? To swing from the chandelier?" Molly laughed. "No, to tie up the clown's shoes so he can't chase us," Bill explained. "You've thought of everything," Molly sighed, smiling. "Now, let's move." "On your mark, Molly!" Bill shouted, gripping the rope. "'Now!'" Molly whispered, and together they slipped into the tent, the squeaky toy still echoing in the garden as the party erupted in giggles and chaos. The lights dimmed as the kids sang 'Happy Birthday' to an empty cake. Molly lifted a sack full of glittering toys, whispered, 'Mission accomplished.' Bill grinned, 'Now we disappear before Mom finds the missing presents.' They vanished into the night, leaving only laughter behind and frosting in air.
created_at: "2026-04-24T18:08:11.01Z" (4/24/2026, 6:08:11 PM)
model: "openai/gpt-oss-120b"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 288
moderation_latency: (null)
generation_time: 93493
tokens_prompt: 77
tokens_completion: 4765
native_tokens_prompt: 114
native_tokens_completion: 4612
native_tokens_completion_images: (null)
native_tokens_reasoning: 3978
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0011587
router: (null)
provider_responses:
  0:
    endpoint_id: "3dc95f88-3960-4c91-b2bb-bbd2f5069c48"
    id: "96a3833aedc606acf01d5485a1d7198a"
    is_byok: false
    latency: 248
    model_permaslug: "openai/gpt-oss-120b"
    provider_name: "Novita"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777054091-RTIHhirHtleFTxoyJw7v"
session_id: (null)
api_type: "completions"
id: "gen-1777054091-xp5iWjnKfM8yT8qev2qb"
upstream_id: "96a3833aedc606acf01d5485a1d7198a"
total_cost: 0.0011587
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Novita"
response_cache_source_id: (null)
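A few derived figures help when reading the metadata. The sketch below uses only values listed in it and assumes, without confirmation from the page, that `generation_time` is in milliseconds and that `native_tokens_reasoning` is counted inside `native_tokens_completion`. The variable names for the derived values are illustrative.

```python
# Values copied from the run metadata.
generation_time_ms = 93493
native_tokens_completion = 4612
native_tokens_reasoning = 3978
total_cost = 0.0011587

# Assumption: generation_time is reported in milliseconds.
generation_seconds = generation_time_ms / 1000

# Assumption: reasoning tokens are a subset of completion tokens.
visible_tokens = native_tokens_completion - native_tokens_reasoning
reasoning_share = native_tokens_reasoning / native_tokens_completion

tokens_per_second = native_tokens_completion / generation_seconds
cost_display = f"${total_cost:.6f}"

print(f"generation time: {generation_seconds:.1f} s")
print(f"non-reasoning completion tokens: {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
print(f"cost: {cost_display}")
```

Under these assumptions, about 93.5 seconds of generation is in line with the reported duration of 1m 34s, and the six-decimal cost matches the $0.001159 shown in the run details. How the page rounds the duration, or whether it includes latency, is not stated.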
Evaluation details
Evaluator: Matches word count
Result: 100.00%
Details: n/a
Meta Data: neededClean: false, words: 500

Evaluator: Dialogue to Total Word Ratio
Result: 14.07%
Details: Ratio: 56.26%, Deviation: 6.26%
Meta Data: neededClean: false, wordsTotal: 503, wordsDialogue: 283

Overall: 57.0366%
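The reported ratio and deviation can be checked from the word counts, and the overall figure is consistent with a plain average of the two evaluator scores. The sketch below assumes a 50% target (taken from the scenario) and assumes the overall score is an unweighted mean. The page does not show how a 6.26-point deviation maps to the 14.07% score, so that step is not reproduced.

```python
# Values copied from the evaluation details.
words_dialogue = 283
words_total = 503
word_count_score = 100.00
ratio_score = 14.07  # displayed value; the unrounded score is not shown

# Assumption: the target comes from the "50% dialogue" scenario.
target_ratio = 0.50

ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)

# Assumption: the overall score is the unweighted mean of the two scores.
overall = (word_count_score + ratio_score) / 2

print(f"ratio: {ratio:.2%}")
print(f"deviation: {deviation:.2%}")
print(f"overall (mean of displayed scores): {overall:.4f}%")
```

The mean of the displayed scores is 57.035%, which differs from the reported 57.0366% only by what rounding the ratio score to two decimals would explain.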