NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ00SBEG9W18PSJ93A1CV69K
Run Details
Model:
GPT-OSS 120B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
33.5s
Cost:
$0.000232
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van was parked under a clump of maple trees, its paint faded from a lifetime of sneaky midnight runs. Inside, the air smelled like stale coffee and the faint ozone of a busted neon sign that had once advertised a “Turbo Taco Tuesday.” Molly checked the zip‑tied bag of glittery black gloves, then the walkie‑talkie that had been patched together from an old baby monitor and a dead‑beat radio. Bill hovered by the window, peering at the backyard where a banner flapped lazily in the evening breeze, reading “Hooray! It’s Lily’s 7th!” “It looks like a party, not a vault,” Molly said, her voice a mixture of excitement and contempt. Bill rolled his eyes. “Exactly the point. Kids are the least suspicious when they’re distracted by cake.” Molly laughed, the sound echoing off the van’s dented metal. “Kids are like ninjas with sugar. They’ll give us the perfect cover if we time it right.” She spread a crumpled map of the yard on the passenger seat. The layout showed a kiddie pool, a bounce house, and a table piled high with presents wrapped in everything from dinosaur‑themed paper to glittery pink bows. “The bounce house is our entry point,” Bill noted, tapping a crayon‑drawn arrow. “We slip in, grab the swag, and disappear before the clown—if there even is one—gets a chance to wiggle his oversized shoes on the floor.” Molly tapped the side of her nose. “And the clown’s red nose will be the perfect distraction for the parents. I’ll pull the string on that inflatable unicorn while you siphon the gift pile.” Bill smirked. “Just promise me you won’t get caught stealing candy. I have a reputation to keep.” She rolled her eyes. “Your reputation? The only thing you’re known for is stealing the last slice of pizza from the breakroom and calling it a ‘core strategy.’” He chuckled, then grew serious. “All right, checklist. Gloves—on. Mask—on. Silence—on. No toddlers awake.” Molly produced a tiny silver whistle. “This is the ‘panic button.’ One short blow and we’ll all pretend the power went out. Kids love that.” Bill nodded. “And the parents will think it’s a ‘cool blackout’ and not suspect the party crashers.” She checked the pocket watch she'd stolen from a side‑walk magician. “We have exactly twelve minutes before the parents start the gift‑opening ceremony. That’s our window.” Bill rubbed his chin. “Do we have a backup plan if the unicorn inflates itself and blocks the exit?” Molly tapped the watch. “Then we’ll improvise. I’ve got a pair of karaoke headphones. I’ll blast ‘Baby Shark’ on repeat. Parents will be too busy trying to stop the ear‑worm to notice the missing presents.” He laughed. “You’re brilliant. Let’s do this.” Molly slipped the bag of gloves into her coat. “Remember, we only take the gifts. No bribes for the kids. We’re professional thieves, not Santa’s assistants.” Bill lifted his hand, mimicking a magician’s flourish. “Abracadabra, Molly. Let’s make these presents disappear.” The engine roared as they pulled away, the van’s rear hatch opening to reveal a stash of empty cardboard boxes printed with cartoon bears. The night air was thick with anticipation, and somewhere in the distance a child’s laugh floated over the yard, promising a chaotic symphony of sugar‑high mischief. The heist was about to begin, and neither Molly nor Bill could suppress the grin spreading across their faces.
created_at
"2026-04-24T15:12:30.036Z"
4/24/2026, 3:12:30 PM
model
"openai/gpt-oss-120b"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
271
moderation_latency
(null)
generation_time
32783
tokens_prompt
77
tokens_completion
1440
native_tokens_prompt
114
native_tokens_completion
1196
native_tokens_completion_images
(null)
native_tokens_reasoning
500
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000231686
router
(null)
provider_responses
0
endpoint_id
"85835306-dba3-4334-87bc-ab9ffa5c9c62"
id
"chatcmpl-R8FLJOu3wCUtphtfhoXvU8Ao"
is_byok
false
latency
80
model_permaslug
"openai/gpt-oss-120b"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777043550-sbrHDP5FnqjrlhkRFYH0"
session_id
(null)
api_type
"completions"
id
"gen-1777043550-WdUklOA8iEvcsUkhTKe5"
upstream_id
"chatcmpl-R8FLJOu3wCUtphtfhoXvU8Ao"
total_cost
0.000231686
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
568
0.00%
Dialogue to Total Word Ratio
Ratio: 49.47%, Deviation: 19.47%
neededClean
false
wordsTotal
568
wordsDialogue
281
0.0000%