NC Bench
Run: 2026-04-24-01KQ00Z3R0MN48WRGE4JTPH9TD
Run Details
Model: GPT-OSS 120B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 40.6s
Cost: $0.000193
Provider: DeepInfra
Evaluation summary
Score | Evaluator | Details Link
0.00% | Matches word count | (details)
1.83% | Dialogue to Total Word Ratio | (details)
0.9132% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slipped a cheap, glittering mask over her eyes and crouched behind the folding table, scanning the backyard where a rainbow balloon arch swayed in the breeze. A plastic dinosaur bobbed on the lawn, its eyes wide and innocent. "Bill, you see the candy table?" she whispered, her voice a squeak under the thumping pop music. "I see it," Bill replied, adjusting the oversized clown wig that made his head look comically large. "Three rows of cupcakes, a mountain of lollipops, and—" "—the presents, right behind the piñata?" Molly jabbed a finger toward the bright paper monster swaying on a pole. "Exactly," Bill said, tapping his earpiece. "The kids will be distracted when the clown does his magic trick. That’s our window." Molly grinned, a flash of mischief in her eyes. "How long do we have before the parents start panicking?" "Ten minutes tops," Bill muttered, glancing at the timer on his watch. "Then the fire marshal will show up, and the insurance guy will be in his office. We want to be out before any adults notice." "Got it," Molly said, pulling a small sack from her jacket. "What about the security camera?" "The nanny—she's got a phone. I’ll loop the feed with a pre‑recorded video of dancing unicorns," Bill replied, tapping his laptop. "That should keep the goons busy for five minutes." Molly laughed, a short, sharp sound. "You always think of the techy stuff. I’m handling the physical side." "Good," Bill said, stepping closer. "What’s the plan for the presents? We can’t just grab them and run—they’re in a tower of cardboard." "I’ll use the inflatable slide," Molly said, pointing to the plastic slide set up for the kids. "We’ll roll the box down, catch it at the bottom, and dump it straight into the sack." "Slide, huh?" Bill raised an eyebrow. "Are we sure the kids won’t notice the tower collapsing?" "They’ll think it’s part of the game," Molly replied, already picturing the chaos. "When the clown says, ‘Now for the big finale!’ we push. 
Everyone’s eyes will be on the fireworks." "Speaking of fireworks," Bill said, glancing at the portable fireworks rack, "do we have a diversion when the alarm goes off?" Molly pulled a small puffball from her pocket. "One of these, and the kids will scream in delight. They'll think it's a surprise." "Perfect," Bill said, nodding. "And the exit?" "Molly, you know the snake pit?" he asked, smirking. "We’ll slip through the hedges behind it and hit the side gate. Quick and quiet." "I've already cleared the path," Molly whispered, tapping a map on her phone. "No prying eyes." Bill clapped his hands lightly. "All right, then. You take the slide, I’ll loop the cam. When the clown says his line, we move." Molly checked the sack, making sure the zipper was intact. "One more thing—what about the party favors? The little toys everyone wants to keep?" Bill shrugged. "Leave ’em. We’re after the big stuff. Toys are a waste of space." Molly nodded, her grin widening. "Deal. Once we’re out, we split the loot fifty‑fifty. No one gets the clown shoe." "Deal," Bill affirmed, his voice low. "Let’s make this the most unforgettable birthday party ever—just not for the kids." The music swelled, the clown waved a wand, and the two thieves slipped from the shadows, ready to turn a cheerful celebration into a lucrative caper.
created_at: "2026-04-24T15:15:38.378Z" (4/24/2026, 3:15:38 PM)
model: "openai/gpt-oss-120b"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 378
moderation_latency: (null)
generation_time: 40494
tokens_prompt: 77
tokens_completion: 1117
native_tokens_prompt: 114
native_tokens_completion: 990
native_tokens_completion_images: (null)
native_tokens_reasoning: 228
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.000192546
router: (null)
provider_responses:
  0:
    endpoint_id: "85835306-dba3-4334-87bc-ab9ffa5c9c62"
    id: "chatcmpl-RHSClUrtZynYbuf4r9IsPT7Q"
    is_byok: false
    latency: 86
    model_permaslug: "openai/gpt-oss-120b"
    provider_name: "DeepInfra"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777043738-QE4iMrVJyjbGLkcKjvO5"
session_id: (null)
api_type: "completions"
id: "gen-1777043738-7QFrKJIEISqBi16YRoal"
upstream_id: "chatcmpl-RHSClUrtZynYbuf4r9IsPT7Q"
total_cost: 0.000192546
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepInfra"
response_cache_source_id: (null)
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 570
1.83% | Dialogue to Total Word Ratio | Ratio: 61.05%, Deviation: 8.95% | neededClean: false, wordsTotal: 570, wordsDialogue: 348
0.9132% | Overall | |
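The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator can be reproduced from the word counts in its metadata. The sketch below is only an illustration of that arithmetic: the function names are hypothetical, the 70% target is taken from the scenario text, and the deviation is assumed to be the absolute difference from the target. How the evaluator turns the deviation into the 1.83% score is not shown on this page, so it is not reproduced here.

```python
# Hypothetical helpers; not the benchmark's actual evaluator code.

def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of words that fall inside dialogue, as a percentage."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float = 70.0) -> float:
    """Assumed definition: absolute distance from the target percentage."""
    return abs(target_pct - ratio_pct)


# Values from the evaluation metadata: wordsDialogue = 348, wordsTotal = 570.
ratio = dialogue_ratio_pct(348, 570)
deviation = deviation_pct(ratio)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

With the counts above, the printed values agree with the evaluator's "Ratio: 61.05%, Deviation: 8.95%" line.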