NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ091YNV7K5QTZH3JTXJSW3M
Run Details
Model:
GPT-OSS 120B
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
1m 43s
Cost:
$0.000732
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
99.63%
Dialogue to Total Word Ratio
(details)
49.8133%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slipped the cheap plastic mask over her nose and smirked at the whispering hallway. The birthday banners fluttered like nervous flags above the room, and the scent of frosting clung to the air like a guilty secret. Bill crouched behind the inflatable dinosaur, his gloved hand hovering over the stack of brightly wrapped parcels. He tapped a finger against the laminated schedule taped to the wall: twenty‑three minutes until the magician vanished and the cake was cut. “Ready?” Molly hissed, her voice a squeak through the mask. “Just three more seconds,” Bill replied, eyes flicking to the clock. Molly whispered, “Grab the glitter balloons, they’ll mask our footprints.” Bill muttered, “And the piñata—sweet loot for the getaway.” She slipped a tiny screwdriver from her pocket and began prying the latch on the side door, while Bill surveyed the treasure trove of toys and candy‑wrapped gifts. Their breath rose in shallow puffs, matching the jittery giggles of the unattended kids sprawled on the carpet. The room was a kaleidoscope of colors, each present promising a small fortune in stolen joy. With a soft click, the door yielded, and they slipped inside, shadows merging with the chaos of balloons, ready to filch every ribboned box before anyone noticed the sudden disappearance of the party’s sparkle. They vanished like confetti in wind. Inside the hall, plush animals and candy bars glimmered under harsh lights. Molly stuffed a sack with the biggest teddy bear and a bundle of stickers. Bill slipped chocolate coins into his pockets, grin widening. They slipped past a half‑deflated clown balloon and out the back door, unnoticed as children chased bubbles while music thumped loudly everywhere now.
created_at
"2026-04-24T17:37:00.107Z"
4/24/2026, 5:37:00 PM
model
"openai/gpt-oss-120b"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
353
moderation_latency
(null)
generation_time
102805
tokens_prompt
77
tokens_completion
3949
native_tokens_prompt
114
native_tokens_completion
3827
native_tokens_completion_images
(null)
native_tokens_reasoning
3499
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000731576
router
(null)
provider_responses
0
endpoint_id
"85835306-dba3-4334-87bc-ab9ffa5c9c62"
id
"chatcmpl-R9tzwYf6fc4rNsWh9QsLPjvX"
is_byok
false
latency
88
model_permaslug
"openai/gpt-oss-120b"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777052220-uyBkbkBptO6WAEGp4Awa"
session_id
(null)
api_type
"completions"
id
"gen-1777052220-Ib1tiJ2QiNrewUgMt8IC"
upstream_id
"chatcmpl-R9tzwYf6fc4rNsWh9QsLPjvX"
total_cost
0.000731576
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
279
99.63%
Dialogue to Total Word Ratio
Ratio: 7.53%, Deviation: 2.47%
neededClean
false
wordsTotal
279
wordsDialogue
21
49.8133%