NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFCC7C1RB37C9RSBMS77FBY
Run Details
Model:
Nemotron 3 Super
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3m
Cost:
$0.000000
Provider:
Nvidia
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
64.67%
Dialogue to Total Word Ratio
(details)
81.6891%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Night had settled over Maple Street, the streetlights flickering like hesitant fireflies. Molly crouched in the driver's seat of their dented van, eyes scanning the quiet cul-de-sac where balloons bobbed lazily from mailboxes. Beside her, Bill tapped a rhythm on the steering wheel, his grin betraying the excitement that simmered beneath his usually stoic facade. The target was simple: a backyard bash for Tommy Turner, whose parents had spared no expense on a mountain of brightly wrapped presents piled beneath a canopy of pastel streamers. Inside, laughter bubbled from children chasing bubbles, while adults mingled with paper plates of cake. Molly and Bill had spent the afternoon observing the routine, noting the guard—a bemused teenager hired to keep the piñata from becoming a projectile—and the single side gate that creaked open with a sigh whenever the wind blew just right. Their plan was simple, almost ridiculous: slip through the gate during the lull when the cake was cut, distract the adults with a faux magician's trick, and make off with the loot before the sugar rush turned into chaos. As the clock edged toward seven, the scent of vanilla frosting drifted on the breeze, reminding them why they’d chosen this particular heist—nothing says ‘easy score’ like a mountain of toys guarded only by sugar‑high kids and a half‑asleep babysitter. They shared a quiet laugh, the kind that only partners in crime understand tonight together. The van’s engine purred softly, awaiting the signal that would launch their sugary siege in silence. "Molly: Okay, Bill, listen up. The gate swings open at exactly 7:15 when the DJ drops the beat for the cake song. We'll slip in then, grab the table near the piñata, and start loading the presents into the duffel bags we hid under the bush. Keep your eyes on the babysitter; she’s dozing but might wake if we make noise." "Bill: Got it. I’ll handle the distraction. I brought a compact speaker that plays a burst of cartoon music—kids will flock to it like moths to a flame. While they’re distracted, you can work the back table where the bigger gifts are stashed. Remember, we only have ten minutes before the parents start the gift‑opening ceremony." "Molly: Ten minutes is plenty. I’ll use the lock‑pick set on the side gate’s latch; it’s rusty but should yield with a gentle twist. After we clear the presents, we’ll meet at the van, toss the bags in, and peel out before the sugar rush turns into a full‑blown tantrum. Any questions?" "Bill: Just one. What if the babysitter wakes up and calls the cops? We need a fallback. I’ve got a fake fire alarm pull‑string in my pocket; if things go south, we yank it, the building alarms, and everyone evacuates—cover for our exit." "Molly: Then we’ll vanish like smoke. I’ll drive the van two blocks away, dump the bags in a dumpster behind the laundromat, and we’ll split the take at midnight. No trace, no trouble—just a quiet payday for us."
created_at
"2026-03-11T21:20:51.598Z"
3/11/2026, 9:20:51 PM
model
"nvidia/nemotron-3-super-120b-a12b-20230311:free"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2007
moderation_latency
(null)
generation_time
180114
tokens_prompt
95
tokens_completion
10335
native_tokens_prompt
82
native_tokens_completion
11945
native_tokens_completion_images
(null)
native_tokens_reasoning
9571
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
0
endpoint_id
"970aecad-f13b-4ec3-9362-e1758aea3e56"
id
"chatcmpl-9b1c9588c1b694b7"
is_byok
false
latency
223
model_permaslug
"nvidia/nemotron-3-super-120b-a12b-20230311"
provider_name
"Nvidia"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773264051-WOrSk7egLi4KNdZxXvDP"
upstream_id
"chatcmpl-9b1c9588c1b694b7"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Nvidia"
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
506
64.67%
Dialogue to Total Word Ratio
Ratio: 52.95%, Deviation: 2.95%
neededClean
false
wordsTotal
508
wordsDialogue
269
81.6891%