NC Bench
Run: 2026-03-11-01KKFK3NWX3ZXP2KMZ59TECEVZ
Run Details
Model: Nemotron 3 Super
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3m 41s
Cost: $0.000000
Provider: Nvidia
Evaluation summary
Score | Evaluator
99.74% | Matches word count
99.56% | Dialogue to Total Word Ratio
99.6521% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the fence, eyes on balloons and a clown juggling torches. Bill shifted on damp grass, muttering about the guard who patrols every ten minutes. Sugar and cheap pizza scented the air, tightening their stomachs despite the plan. Molly: "Alright, Bill, we hit the gift table when the kids swarm the piñata." Bill: "Good. I’ll watch for the guard; he does his loop every ten minutes." Molly: "I’ll dress as a party helper, apron and clipboard. No one looks twice." Bill: "What about the parents with cameras?" Molly: "I’ll flash a fake badge, say I’m checking supplies. They’ll buy it." Bill: "And the clown? He’s a distraction." Molly: "I’ll drop a rubber spider on his shoe; he’ll yelp, the music stutters." Bill: "Perfect. Then I’ll slide in, snatch the wrapped presents, and dash to the hedge." Molly: "If a kid spots us, we’ll pretend it’s a game, hand them a balloon." Bill: "Let’s move when the chicken dance starts. That’s our cue." They slipped through fence gap, staying low as the clown swayed. Molly adjusted her fake badge, feeling the stolen watch tick. Bill glanced at the sky, noting the sun’s angle, and whispered the music would shift soon. Molly: "What if parents film everything?" Bill: "We’ll wear silly hats to hide our faces." Molly: "I’ll also pocket sunglasses for shade under the tent." Bill: "The tent’s flaps are loose; we can exit back if needed." Molly: "Cake table holds the big presents near the frosting." Bill: "We’ll grab those first; ignore the kids’ goody bags." Molly: "If alarm triggers, fire exit behind shed leads to alley." Bill: "It’s unlocked; I checked earlier." Molly: "Signal: two taps on the juice box means move." Bill: "Got it. Stay calm, act like we’re helping the host." Molly: "And if a kid spots us, we’ll hand them a balloon and smile." Bill: "Then we melt into the crowd before the cake is cut." Molly: "Remember, the guard’s loop is every ten minutes exactly." The piñata burst, showering candy as children screamed. 
Molly slipped past snack table, her apron brushing gift-wrapped boxes. Bill hovered near hedge, eyes on guard’s footsteps, waiting for the cue. They shared a silent nod, hearts racing with the music. Molly: "The chicken dance just started; that’s our signal." Bill: "I see the guard turning the corner; he’s two minutes away." Molly: "Okay, I’m dropping the spider now—watch the clown’s reaction." Bill: "Clown jumped, music glitched, perfect." Molly: "I’m moving to the gift table, acting like I’m fixing a ribbon." Bill: "I’ve got the first stack of presents; sliding them into my sack." Molly: "Hey, a kid just tugged my leg; I gave him a balloon and winked." Bill: "Second stack secured; the heavy ones are still in the back." Molly: "Guard’s radio crackled; he’s calling for backup, but we’re clear." Bill: "Let’s wrap up and hit the hedge; the alley awaits." Molly: "One last grab—those fancy bikes near the cake." Bill: "Done. Now we run, hats low, hearts pounding."
created_at: "2026-03-11T23:18:31.609Z" (3/11/2026, 11:18:31 PM)
model: "nvidia/nemotron-3-super-120b-a12b-20230311:free"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 272
moderation_latency: (null)
generation_time: 220359
tokens_prompt: 95
tokens_completion: 19419
native_tokens_prompt: 82
native_tokens_completion: 21579
native_tokens_completion_images: (null)
native_tokens_reasoning: 18621
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0
router: (null)
provider_responses:
  0:
    endpoint_id: "970aecad-f13b-4ec3-9362-e1758aea3e56"
    id: "chatcmpl-b2e27dde6a9b271f"
    is_byok: false
    latency: 216
    model_permaslug: "nvidia/nemotron-3-super-120b-a12b-20230311"
    provider_name: "Nvidia"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773271111-JEktiYz9156egNDLjVI6"
upstream_id: "chatcmpl-b2e27dde6a9b271f"
total_cost: 0
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Nvidia"
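The metadata above can be turned into a few derived figures. The following is a minimal sketch, not part of the benchmark itself. It assumes that latency and generation_time are reported in milliseconds and that the run's Duration is roughly their sum; neither assumption is stated in the report, although the sum does round to the listed 3m 41s. The variable names are illustrative.

```python
# Values copied from the run metadata above.
latency_ms = 272                  # assumed to be milliseconds
generation_time_ms = 220_359      # assumed to be milliseconds
native_tokens_completion = 21_579
native_tokens_reasoning = 18_621

# Completion tokens that are not reasoning tokens (the visible scene text).
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Share of the completion spent on reasoning.
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Rough throughput over the generation window.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

# Assumption: Duration ~ latency + generation_time, rounded to whole seconds.
total_seconds = round((latency_ms + generation_time_ms) / 1000)
minutes, seconds = divmod(total_seconds, 60)

print(f"visible tokens:    {visible_tokens}")
print(f"reasoning share:   {reasoning_share:.1%}")
print(f"throughput:        {tokens_per_second:.1f} tokens/s")
print(f"approx. duration:  {minutes}m {seconds}s")
```

Under these assumptions, most of the completion budget went to reasoning tokens rather than to the roughly 500-word scene.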
Evaluation details
Result | Evaluator | Details | Meta Data
99.74% | Matches word count | n/a | neededClean: false, words: 504
99.56% | Dialogue to Total Word Ratio | Ratio: 69.70%, Deviation: 0.30% | neededClean: false, wordsTotal: 505, wordsDialogue: 352
99.6521% | Overall
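The reported ratio and deviation can be reproduced from the word counts. The sketch below is illustrative only: the dialogue_ratio helper is hypothetical, and its counting rule (whitespace-separated words inside straight double quotes) is an assumption, since the report does not describe how the evaluator tokenizes text or how it maps deviation to a score. The only claims checked here are the arithmetic on the reported numbers and that the Overall figure is consistent with the mean of the two evaluator scores.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Hypothetical helper: count words inside straight double quotes.

    Returns (dialogue_words, total_words, ratio). The real evaluator's
    counting rules are not documented in the report.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


# Small made-up sample in the same "Name: "speech"" layout as the response.
sample = 'Molly: "Alright, Bill, we move now." Bill nodded. Bill: "Got it."'
sample_dialogue, sample_total, sample_ratio = dialogue_ratio(sample)

# Figures reported in the evaluation details above.
words_dialogue = 352
words_total = 505
target_pct = 70.0  # from the scenario: 70% dialogue

ratio_pct = words_dialogue / words_total * 100
deviation_pct = abs(target_pct - ratio_pct)

# The Overall score matches the mean of the two evaluator scores
# up to the rounding of the displayed values.
mean_score = (99.74 + 99.56) / 2

print(f"sample: {sample_dialogue}/{sample_total} words of dialogue")
print(f"ratio: {ratio_pct:.2f}%, deviation: {deviation_pct:.2f}%")
print(f"mean of evaluator scores: {mean_score:.2f}%")
```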