NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2G2DVHD5YVABSJ0DDV0GDF
Run Details
Model:
DeepSeek V4 Pro
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
24.8s
Cost:
$0.000419
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: A Stern Welcome" data-color="red"> As I dismounted and led Thunder toward the stable, a shadow fell over me. It belonged to a towering figure clad in dented, practical armor, its surface etched with the memory of a hundred battles. The man's face was a roadmap of scars, and his eyes were the grey of a winter storm. "Frederick," he said, not as a question but a judgment. "I am Sir Roland. Your training is not a game for boys. There will be no coddling here. Muck out that stall first, and we’ll see if you have the stomach for a knight's true work." </nc-section> <nc-section data-title="Version 2: A Kindly Groom" data-color="green"> The scent of hay and horses was the first thing to truly make me feel I had arrived. A stooped, weathered man with straw in his hair looked up from brushing a dappled mare. "Ah, you must be the new squire," he said, his voice a gentle rasp. "Name's Roland, but everyone calls me Sir Roland, though I feel more sir than knight these days. Forgive me, I was just seeing to your mount. A good heart in this one, I can tell. Come, let's get him settled. You look like you could use a kind word and a warm meal." </nc-section> <nc-section data-title="Version 3: The Test in the Yard" data-color="blue"> Sir Roland wasn't in the stables. He was in the training yard adjacent to it, a circle of hard-packed earth. He was a man of sharp angles and controlled energy, spinning a wooden practice sword with a deftness that spoke of decades of discipline. He stopped the blade mid-air as I approached, the point aimed directly at my chest. "You're late, Frederick," he said, his voice sharp as a whetstone. "The sun waits for no squire. Pick up a sword. Your first lesson begins now. Show me you have the fire to be more than just a name on a roster." </nc-section>
created_at
"2026-04-25T14:18:18.549Z"
4/25/2026, 2:18:18 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
862
moderation_latency
(null)
generation_time
10696
tokens_prompt
318
tokens_completion
466
native_tokens_prompt
291
native_tokens_completion
454
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000419485
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"581d2c2e-d429-44df-877c-45deaeb7136b"
is_byok
false
latency
290
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777126698-QPfWUap0RV02jpPl2Yky"
session_id
(null)
api_type
"completions"
id
"gen-1777126698-2Md6dmhYwEALU1HMsFBX"
upstream_id
"581d2c2e-d429-44df-877c-45deaeb7136b"
total_cost
0.000419485
cache_discount
0.00010208
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval