NC Bench
Run: 2026-04-25-01KQ2FJEHGH9FS53XSZJX9TRCP
Run Details
Model: DeepSeek V4 Pro
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 11.6s
Cost: $0.000351
Provider: DeepSeek
Evaluation summary
Score | Evaluator
Pass | Match blue prose section
Pass | Match green prose section
Pass | Match red prose section
no eval | Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="A Stern Welcome" data-color="red"> Sir Roland stood rigid as an iron gate before the stable entrance, his scarred hand resting upon the pommel of his broadsword. His eyes, cold and grey as winter stone, swept over me with visible disdain. "You're late," he growled, the words scraping like gravel. "The castle has no use for squires who cannot keep time. Stow your horse and report to the armory. We'll see if you've the mettle to last the week." </nc-section> <nc-section data-title="A Gentle Hand" data-color="green"> Sir Roland emerged from the stable with a warm, easy smile, brushing hay from his tunic. He was older than I'd imagined, his beard streaked with silver, but his eyes sparkled with quiet mirth. "Ah, young Frederick! We've been expecting you." He reached up and clasped my hand as I dismounted. "Come, let's see to your horse together. A squire who tends his mount with kindness will never want for a loyal friend." </nc-section> <nc-section data-title="A Rushed Encounter" data-color="blue"> Sir Roland strode towards me with quick, clipped steps, barely glancing up from the parchment clutched in his gauntleted hand. "Frederick? Good, good. The steward will assign you quarters, and the stablemaster can handle the horse." He paused, squinting towards the western tower where a bell had begun to toll. "Trouble on the border. I've no time for pleasantries. Make yourself useful—find the captain of the guard and tell him Sir Roland rides within the hour." And with that, he was gone, his cloak billowing behind him. </nc-section>
created_at
"2026-04-25T14:09:20.948Z"
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1266
moderation_latency
(null)
generation_time
11547
tokens_prompt
318
tokens_completion
403
native_tokens_prompt
291
native_tokens_completion
375
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000350755
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"c541787b-4ed0-4c7a-a030-2e2ffcbf59c0"
is_byok
false
latency
275
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777126160-tkYyk1PPvyhxm3y29Cj5"
session_id
(null)
api_type
"completions"
id
"gen-1777126160-DdHcNHwHzwbZRcquBB23"
upstream_id
"c541787b-4ed0-4c7a-a030-2e2ffcbf59c0"
total_cost
0.000350755
cache_discount
0.00010208
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
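The three evaluators above are plain regular-expression checks, one per color. The following is a minimal sketch of how such checks could be reproduced in Python. It is not the benchmark's own code: the helper names `section_pattern` and `evaluate` are hypothetical, the sample text is abbreviated from the model response, and the JavaScript-style `/` delimiters and `\/` escape are dropped because Python does not need them.

```python
import re


def section_pattern(color: str) -> "re.Pattern[str]":
    """Build the per-color pattern listed in the evaluation details."""
    return re.compile(
        r'<nc-section data-title="[^"]*" data-color="'
        + re.escape(color)
        + r'">[^<]+</nc-section>'
    )


def evaluate(response: str, colors=("blue", "green", "red")) -> dict:
    """Return, for each color, whether a matching section is present."""
    return {c: bool(section_pattern(c).search(response)) for c in colors}


# Abbreviated stand-in for the model response (not the full prose).
sample = (
    '<nc-section data-title="A Stern Welcome" data-color="red"> '
    '"You\'re late," he growled. </nc-section> '
    '<nc-section data-title="A Gentle Hand" data-color="green"> '
    '"Ah, young Frederick!" </nc-section> '
    '<nc-section data-title="A Rushed Encounter" data-color="blue"> '
    '"Frederick? Good, good." </nc-section>'
)

print(evaluate(sample))
```

Under these patterns a section matches only if `data-title` precedes `data-color` and the body is non-empty and contains no `<` character, so a response with swapped attributes or nested markup inside a section would not match.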