NC Bench
Model: Llama 3.2 1B
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Run ID: 2024-09-26-01J8PZ4N1P5VM25JNZWTTW1M04
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
Here are the alternate prose sections: Version 1 (red): Sir Roland's rugged features creased into a warm smile as he approached me. "Ah, young squire, I see you've finally arrived," he said, offering me a firm handshake. His eyes crinkled at the corners as he eyed my armor. "I trust you've met the castle's master-at-arms, Grimbold? He's eager to hear tales of your... ah, adventurous spirit."
id: "gen-QBE4oBtfcwrdT2NuXFtGEjPR2sGB"
total_cost: 0.0000407
upstream_id: "9c5c8618-9280-44d7-85ba-b9ef8e605c64"
created_at: "2024-09-26T10:48:29.596445+00:00" (Thursday, September 26th, 2024 at 10:48:29 AM GMT+00:00)
model: "meta-llama/llama-3.2-1b-instruct"
app_id: 182717
streamed: true
cancelled: false
provider_name: "Fireworks"
latency: 177
moderation_latency: (null)
generation_time: 258
finish_reason: "stop"
tokens_prompt: 384
tokens_completion: 97
native_tokens_prompt: 308
native_tokens_completion: 99
native_tokens_reasoning: (null)
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0000407
Evaluation details
Result | Evaluator | Details | Meta Data
Fail | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Fail | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Fail | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
0.0000%
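The three evaluators above are regular-expression checks, one per requested color. The sketch below is a Python port of those patterns, intended only to illustrate why the recorded response fails all three. It assumes the benchmark searches for the pattern anywhere in the response (unanchored, as a JavaScript regex test would); the actual harness is not shown on this page. The names `section_pattern`, `evaluate`, `recorded`, and `hypothetical` are illustrative, and the `hypothetical` string is invented for the example, not taken from any run.

```python
import re

COLORS = ("red", "green", "blue")


def section_pattern(color):
    # Python port of the page's JavaScript-style regex; "/" needs no escaping here.
    return re.compile(
        r'<nc-section data-title="[^"]*" data-color="'
        + re.escape(color)
        + r'">[^<]+</nc-section>'
    )


def evaluate(response):
    # Assumption: an unanchored search, one pass/fail result per color.
    return {color: bool(section_pattern(color).search(response)) for color in COLORS}


# Abridged from the recorded response: plain-text labels, no <nc-section> tags.
recorded = (
    "Here are the alternate prose sections: Version 1 (red): "
    "Sir Roland's rugged features creased into a warm smile as he approached me."
)

# Hypothetical response written for illustration only (not from any run).
hypothetical = "".join(
    f'<nc-section data-title="Version {i}" data-color="{color}">'
    f" Sir Roland greets Frederick. </nc-section>"
    for i, color in enumerate(COLORS, start=1)
)

print(evaluate(recorded))      # every color is False
print(evaluate(hypothetical))  # every color is True
```

As written, the patterns are strict: `data-title` must come before `data-color`, separated by exactly one space, and the section body must be non-empty and contain no `<` character. The recorded response labels its text "Version 1 (red):" in plain prose and never emits an `<nc-section>` tag, so none of the three checks can match.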