NC Bench
Overview
About
Tests
Models
Model
Mistral Nemo 12B Celeste
Test
Tool usage within Novelcrafter
Scenario
Create alternate prose sections
Run ID
2024-08-09-01J4VTA46PP35B8TYX1FD94P41
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
Version 1 (red): Sir Roland, the muscular knight, approached Frederick at the stables, tugging down his chainmail glove. "Ah, you must be the new squire," he said, eyeing Frederick's simple armor. "Welcome to our kingdom, lad. I'll be your mentor and trainer." Version 2 (green):
id
"gen-4NvUC96DrMaVqVPRPoQBLEv4ROgC"
total_cost
0.000573
upstream_id
"cmpl-7012715e6ed7418294e597b89b17ea52"
created_at
"2024-08-09T14:57:31.635897+00:00"
Friday, August 9th, 2024 at 2:57:31 PM GMT+00:00
model
"nothingiisreal/mn-celeste-12b"
app_id
182717
streamed
false
cancelled
false
provider_name
"Infermatic"
latency
935
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
301
tokens_completion
72
native_tokens_prompt
309
native_tokens_completion
73
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.000573
Evaluation details
Result
Evaluator
Details
Meta Data
Fail
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Fail
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Fail
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
0.0000%