NC Bench

A comprehensive benchmark for creative writing models.

Created by Novelcrafter.

Learn more here

This project is in early access and still work in progress.

Focused on Creativity

NC Bench is a cutting-edge benchmark for creativity in LLMs, with focus on creative writing, instruction following, utility, tooling, and language skills.

AI as an Assistant

We test how well models enhance the writing process through text manipulation, idea generation, summarization, and translation.

Comprehensive Testing

From generating quality prose to hallucination-free extraction, NC Bench puts AI models through their paces in all aspects of writing assistance.

Benchmark Overview

Tests

10

Scenarios

64

Models

57

Results

40,715

Samples per Run

9.41 runs/scen.

Category Distribution

Shows the number of scenarios in each category. Some scenarios may be in multiple categories.

Creative writing (7)
Rule following (31)
Utility (32)
Mathematics (1)
Tooling (15)
Language (9)
Logic (16)

Top Results

Creative writing

83.95%GPT-5
81.05%GPT-5 Mini
78.16%o4 Mini High

Rule following

93.82%GPT-5
93.20%GPT-5 Mini
93.09%o4 Mini High

Mathematics

100.00%Claude 3.5 Sonnet
100.00%Claude 3.7 Sonnet
100.00%Hermes 3 70B