x-ai/grok-4.3

Grok 4.3 (Reasoning)

Release Date

Apr 30th, 2026

Context Size

2m

Reasoning

Yes

Benchmark Cost

$10.78

Speed

73.7 tok/s

Categories

Creative Writing: 85.1%
Tooling: 100.0%
Language: 97.5%
Utility: 92.9%
Reasoning: 95.5%
Text Editing: 97.6%
Rule Following: 82.8%
Hallucination: 97.4%

Subcategories

[Chart: per-subcategory scores on a 20% to 100% axis. Subcategories shown: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
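
To make one of these checks concrete, here is a minimal sketch of a filter-word detector. The source does not give the benchmark's actual word lists or detection rules, so the `FILTER_WORDS` set and the `filter_word_hits` function are hypothetical and cover only one of the listed anti-patterns.

```python
import re

# Illustrative word list only (an assumption); the benchmark's real lists
# are not given in the source.
FILTER_WORDS = {"saw", "heard", "felt", "noticed", "realized", "seemed"}


def filter_word_hits(text: str) -> list:
    """Return every filter word found in text, in order of appearance."""
    # Lowercase the text and split it into simple word tokens.
    words = re.findall(r"[a-z']+", text.lower())
    return [word for word in words if word in FILTER_WORDS]


if __name__ == "__main__":
    print(filter_word_hits("She felt the wind and noticed a light."))
```

A real detector would likely need lemmatization and context checks; plain word matching like this flags legitimate uses as well.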

Scenario                        #1    #2    #3    #4    #5   Total

Detailed Writing Rules
                                85    83    81    81    81     82%
                                87    85    84    83    82     84%
                                86    85    84    83    83     84%
                                86    85    83    82    82     83%
                                88    85    85    84    82     85%
                                87    86    85    78    77     83%
Detailed Writing Rules: 83.58%

genre
                                82    81    79    73    67     76%
                                83    81    79    79    76     80%
                                87    85    80    79    76     81%
                                87    85    81    78    76     81%
                                81    78    77    73    71     76%
                                85    83    75    74    66     77%
genre: 78.55%

Novelcrafter Default Prompt
                                87    86    86    83    79     84%
                                90    87    86    85    83     86%
                                90    86    85    85    85     86%
                                89    87    86    85    82     86%
                                89    86    84    84    82     85%
                                89    86    86    81    77     84%
Novelcrafter Default Prompt: 85.08%

Overall: 82.40%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
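
As an illustration of the "well-formed XML" requirement, the following sketch parses a response with Python's standard `xml.etree.ElementTree` and rejects anything that fails to parse. The source does not specify the benchmark's schema, so the element and attribute names (`codex`, `entry`, `type`, `name`) and the `parse_codex` helper are assumptions.

```python
from __future__ import annotations

import xml.etree.ElementTree as ET


def parse_codex(xml_text: str) -> list[dict[str, str]] | None:
    """Return codex entries as dicts, or None if the XML is not well-formed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Unclosed tags, stray text after the root, and similar errors land here.
        return None
    entries = []
    # Hypothetical schema: <codex><entry type="..."><name>...</name></entry></codex>
    for entry in root.iter("entry"):
        entries.append({
            "type": entry.get("type", ""),
            "name": (entry.findtext("name") or "").strip(),
        })
    return entries


if __name__ == "__main__":
    good = "<codex><entry type='character'><name>Mara</name></entry></codex>"
    print(parse_codex(good))
    print(parse_codex("<codex><entry>"))
```

Well-formedness is only the first gate. Whether the extracted entries match the prose is a separate check that this sketch does not attempt.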

Scenario                        #1    #2    #3    #4    #5   Total
                                99    98    98    96    96     97%
                                97    97    96    94    93     95%
                                98    98    98    97    96     97%
                                98    98    98    97    95     97%

Overall: 96.74%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
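
The idea of checking each expected change independently can be sketched as below. This is not the benchmark's actual scorer: the `ExpectedChange` structure, its field names, and the substring-based matching are assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class ExpectedChange:
    """One change the transformed text must show (hypothetical structure)."""
    must_contain: str = ""      # text that should appear after the edit
    must_not_contain: str = ""  # text that should be gone after the edit


def score_replacement(output: str, changes: list[ExpectedChange]) -> float:
    """Return the percentage of expected changes satisfied, each checked on its own."""
    if not changes:
        return 100.0
    passed = 0
    for change in changes:
        present_ok = change.must_contain in output if change.must_contain else True
        absent_ok = change.must_not_contain not in output if change.must_not_contain else True
        if present_ok and absent_ok:
            passed += 1
    return 100.0 * passed / len(changes)


if __name__ == "__main__":
    output = "Mara walked to Eastport. She did not look back."
    changes = [
        ExpectedChange(must_contain="Mara", must_not_contain="Anna"),
        ExpectedChange(must_contain="Eastport", must_not_contain="Westport"),
        ExpectedChange(must_contain="did not", must_not_contain="didn't"),
        ExpectedChange(must_contain="Tom", must_not_contain="Thomas"),
    ]
    print(score_replacement(output, changes))
```

Scoring each change separately means one missed rename lowers the score proportionally instead of failing the whole scenario, which fits the partial scores such as 83% and 79% in the table.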

Scenario                        #1    #2    #3    #4    #5    #6    #7   Total

Generic Prompt
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100   100   100    100%
                               100   100    99    99    98    97    96     99%
                               100   100   100   100   100    99    83     98%
                               100   100   100   100   100   100   100    100%
                               100   100   100   100    99    99    99    100%
                                97    97    96    95    95    95    79     93%
                               100   100   100   100   100   100   100    100%
                               100   100   100    99    89    88    88     95%
Generic Prompt: 98.21%

Specific Prompt
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100   100    97    100%
                               100    99    99    99    99    97    96     99%
                               100    99    99    99    99    99    99     99%
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100   100   100    100%
                                98    97    97    97    96    96    96     97%
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100   100    99    100%
Specific Prompt: 99.36%

Overall: 98.79%

Tool usage within Novelcrafter

Evaluates the output messages a model produces for tool usage within Novelcrafter.

Scenario                        #1    #2    #3    #4    #5    #6    #7    #8    #9   #10   Total
                               100   100   100   100   100   100   100   100   100   100    100%

Overall: 100.00%