cohere/command-r-plus-08-2024

Cohere Command R+ (Aug. 2024)

Release Date

Aug 31st, 2024

Context Size

128k

Reasoning

No

Benchmark Cost

$11.09

Speed

39.3 tok/s

Categories

20%40%60%80%100%Creative Writing77.7%Tooling91.8%Language66.6%Utility59.5%Reasoning65.1%Text Editing68.4%Rule Following58.7%Hallucination64.4%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
767574716773%
807978757077%
827574747376%
858079767379%
898580807282%
848181807780%
Detailed Writing Rules77.59%
genre
797372706872%
807974736975%
807777747276%
848479757179%
818178787579%
817877767577%
genre76.31%
Novelcrafter Default Prompt
797873726974%
828181808081%
898482737380%
949288868389%
847979737277%
838277767178%
Novelcrafter Default Prompt79.91%
77.94%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
909087868688%
878582817983%
898483806079%
919190867687%
84.23%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
8181787878786777%
100100494740383358%
962725221811329%
5748333232291836%
9279715849474263%
7474747474746572%
4029272727232228%
9552494943403752%
9997969385686085%
Generic Prompt55.59%
Specific Prompt
9186838167666577%
100100100100100100100100%
9999999998989898%
1001001009999999899%
100100100100100100100100%
1001001001001009999100%
7876715453504661%
1001001001001001009399%
10099979386868292%
Specific Prompt91.86%
73.72%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100090%
90.00%