nvidia/llama-3.1-nemotron-70b-instruct

Llama 3.1 Nemotron 70B

Release Date

Oct 15th, 2024

Context Size

128k

Reasoning

No

Benchmark Cost

$2.28

Speed

29.4 tok/s

Categories

20%40%60%80%100%Creative Writing71.7%Tooling95.7%Language46.8%Utility88.3%Reasoning82.2%Text Editing87.3%Rule Following50.6%Hallucination75.0%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
767367656269%
767070665768%
817773726774%
777573736974%
777568686671%
827979737077%
Detailed Writing Rules71.92%
genre
837170696672%
737271696570%
817976757377%
837876727276%
737272716871%
797978767577%
genre73.92%
Novelcrafter Default Prompt
777070676470%
757473666571%
858178767679%
757473717173%
827874727275%
787168676570%
Novelcrafter Default Prompt72.96%
72.93%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
939292908891%
868483834176%
817978785975%
949490908891%
83.18%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100817994%
100100100100100100100100%
9189888785858587%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
8279777270676373%
9996959492857791%
9797979797979797%
Generic Prompt93.58%
Specific Prompt
100100100100100838395%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
10010010010010010099100%
8986858585837484%
100100100100100100100100%
100100100100100100100100%
Specific Prompt97.53%
95.55%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%