meta-llama/llama-3.1-70b-instruct

Llama 3.1 70B

Release Date

Jul 23rd, 2024

Parameters

70B

Context Size

128k

Creative writing

40.65%

Rule following

65.35%

Utility

76.39%

Mathematics

100.00%

Tooling

70.90%

Language

74.29%

Logic

76.25%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Creative writingRule following
100%100%100%100%100%100%100%100%100%61%96%
0-shot Creative writingRule following
50%50%50%49%47%43%41%38%34%14%41%
0-shot Creative writingRule following
93%66%51%49%47%43%35%22%18%14%44%
0-shot Creative writingRule following
90%72%72%68%68%64%64%62%53%2%61%
0-shot Creative writingRule following
18%7%1%1%0%0%0%0%0%0%3%
0-shot Creative writingRule following
50%34%34%4%0%0%0%0%0%0%12%
0-shot Creative writingRule following
71%47%47%45%45%11%3%1%0%0%27%
40.65%

Language Comprehension

Does the model understand more than just English?

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Total
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%100%100%100%0%80%
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%0%0%0%0%20%
75.00%

Language Writing

Can the model generate text in different languages?

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Total
0-shot Language
100%75%75%71%55%75%
0-shot Language
94%77%75%73%69%78%
0-shot Language
100%80%71%67%56%75%
0-shot Language
100%100%79%50%50%76%
0-shot Language
100%60%57%56%54%65%
73.72%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot ToolingUtility
100%100%100%100%100%100%100%100%67%0%87%
86.67%

N-Length Sentences

Write sentences with exactly N words

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Rule following
100%100%100%100%100%100%100%100%100%98%100%
0-shot Rule following
100%100%100%100%97%97%97%95%87%87%96%
0-shot Rule following
74%72%65%61%60%59%51%46%43%38%57%
84.32%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Utility
100%100%100%100%100%100%100%0%0%0%70%
1-shot Utility
100%100%100%100%100%100%100%100%100%0%90%
Few-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
0-shot Utility
0%0%0%0%0%0%0%0%0%0%0%
0-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
72.00%

Write N of X

Write exactly N words/sentences/paragraphs...

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Rule following
98%98%98%92%92%92%92%77%77%54%87%
0-shot Rule following
77%2%2%0%0%0%0%0%0%0%8%
0-shot Rule following
100%100%100%100%100%92%77%77%77%54%88%
0-shot Rule following
100%100%98%98%98%92%77%77%54%2%80%
0-shot Rule following
0%0%0%0%0%0%0%0%0%0%0%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%92%99%
0-shot Rule following
100%100%98%92%92%92%77%54%27%2%73%
0-shot Rule following
100%100%100%0%0%0%0%0%0%0%30%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
74.27%