deepseek/deepseek-chat

DeepSeek-V2 Chat

Release Date

May 6th, 2024

Parameters

Context Size

128k

Creative writing

41.57%

Rule following

61.95%

Utility

77.00%

Mathematics

100.00%

Tooling

66.15%

Language

96.28%

Logic

81.25%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Creative writingRule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Creative writingRule following
50%49%47%26%26%18%5%0%0%0%22%
0-shot Creative writingRule following
50%50%50%50%50%50%48%48%43%0%44%
0-shot Creative writingRule following
99%92%92%87%83%72%50%50%49%49%72%
0-shot Creative writingRule following
49%26%2%1%0%0%0%0%0%0%8%
0-shot Creative writingRule following
45%3%0%0%0%0%0%0%0%0%5%
0-shot Creative writingRule following
89%84%50%44%44%40%19%17%11%5%40%
41.57%

Language Comprehension

Does the model understand more than just English?

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Total
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%100%100%100%100%100%
100.00%

Language Writing

Can the model generate text in different languages?

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Total
0-shot Language
100%100%100%83%80%93%
0-shot Language
100%100%100%90%90%96%
0-shot Language
100%82%80%80%80%84%
0-shot Language
100%100%100%88%80%94%
0-shot Language
100%100%100%100%100%100%
93.31%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot ToolingUtility
100%100%100%100%100%100%100%100%100%100%100%
100.00%

N-Length Sentences

Write sentences with exactly N words

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Rule following
100%93%91%89%89%82%82%80%78%72%86%
0-shot Rule following
25%23%18%18%15%13%10%8%7%4%14%
0-shot Rule following
0%0%0%0%0%0%0%0%0%0%0%
33.25%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
1-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
Few-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
0-shot Utility
0%0%0%0%0%0%0%0%0%0%0%
0-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
80.00%