qwen/qwen3.5-plus-02-15

Qwen 3.5 Plus (2026-02-15)

Release Date

Feb 15th, 2026

Parameters

Context Size

1m

Benchmark Cost

$1.11

Speed

21.4 tok/s

Creative writing

30.51%

Rule following

70.52%

Utility

88.71%

Mathematics

100.00%

Tooling

87.92%

Language

92.55%

Logic

89.30%

Codex Violation Detection

Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
matrix
0-shot ToolingUtilityLogicRule following
97%93%92%91%89%89%87%77%61%3%78%
0-shot ToolingUtilityLogicRule following
96%96%91%91%91%91%89%89%82%80%90%
0-shot ToolingUtilityLogicRule following
95%95%95%95%95%95%95%95%95%95%95%
0-shot ToolingUtilityLogicRule following
100%100%100%95%95%95%95%95%95%95%97%
tiers
0-shot ToolingUtilityLogicRule following
100%100%100%100%100%100%100%100%100%66%97%
0-shot ToolingUtilityLogicRule following
100%100%100%100%100%100%86%86%86%86%94%
0-shot ToolingUtilityLogicRule following
92%92%92%89%88%88%85%85%85%79%87%
0-shot ToolingUtilityLogicRule following
100%100%92%92%92%92%92%87%83%83%91%
91.10%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
dialogue-200
0-shot Creative writingRule following
72%70%68%63%62%52%44%36%19%18%50%
0-shot Creative writingRule following
100%52%50%39%22%14%10%2%0%0%29%
0-shot Creative writingRule following
82%80%76%56%49%48%32%19%18%18%48%
dialogue-500
0-shot Creative writingRule following
26%17%0%0%0%0%0%0%0%0%4%
0-shot Creative writingRule following
4%0%0%0%0%0%0%0%0%0%0%
0-shot Creative writingRule following
3%0%0%0%0%0%0%0%0%0%0%
Ungrouped
0-shot Creative writingRule following
100%100%100%100%100%100%100%100%14%1%81%
30.51%

Language Comprehension

Does the model understand more than just English?

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Total
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%100%100%100%100%100%
100.00%

Language Writing

Can the model generate text in different languages?

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Total
0-shot Language
100%93%86%81%72%86%
0-shot Language
100%100%100%92%92%97%
0-shot Language
100%100%54%50%46%70%
0-shot Language
100%100%100%100%100%100%
0-shot Language
93%92%82%75%56%80%
86.59%

N-Length Sentences

Write sentences with exactly N words

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Rule following
100%91%82%78%78%78%78%78%78%78%82%
0-shot Rule following
100%84%84%74%72%72%70%65%52%35%71%
0-shot Rule following
23%12%3%3%2%0%0%0%0%0%4%
52.40%

Novel outline

Handle questions about the outline of a novel in various formats

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
outline-count
0-shot ToolingUtility
100%100%100%100%100%100%100%100%100%100%100%
0-shot ToolingUtility
100%100%100%100%100%100%100%100%100%100%100%
0-shot ToolingUtility
100%0%0%0%0%0%0%0%0%0%10%
pov-count
0-shot ToolingUtility
100%100%100%100%100%100%100%100%100%100%100%
0-shot ToolingUtility
100%100%100%100%100%100%100%100%0%0%80%
0-shot ToolingUtility
100%100%100%100%100%100%100%100%100%100%100%
81.67%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot ToolingUtility
100%100%100%100%100%100%100%100%100%100%100%
100.00%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
0-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
1-shot Utility
100%100%100%100%100%100%100%100%100%0%90%
Few-shot Utility
100%100%100%100%100%100%100%100%0%0%80%
0-shot Utility
100%100%100%100%100%100%0%0%0%0%60%
86.00%

Write N of X

Write exactly N words/sentences/paragraphs...

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
paragraphs
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
sentences
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
0-shot Rule following
100%100%100%100%100%100%100%100%100%100%100%
words
0-shot Rule following
100%100%100%100%100%100%100%100%100%98%100%
0-shot Rule following
100%100%100%100%100%100%98%98%98%98%99%
0-shot Rule following
100%100%100%100%98%98%92%92%27%9%82%
0-shot Rule following
54%2%0%0%0%0%0%0%0%0%6%
0-shot Rule following
0%0%0%0%0%0%0%0%0%0%0%
83.57%