microsoft/wizardlm-2-8x22b
WizardLM 2 8x22b via OpenRouter
Release Date
Apr 15th, 2024Parameters
8x22BContext Size
65kCreative writing
8.45%Rule following
39.41%Utility
62.06%Mathematics
50.00%Tooling
59.36%Language
73.07%Logic
63.13%Data extraction
Extract key details from a given block of text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 50% | 50% | 50% | 50% | 50% | 50% | 70% | |
| 100% | 100% | 100% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 65% | |
| 100% | 100% | 100% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 65% | |
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 50% | 95% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | |
| 100% | 100% | 100% | 100% | 100% | 50% | 50% | 50% | 50% | 50% | 75% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 73.33% | |||||||||||
Dialogue tags
Various tasks related to dialogue tags in text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 61% | 14% | 1% | 1% | 0% | 0% | 0% | 0% | 0% | 0% | 8% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 40% | 28% | 21% | 20% | 8% | 3% | 1% | 0% | 0% | 0% | 12% | |
| 50% | 44% | 6% | 4% | 1% | 0% | 0% | 0% | 0% | 0% | 11% | |
| 49% | 2% | 2% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 5% | |
| 50% | 41% | 33% | 21% | 3% | 3% | 0% | 0% | 0% | 0% | 15% | |
| 49% | 19% | 13% | 3% | 0% | 0% | 0% | 0% | 0% | 0% | 8% | |
| 8.45% | |||||||||||
Language Comprehension
Does the model understand more than just English?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 0% | 80% | |
| 95.00% | ||||||
Language Writing
Can the model generate text in different languages?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| 92% | 80% | 79% | 67% | 0% | 63% | |
| 90% | 83% | 71% | 0% | 0% | 49% | |
| 100% | 63% | 56% | 0% | 0% | 44% | |
| 100% | 100% | 75% | 56% | 0% | 66% | |
| 93% | 64% | 63% | 57% | 0% | 55% | |
| 55.53% | ||||||
Novel outline
Handle questions about the outline of a novel in various formats
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% | |
| 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 40% | |
| 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% | |
| 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 40% | |
| 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% | |
| 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% | |
| 100% | 100% | 100% | 50% | 50% | 50% | 50% | 0% | 0% | 0% | 50% | |
| 100% | 100% | 100% | 100% | 50% | 50% | 50% | 50% | 50% | 0% | 65% | |
| 56.25% | |||||||||||
Tool usage within Novelcrafter
Output messages that are related to tool usage within Novelcrafter
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 67% | 97% | |
| 96.67% | |||||||||||
N-Length Sentences
Write sentences with exactly N words
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 91% | 89% | 89% | 85% | 81% | 68% | 64% | 50% | 23% | 10% | 65% | |
| 77% | 74% | 66% | 66% | 59% | 57% | 54% | 52% | 46% | 19% | 57% | |
| 41% | 21% | 19% | 5% | 5% | 1% | 1% | 0% | 0% | 0% | 9% | |
| 43.81% | |||||||||||
Voice/dialogue sheets
Extract dialogue from given text as voice sheets.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 42.00% | |||||||||||
Write N of X
Write exactly N words/sentences/paragraphs...
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 98% | 98% | 77% | 27% | 0% | 0% | 0% | 60% | |
| 98% | 92% | 92% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 28% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 98% | 98% | 98% | 98% | 98% | 77% | 9% | 88% | |
| 100% | 100% | 98% | 98% | 92% | 92% | 77% | 27% | 9% | 2% | 70% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 70% | |
| 55.06% | |||||||||||