ai21/jamba-instruct
AI21 Jamba via OpenRouter
Release Date
Mar 28th, 2024Parameters
52B MoEContext Size
256kCreative writing
17.90%Rule following
26.26%Utility
33.11%Mathematics
45.00%Tooling
29.87%Language
70.57%Logic
41.88%Data extraction
Extract key details from a given block of text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 50% | 50% | 50% | 50% | 50% | 50% | 0% | 0% | 0% | 0% | 30% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 0% | 45% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | |
| 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 0% | 45% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 70% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 0% | 45% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 50.42% | |||||||||||
Dialogue tags
Various tasks related to dialogue tags in text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 14% | 14% | 14% | 1% | 1% | 1% | 0% | 0% | 0% | 0% | 4% | |
| 50% | 50% | 49% | 48% | 26% | 14% | 3% | 0% | 0% | 0% | 24% | |
| 65% | 50% | 49% | 37% | 26% | 22% | 21% | 13% | 6% | 0% | 29% | |
| 97% | 51% | 50% | 50% | 50% | 49% | 48% | 3% | 1% | 1% | 40% | |
| 50% | 50% | 44% | 1% | 1% | 0% | 0% | 0% | 0% | 0% | 15% | |
| 50% | 50% | 13% | 1% | 0% | 0% | 0% | 0% | 0% | 0% | 11% | |
| 12% | 9% | 1% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 2% | |
| 17.90% | |||||||||||
Language Comprehension
Does the model understand more than just English?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 0% | 0% | 0% | 40% | |
| 100% | 100% | 0% | 0% | 0% | 40% | |
| 70.00% | ||||||
Language Writing
Can the model generate text in different languages?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| 100% | 100% | 86% | 80% | 55% | 84% | |
| 85% | 83% | 62% | 50% | 50% | 66% | |
| 95% | 75% | 75% | 50% | 50% | 69% | |
| 100% | 100% | 77% | 58% | 44% | 76% | |
| 100% | 64% | 53% | 44% | 40% | 60% | |
| 71.03% | ||||||
Novel outline
Handle questions about the outline of a novel in various formats
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 50% | 50% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 50% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 5% | |
| 31.25% | |||||||||||
Tool usage within Novelcrafter
Output messages that are related to tool usage within Novelcrafter
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 33% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 13% | |
| 13.33% | |||||||||||
N-Length Sentences
Write sentences with exactly N words
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 91% | 78% | 72% | 64% | 59% | 32% | 2% | 1% | 0% | 0% | 40% | |
| 62% | 53% | 52% | 32% | 22% | 6% | 5% | 3% | 3% | 0% | 24% | |
| 15% | 5% | 3% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 2% | |
| 21.95% | |||||||||||
Voice/dialogue sheets
Extract dialogue from given text as voice sheets.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0.00% | |||||||||||
Write N of X
Write exactly N words/sentences/paragraphs...
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 98% | 77% | 2% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 18% | |
| 100% | 100% | 54% | 9% | 0% | 0% | 0% | 0% | 0% | 0% | 26% | |
| 77% | 54% | 2% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 13% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 92% | 77% | 54% | 54% | 54% | 54% | 27% | 27% | 9% | 9% | 46% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 31.76% | |||||||||||