ministral-3b-2410
Ministral 3B via mistral
Release Date: Oct 16th, 2024
Parameters: –
Context Size: 128k
Creative writing: 18.62%
Rule following: 29.95%
Utility: 53.50%
Mathematics: 40.00%
Tooling: 52.31%
Language: 41.23%
Logic: 65.63%

Data extraction
Extract key details from a given block of text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 50% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 15% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% |
| | 100% | 50% | 50% | 50% | 50% | 50% | 50% | 0% | 0% | 0% | 40% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% |
| | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% |
| Overall | | | | | | | | | | | 72.92% |
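
The page does not state how the Total column or the section figure is computed. The numbers in the Data extraction table are, however, consistent with a simple reading: each scenario total is the mean of its ten runs, and the section figure is the mean of the scenario totals. The sketch below checks that reading against the table; the function names `scenario_total` and `section_score` are illustrative and not part of the benchmark.

```python
from statistics import mean

# Per-run scores (percent) copied from the Data extraction table,
# one list per scenario row, in table order.
runs = [
    [100] * 10,
    [100] * 10,
    [100, 50] + [0] * 8,
    [100] * 10,
    [100, 100] + [0] * 8,
    [100] * 10,
    [100] * 8 + [0, 0],
    [100] + [50] * 6 + [0] * 3,
    [100] * 9 + [0],
    [50] * 10,
    [100] * 10,
    [100] * 8 + [0, 0],
]


def scenario_total(run_scores):
    """Assumed rule: a scenario's total is the mean of its runs."""
    return mean(run_scores)


def section_score(scenarios):
    """Assumed rule: the section figure is the mean of the scenario totals."""
    return mean([scenario_total(r) for r in scenarios])


totals = [scenario_total(r) for r in runs]
print(totals)                          # per-scenario totals
print(f"{section_score(runs):.2f}%")   # section figure
```

For this table the computed values match the published Total column and the 72.92% section figure. In other sections the published figure can differ slightly from a mean of the displayed totals, presumably because the displayed percentages are rounded.
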
Dialogue tags
Various tasks related to dialogue tags in text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 61% | 61% | 14% | 1% | 1% | 1% | 0% | 0% | 0% | 24% |
| | 45% | 18% | 18% | 18% | 3% | 0% | 0% | 0% | 0% | 0% | 10% |
| | 74% | 50% | 50% | 48% | 46% | 26% | 18% | 0% | 0% | 0% | 31% |
| | 84% | 50% | 46% | 30% | 30% | 21% | 18% | 15% | 0% | 0% | 29% |
| | 67% | 3% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 7% |
| | 48% | 48% | 16% | 3% | 3% | 1% | 0% | 0% | 0% | 0% | 12% |
| | 67% | 43% | 24% | 18% | 15% | 0% | 0% | 0% | 0% | 0% | 17% |
| Overall | | | | | | | | | | | 18.62% |
Language Comprehension
Does the model understand more than just English?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| | 100% | 0% | 0% | 0% | 0% | 20% |
| | 100% | 0% | 0% | 0% | 0% | 20% |
| | 100% | 0% | 0% | 0% | 0% | 20% |
| | 100% | 100% | 0% | 0% | 0% | 40% |
| Overall | | | | | | 25.00% |
Language Writing
Can the model generate text in different languages?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| | 86% | 55% | 50% | 43% | 23% | 51% |
| | 100% | 94% | 71% | 50% | 0% | 63% |
| | 69% | 59% | 56% | 42% | 0% | 45% |
| | 79% | 71% | 57% | 54% | 44% | 61% |
| | 53% | 50% | 50% | 50% | 50% | 51% |
| Overall | | | | | | 54.22% |
Novel outline
Handle questions about the outline of a novel in various formats
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% |
| | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 30% |
| | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 40% |
| | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 40% |
| | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 30% |
| | 100% | 100% | 50% | 50% | 0% | 0% | 0% | 0% | 0% | 0% | 30% |
| | 100% | 100% | 50% | 50% | 50% | 50% | 0% | 0% | 0% | 0% | 40% |
| Overall | | | | | | | | | | | 50.00% |
Tool usage within Novelcrafter
Output messages that are related to tool usage within Novelcrafter
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% |
| Overall | | | | | | | | | | | 80.00% |
N-Length Sentences
Write sentences with exactly N words
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 71% | 64% | 64% | 64% | 52% | 52% | 40% | 37% | 20% | 0% | 46% |
| | 39% | 32% | 31% | 30% | 30% | 27% | 26% | 21% | 7% | 6% | 25% |
| | 32% | 19% | 15% | 11% | 4% | 0% | 0% | 0% | 0% | 0% | 8% |
| Overall | | | | | | | | | | | 26.50% |
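
How this benchmark grades an "exactly N words" sentence is not described on this page, and the fractional run scores suggest some form of partial credit that is not reproduced here. As a rough illustration of the constraint itself, the sketch below counts whitespace-separated tokens that contain at least one letter or digit; both helper names and the tokenization rule are assumptions.

```python
import re


def word_count(sentence: str) -> int:
    """Count whitespace-separated tokens containing a letter or digit.

    Assumption: stand-alone punctuation (e.g. a lone dash) is not a word,
    and hyphenated or apostrophized tokens count as a single word.
    """
    return sum(1 for token in sentence.split() if re.search(r"\w", token))


def has_exactly_n_words(sentence: str, n: int) -> bool:
    """True when the sentence meets the 'exactly N words' constraint."""
    return word_count(sentence) == n


print(has_exactly_n_words("The cat sat on the mat.", 6))
print(has_exactly_n_words("Well - that's odd", 3))
```

A different tokenization rule (for example, splitting on hyphens) would change the count for some sentences, so results from a check like this are only comparable when the rule is fixed in advance.
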
Voice/dialogue sheets
Extract dialogue from given text as voice sheets.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% |
| | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% |
| | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| Overall | | | | | | | | | | | 10.00% |
Write N of X
Write exactly N words/sentences/paragraphs...
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 98% | 98% | 92% | 27% | 9% | 2% | 0% | 0% | 0% | 0% | 33% |
| | 92% | 77% | 54% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 22% |
| | 27% | 2% | 2% | 2% | 0% | 0% | 0% | 0% | 0% | 0% | 3% |
| | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 100% |
| | 100% | 100% | 98% | 98% | 92% | 77% | 77% | 27% | 0% | 0% | 67% |
| | 100% | 98% | 77% | 27% | 27% | 9% | 0% | 0% | 0% | 0% | 34% |
| | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| Overall | | | | | | | | | | | 36.86% |