nothingiisreal/mn-celeste-12b
Mistral Nemo 12B Celeste via OpenRouter
Release Date
Aug 2nd, 2024Parameters
12BContext Size
32kCreative writing
17.56%Rule following
24.89%Utility
46.00%Mathematics
50.00%Tooling
39.62%Language
56.75%Logic
60.62%Data extraction
Extract key details from a given block of text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 50% | 50% | 0% | 80% | |
| 100% | 100% | 100% | 100% | 50% | 50% | 50% | 50% | 50% | 50% | 70% | |
| 100% | 100% | 50% | 50% | 50% | 50% | 50% | 0% | 0% | 0% | 45% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 50% | 50% | 90% | |
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 50% | 95% | |
| 100% | 100% | 100% | 100% | 100% | 50% | 50% | 50% | 50% | 50% | 75% | |
| 100% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 0% | 50% | |
| 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 40% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 0% | 45% | |
| 100% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 55% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% | |
| 61.25% | |||||||||||
Dialogue tags
Various tasks related to dialogue tags in text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 14% | 1% | 1% | 0% | 0% | 0% | 0% | 0% | 22% | |
| 50% | 49% | 49% | 48% | 44% | 43% | 34% | 0% | 0% | 0% | 32% | |
| 50% | 48% | 42% | 41% | 8% | 7% | 6% | 3% | 0% | 0% | 21% | |
| 95% | 57% | 50% | 49% | 48% | 34% | 0% | 0% | 0% | 0% | 33% | |
| 26% | 26% | 1% | 1% | 0% | 0% | 0% | 0% | 0% | 0% | 5% | |
| 31% | 23% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 5% | |
| 48% | 2% | 1% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 5% | |
| 17.56% | |||||||||||
Language Comprehension
Does the model understand more than just English?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 0% | 80% | |
| 100% | 100% | 100% | 0% | 0% | 60% | |
| 100% | 0% | 0% | 0% | 0% | 20% | |
| 65.00% | ||||||
Language Writing
Can the model generate text in different languages?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| 91% | 91% | 50% | 33% | 0% | 53% | |
| 100% | 83% | 71% | 67% | 25% | 69% | |
| 50% | 44% | 44% | 42% | 0% | 36% | |
| 71% | 54% | 50% | 0% | 0% | 35% | |
| 83% | 63% | 57% | 50% | 33% | 57% | |
| 50.15% | ||||||
Novel outline
Handle questions about the outline of a novel in various formats
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 40% | |
| 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 30% | |
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 50% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 5% | |
| 100% | 50% | 50% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% | |
| 42.08% | |||||||||||
Tool usage within Novelcrafter
Output messages that are related to tool usage within Novelcrafter
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 10.00% | |||||||||||
N-Length Sentences
Write sentences with exactly N words
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 68% | 54% | 39% | 21% | 15% | 7% | 1% | 0% | 0% | 0% | 21% | |
| 34% | 31% | 28% | 27% | 23% | 22% | 19% | 19% | 19% | 15% | 24% | |
| 12% | 12% | 11% | 9% | 0% | 0% | 0% | 0% | 0% | 0% | 4% | |
| 16.16% | |||||||||||
Voice/dialogue sheets
Extract dialogue from given text as voice sheets.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 30% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 70% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% | |
| 26.00% | |||||||||||
Write N of X
Write exactly N words/sentences/paragraphs...
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 9% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 21% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 100% | 100% | 100% | 98% | 98% | 98% | 98% | 92% | 0% | 89% | |
| 100% | 100% | 100% | 100% | 100% | 98% | 92% | 92% | 27% | 0% | 81% | |
| 100% | 98% | 54% | 54% | 27% | 27% | 27% | 27% | 0% | 0% | 41% | |
| 100% | 98% | 98% | 9% | 0% | 0% | 0% | 0% | 0% | 0% | 31% | |
| 100% | 92% | 92% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 28% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 70% | |
| 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | |
| 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 30% | |
| 30.84% | |||||||||||