microsoft/phi-3.5-mini-128k-instruct
Phi-3.5 Mini 128k via OpenRouter
Release Date: Aug 17th, 2024
Parameters: 3.8B
Context Size: 128k
Creative writing: 18.74%
Rule following: 44.81%
Utility: 61.44%
Mathematics: 80.00%
Tooling: 55.64%
Language: 59.00%
Logic: 88.75%

Data extraction
Extract key details from a given block of text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 50% | 50% | 50% | 50% | 80% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| Overall | | | | | | | | | | | 92.50% |
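
The page does not say how the Total column or the section score is computed. The figures in the Data extraction table are consistent with a plain arithmetic mean at both levels, so the following is a minimal sketch under that assumption. The function names `scenario_total` and `section_score` are illustrative and do not come from the benchmark.

```python
# Assumption: each scenario total is the mean of its runs, and the section
# score is the mean of the scenario totals. The source does not confirm this.

def scenario_total(run_scores):
    """Mean of one scenario's per-run scores, in percent."""
    return sum(run_scores) / len(run_scores)

def section_score(totals):
    """Mean of the per-scenario totals, in percent."""
    return sum(totals) / len(totals)

# Fifth row of the Data extraction table: eight runs at 100%, two at 0%.
fifth_row_runs = [100, 100, 100, 100, 100, 100, 100, 100, 0, 0]

# Total column of the Data extraction table, top to bottom.
data_extraction_totals = [100, 100, 100, 100, 80, 100, 100, 80, 100, 50, 100, 100]

print(f"{scenario_total(fifth_row_runs):.0f}%")         # 80%
print(f"{section_score(data_extraction_totals):.2f}%")  # 92.50%
```

In the other tables the Total column is shown rounded to whole percentages, so recomputing a section score from the displayed totals can differ by a few hundredths of a point. The displayed Dialogue tags totals, for example, average 18.71%, while the page reports 18.74%.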
Dialogue tags
Various tasks related to dialogue tags in text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 61% | 61% | 14% | 14% | 14% | 14% | 14% | 1% | 1% | 29% |
| | 49% | 45% | 1% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% |
| | 93% | 50% | 47% | 33% | 2% | 0% | 0% | 0% | 0% | 0% | 22% |
| | 91% | 50% | 49% | 49% | 48% | 45% | 38% | 0% | 0% | 0% | 37% |
| | 44% | 43% | 26% | 10% | 1% | 0% | 0% | 0% | 0% | 0% | 12% |
| | 46% | 22% | 21% | 14% | 2% | 2% | 0% | 0% | 0% | 0% | 11% |
| | 49% | 38% | 9% | 3% | 1% | 0% | 0% | 0% | 0% | 0% | 10% |
| Overall | | | | | | | | | | | 18.74% |
Language Comprehension
Does the model understand more than just English?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 0% | 0% | 0% | 40% |
| | 0% | 0% | 0% | 0% | 0% | 0% |
| | 0% | 0% | 0% | 0% | 0% | 0% |
| Overall | | | | | | 35.00% |
Language Writing
Can the model generate text in different languages?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| | 100% | 100% | 60% | 50% | 50% | 72% |
| | 100% | 100% | 100% | 100% | 63% | 93% |
| | 100% | 100% | 80% | 75% | 38% | 78% |
| | 100% | 100% | 50% | 50% | 40% | 68% |
| | 100% | 100% | 100% | 50% | 50% | 80% |
| Overall | | | | | | 78.20% |
Novel outline
Handle questions about the outline of a novel in various formats
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 40% |
| | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 30% |
| | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 40% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 60% |
| | 100% | 100% | 100% | 50% | 50% | 0% | 0% | 0% | 0% | 0% | 40% |
| | 100% | 50% | 50% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 20% |
| Overall | | | | | | | | | | | 52.50% |
Tool usage within Novelcrafter
Output messages that are related to tool usage within Novelcrafter
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 67% | 67% | 93% |
| Overall | | | | | | | | | | | 93.33% |
N-Length Sentences
Write sentences with exactly N words
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 91% | 89% | 89% | 86% | 84% | 79% | 65% | 61% | 59% | 57% | 76% |
| | 76% | 73% | 70% | 69% | 65% | 63% | 56% | 55% | 49% | 26% | 60% |
| | 15% | 14% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 3% |
| Overall | | | | | | | | | | | 46.41% |
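
The page does not describe how outputs are scored for this test. Purely as an illustration of the task, a check for "exactly N words" could look like the sketch below. The whitespace-based word definition, the fraction-based score, and the names `has_exactly_n_words` and `fraction_correct` are all assumptions, not the benchmark's actual method.

```python
def has_exactly_n_words(sentence: str, n: int) -> bool:
    """True if the sentence has exactly n whitespace-separated tokens.

    Assumption: a "word" is any run of non-whitespace characters, so
    punctuation stays attached to the word it follows.
    """
    return len(sentence.split()) == n

def fraction_correct(sentences: list[str], n: int) -> float:
    """Share of sentences that hit the target length (hypothetical scoring)."""
    if not sentences:
        return 0.0
    hits = sum(has_exactly_n_words(s, n) for s in sentences)
    return hits / len(sentences)

sample = ["The cat sat on the mat.", "Rain fell."]
print(fraction_correct(sample, 6))  # 0.5
```

A stricter checker might treat hyphenated words or contractions differently, which would change the counts for borderline sentences.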
Voice/dialogue sheets
Extract dialogue from given text as voice sheets.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 100% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 10% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| Overall | | | | | | | | | | | 2.00% |
Write N of X
Write exactly N words/sentences/paragraphs...
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 100% | 100% | 100% | 100% | 100% | 98% | 77% | 77% | 0% | 0% | 75% |
| | 100% | 100% | 100% | 92% | 92% | 92% | 54% | 54% | 9% | 0% | 69% |
| | 100% | 100% | 100% | 92% | 0% | 0% | 0% | 0% | 0% | 0% | 39% |
| | 92% | 54% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 15% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 98% | 92% | 92% | 77% | 77% | 54% | 89% |
| | 100% | 100% | 27% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 23% |
| | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 70% |
| | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% |
| Overall | | | | | | | | | | | 58.48% |