Run: 2026-02-16-01KHM5TT2SZ8BWTP8E853NYGZ2
Run Details
- Model:
- DeepSeek V3.1
- Test:
- N-Length Sentences
- Scenario:
- Write sentences with 5 words each
- Duration:
- 7.4s
- Cost:
- $0.000109
- Provider:
- SiliconFlow
Evaluation summary
| Score | Evaluator | Details Link |
|---|---|---|
| 97.05% | Matches word count | (details) |
Model input
Model response
Evaluation details
| Result | Evaluator | Details | Meta Data | ||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 97.05% | Matches word count | n/a |
|