Run: 2026-02-16-01KHK34FANX0S78N87HGX0P5BP
Run Details
- Model:
- Qwen 3.5 Plus (2026-02-15)
- Test:
- N-Length Sentences
- Scenario:
- Write sentences with 10 words each
- Duration:
- 5.2s
- Cost:
- $0.000296
- Provider:
- Alibaba
Evaluation summary
| Score | Evaluator | Details Link |
|---|---|---|
| 72.28% | Matches word count | (details) |
Model input
Model response
Evaluation details
| Result | Evaluator | Details | Meta Data | ||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 72.28% | Matches word count | n/a |
|