Run: 2026-04-25-01KQ2HWTNQWN2G7CJ70P1E10RY
Run Details
- Model: DeepSeek V4 Pro (Reasoning)
- Test: Data extraction
- Scenario: Fruits excluding citrus
- Duration: 5m 17s
- Cost: $0.035723
- Provider: Together
Evaluation summary
| Score | Evaluator | Details Link |
|---|---|---|
| Pass | Contains a list of texts | (details) |
Model input
Model response
Evaluation details
| Result | Evaluator | Details | Meta Data |
|---|---|---|---|
| Pass | Contains a list of texts | bananas, apples, grapes, kiwis, watermelon, cantaloupe | |