Run: 2026-03-12-01KKHZGED3D8ZE829G8G6KVSX9

Run Details

Model: Grok 4.20 (Beta)
Test: Data extraction
Scenario: Future event time
Duration: 585ms
Cost: $0.000384
Provider: xAI

Evaluation summary

Score	Evaluator	Details Link
Fail	Matches text	(details)
Pass	Matches Regex	(details)
no eval	Overall

Model input

User/Human

What time does the future event start? Provide the answer in 24-hour format (HH:MM).

Yesterday's meeting was rescheduled from 2:30 PM to 4:15 PM due to a conflict. The next meeting will be held tomorrow at 9:00 AM, followed by a lunch break at 12:30 PM. The future planning session is set for next week, starting at 14:45.

Model response

14:45

The future event is the "future planning session" explicitly described as occurring "next week" (in the future relative to "yesterday" and "tomorrow" in the text). All other times refer to past or nearer-term events.

Evaluation details

Result	Evaluator	Details	Meta Data
Fail	Matches text	Case sensitive: 14:45	n/a
Pass	Matches Regex	/\b14:45\b/	n/a
no eval	Overall
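
The split result above (text match fails, regex passes) is consistent with the response containing the correct time plus an explanatory paragraph. The sketch below reproduces that outcome in Python. It assumes that "Matches text" compares the whole response to the expected string, which this report does not state. The function names matches_text and matches_regex are hypothetical and are not the evaluation harness's real API.

```python
import re

# Model response as shown in this run: the answer line, a blank line,
# then the explanatory paragraph.
response = (
    "14:45\n"
    "\n"
    'The future event is the "future planning session" explicitly described '
    'as occurring "next week" (in the future relative to "yesterday" and '
    '"tomorrow" in the text). All other times refer to past or nearer-term events.'
)


def matches_text(actual: str, expected: str) -> bool:
    """Assumed behavior: case-sensitive comparison of the entire response."""
    return actual == expected


def matches_regex(actual: str, pattern: str) -> bool:
    """Pass if the pattern occurs anywhere in the response."""
    return re.search(pattern, actual) is not None


text_result = matches_text(response, "14:45")
regex_result = matches_regex(response, r"\b14:45\b")

print("Matches text: ", "Pass" if text_result else "Fail")
print("Matches Regex:", "Pass" if regex_result else "Fail")
```

Under that assumption, the extra explanation alone is enough to fail the text match even though the extracted time is correct, while the regex only needs 14:45 to appear as a standalone token somewhere in the output.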