Run: 2024-07-13-01J2MT5W2SYJC76FGGN0337EJ1

Run Details

Model:: GPT-4o, May 13th (temp=0)
Test:: Data extraction
Scenario:: Highest-rated movie
Duration:: 7.2s
Cost:: $0.000665
Provider:: OpenAI

Evaluation summary

Score	Evaluator	Details Link
Pass	Matches text	(details)
Pass	Matches Regex	(details)
no eval	Overall

Model input

User/Human

What is the title of the highest-rated movie? Only return the movie title, preserving its original capitalization.

Movie Ratings:
- The Shawshank Redemption (1994): 9.3/10
- The Godfather (1972): 9.2/10
- 12 Angry Men (1957): 8.9/10
- Schindler's List (1993): 9.0/10
- The Lord of the Rings: The Return of the King (2003): 9.0/10

Model response

The Shawshank Redemption

Evaluation details

Result	Evaluator	Details	Meta Data
Pass	Matches text	Case sensitive: The Shawshank Redemption	n/a
Pass	Matches Regex	/\bThe Shawshank Redemption\b/	n/a
no eval

id	"gen-QvYmc1KVQfHXjwxfJf5G8YavmjqV"
total_cost	0.000665
upstream_id	"chatcmpl-9kLY7h02zbi0RP1oH4dKYK1WdtUGa"
created_at	"2024-07-13T01:09:13.583643+00:00" 7/13/2024, 1:09:13 AM
model	"openai/gpt-4o-2024-05-13"
app_id	182717
streamed	false
cancelled	false
provider_name	"OpenAI"
latency	396
moderation_latency	206
generation_time	0
finish_reason	"stop"
tokens_prompt	119
tokens_completion	5
native_tokens_prompt	118
native_tokens_completion	5
num_media_prompt	(null)
num_media_completion	(null)
origin	"https://ncbench.com/"
usage	0.000665