Run: 2025-11-18-01KACPBE16PX83S02EMB7H5RYF

Run Details

Scenario: Simple
Duration: 8.5s
Cost: $0.009511
Provider: Google AI Studio

Evaluation summary

Score | Evaluator | Details Link
Pass | Matches Regex | (details)

Model input

System
You are an editor for novels. Whenever given text, you will extract the contained dialogue as a voice sheet in the following format: Speaker: "Dialogue"
User/Human
“What a curious feeling!” said Alice; “I must be shutting up like a telescope.” “Not like cats!” cried the Mouse, in a shrill, passionate voice. “Would you like cats if you were me?” “Come, there’s no use in crying like that!” said Alice to herself, rather sharply; “I advise you to leave off this minute!”

Model response

AI
Alice: "What a curious feeling! I must be shutting up like a telescope.” Mouse: "Not like cats! Would you like cats if you were me?” Alice: "Come, there’s no use in crying like that! I advise you to leave off this minute!”
created_at: "2025-11-18T23:56:20.69481+00:00"
model: "google/gemini-2.5-pro"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2224
moderation_latency: (null)
generation_time: 8414
tokens_prompt: 117
tokens_completion: 452
native_tokens_prompt: 113
native_tokens_completion: 937
native_tokens_completion_images: 0
native_tokens_reasoning: 882
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.00951125
api_type: "completions"
id: "gen-1763510171-TC5ga6ewSHcUMzQMRSau"
upstream_id: (null)
total_cost: 0.00951125
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
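
The usage and total_cost values are consistent with plain per-token billing on the native token counts. A sketch of that arithmetic follows, assuming rates of $1.25 per million prompt tokens and $10.00 per million completion tokens; these rates are not part of the run record and are an assumption, as is the reading that the 882 reasoning tokens are billed inside native_tokens_completion. The function name `estimate_cost` is illustrative.

```python
from decimal import Decimal

# ASSUMED rates in USD per million tokens; not stated anywhere in the run record.
PROMPT_USD_PER_MTOK = Decimal("1.25")
COMPLETION_USD_PER_MTOK = Decimal("10.00")

# Values copied from the run record.
NATIVE_TOKENS_PROMPT = 113
NATIVE_TOKENS_COMPLETION = 937  # assumed to include the 882 reasoning tokens
RECORDED_TOTAL_COST = Decimal("0.00951125")


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the cost in USD from native token counts and the assumed rates."""
    million = Decimal(1_000_000)
    prompt_cost = PROMPT_USD_PER_MTOK * prompt_tokens / million
    completion_cost = COMPLETION_USD_PER_MTOK * completion_tokens / million
    return prompt_cost + completion_cost


estimated = estimate_cost(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION)
print(estimated, estimated == RECORDED_TOTAL_COST)
```

Under these assumptions the completion side accounts for almost all of the cost, and most of that comes from reasoning tokens rather than the visible response.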

Evaluation details

Result | Evaluator | Details | Meta Data
Pass | Matches Regex | /Alice: ["“”]What a curious feeling! I must be shutting up like a telescope\.["“”]\s+Mouse: ["“”]Not like cats! Would you like cats if you were me\?["“”]\s+Alice: ["“”]Come, there['‘’]s no use in crying like that! I advise you to leave off this minute!["“”]/ | n/a
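
The regex check can be reproduced offline against the recorded response. This is a minimal sketch in Python, assuming the evaluator applies the pattern as an unanchored search (the way a JavaScript regex literal behaves with `test`); the harness's actual implementation is not shown in this run, and the names `PATTERN`, `RESPONSE`, and `matches_regex` are illustrative.

```python
import re

# Pattern copied from the evaluation details, without the surrounding slashes.
PATTERN = re.compile(
    r'Alice: ["“”]What a curious feeling! I must be shutting up like a telescope\.["“”]\s+'
    r'Mouse: ["“”]Not like cats! Would you like cats if you were me\?["“”]\s+'
    r"""Alice: ["“”]Come, there['‘’]s no use in crying like that! I advise you to leave off this minute!["“”]"""
)

# Model response copied from the run (straight opening quotes, curly closing quotes).
RESPONSE = (
    'Alice: "What a curious feeling! I must be shutting up like a telescope.” '
    'Mouse: "Not like cats! Would you like cats if you were me?” '
    'Alice: "Come, there’s no use in crying like that! I advise you to leave off this minute!”'
)


def matches_regex(text: str) -> bool:
    """Return True if the pattern occurs anywhere in the text (unanchored search)."""
    return PATTERN.search(text) is not None


print(matches_regex(RESPONSE))
```

The character classes accept straight or curly quotation marks in every position, which is why the response passes even though its opening and closing quotes are of different kinds.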