Run: 2026-04-24-01KQ057KQWNCWPHV7ACNVPEDEX

Run Details

Duration: 2.8s
Cost: $0.000376
Provider: Amazon Bedrock

Evaluation summary

Score: Fail
Evaluator: Matches Regex
Details Link: (details)

Model input

System
You are an editor for novels. Whenever given text, you will extract the contained dialogue as a voice sheet in the following format: Speaker: "Dialogue"
User/Human
Sarah leaned back in her chair, her eyes sparkling with excitement. "Oh, it was interesting, to say the least. I tried skydiving for the first time." "Wow! That's amazing," Mark exclaimed, his coffee forgotten. "How was it?"
AI
Sarah: "Oh, it was interesting, to say the least. I tried skydiving for the first time." Mark: "Wow! That's amazing. How was it?"
User/Human
“What a curious feeling!” said Alice; “I must be shutting up like a telescope.” “Not like cats!” cried the Mouse, in a shrill, passionate voice. “Would you like cats if you were me?” “Come, there’s no use in crying like that!” said Alice to herself, rather sharply; “I advise you to leave off this minute!”

Model response

AI
Alice: "What a curious feeling! I must be shutting up like a telescope. Come, there's no use in crying like that! I advise you to leave off this minute!" Mouse: "Not like cats! Would you like cats if you were me?"
created_at: "2026-04-24T16:30:11.211Z" (4/24/2026, 4:30:11 PM)
model: "openai/gpt-oss-120b"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1752
moderation_latency: (null)
generation_time: 1788
tokens_prompt: 227
tokens_completion: 653
native_tokens_prompt: 276
native_tokens_completion: 558
native_tokens_completion_images: (null)
native_tokens_reasoning: 599
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
usage: 0.0003762
router: (null)
provider_responses:
  0:
    endpoint_id: "909fdbc8-1b02-46ea-a9fc-cc3cfe65d992"
    is_byok: false
    latency: 175
    model_permaslug: "openai/gpt-oss-120b"
    provider_name: "Phala"
    status: 429
  1:
    endpoint_id: "b3f80c05-e31a-44f6-b63f-414467b87553"
    id: "1d3feffc-cd24-4cfa-aba8-7ef45c8fdad4"
    is_byok: false
    latency: 304
    model_permaslug: "openai/gpt-oss-120b"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777048211-2TBuA2Avtj2KScGDBmDu"
session_id: (null)
api_type: "completions"
id: "gen-1777048211-VGaERsYJGd7GDmVruCA7"
upstream_id: "1d3feffc-cd24-4cfa-aba8-7ef45c8fdad4"
total_cost: 0.0003762
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
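
The provider_responses entries above record two attempts: a 429 from Phala, then a 200 from Amazon Bedrock, which matches the Provider shown in Run Details. A minimal Python sketch of reading that list follows. The list-of-dicts shape and the helper name serving_provider are assumptions made for illustration, not part of the logged payload or of any router API.

```python
from typing import Optional

# Values copied from this run's provider_responses entries.
# The list-of-dicts representation is an assumption for illustration.
provider_responses = [
    {"provider_name": "Phala", "status": 429, "latency": 175},
    {"provider_name": "Amazon Bedrock", "status": 200, "latency": 304},
]


def serving_provider(responses: list[dict]) -> Optional[str]:
    """Return the first provider whose attempt returned HTTP 200, if any."""
    for attempt in responses:
        if attempt["status"] == 200:
            return attempt["provider_name"]
    return None


print(serving_provider(provider_responses))
```

Under these assumptions, the rate-limited Phala attempt is skipped and the Bedrock attempt is reported as the one that served the request.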

Evaluation details

Result: Fail
Evaluator: Matches Regex
Details: /Alice: ["“”]What a curious feeling! I must be shutting up like a telescope\.["“”]\s+Mouse: ["“”]Not like cats! Would you like cats if you were me\?["“”]\s+Alice: ["“”]Come, there['‘’]s no use in crying like that! I advise you to leave off this minute!["“”]/
Meta Data: n/a
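
The failure can be reproduced outside the evaluator. The regex requires three separate turns in source order (Alice, Mouse, Alice), while the model response merges both of Alice's lines into a single turn placed before the Mouse's. The sketch below ports the pattern to Python's re module, which is an assumption: the evaluator's actual regex engine is not stated in this report. The expected_shape string is a hypothetical response written only to show what would pass.

```python
import re

# Character classes copied from the evaluator's regex: straight or curly quotes.
DQ = '["“”]'
SQ = "['‘’]"

# Python port of the evaluator's pattern (assumed equivalent to the original).
PATTERN = re.compile(
    rf"Alice: {DQ}What a curious feeling! I must be shutting up like a telescope\.{DQ}\s+"
    rf"Mouse: {DQ}Not like cats! Would you like cats if you were me\?{DQ}\s+"
    rf"Alice: {DQ}Come, there{SQ}s no use in crying like that! "
    rf"I advise you to leave off this minute!{DQ}"
)

# Model response copied from this run: Alice's two lines are merged into one turn.
response = '''Alice: "What a curious feeling! I must be shutting up like a telescope. Come, there's no use in crying like that! I advise you to leave off this minute!" Mouse: "Not like cats! Would you like cats if you were me?"'''

# Hypothetical response (not produced in this run) that keeps the turns in source order.
expected_shape = '''Alice: "What a curious feeling! I must be shutting up like a telescope." Mouse: "Not like cats! Would you like cats if you were me?" Alice: "Come, there's no use in crying like that! I advise you to leave off this minute!"'''

print("model response matches:", PATTERN.search(response) is not None)
print("ordered turns match:", PATTERN.search(expected_shape) is not None)
```

The first check fails because the pattern expects a closing quote immediately after "telescope.", whereas the response continues the same quotation with "Come, there's no use...".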