Model

MoonshotAI: Kimi K2.5

Test

Voice/dialogue sheets

Scenario

Simple (1-shot)

Run ID

2026-02-08-01KGYWXPR8ANEVD3SD7XQ2EQ8N

Model input

System
You are an editor for novels. Whenever given text, you will extract the contained dialogue as a voice sheet in the following format: Speaker: "Dialogue"
User/Human
Sarah leaned back in her chair, her eyes sparkling with excitement. "Oh, it was interesting, to say the least. I tried skydiving for the first time." "Wow! That's amazing," Mark exclaimed, his coffee forgotten. "How was it?"
AI
Sarah: "Oh, it was interesting, to say the least. I tried skydiving for the first time." Mark: "Wow! That's amazing. How was it?"
User/Human
“What a curious feeling!” said Alice; “I must be shutting up like a telescope.” “Not like cats!” cried the Mouse, in a shrill, passionate voice. “Would you like cats if you were me?” “Come, there’s no use in crying like that!” said Alice to herself, rather sharply; “I advise you to leave off this minute!”

Model response

AI
Alice: "What a curious feeling! I must be shutting up like a telescope." The Mouse: "Not like cats! Would you like cats if you were me?" Alice: "Come, there's no use in crying like that! I advise you to leave off this minute!"
created_at"2026-02-08T15:12:12.008157+00:00" Sunday, February 8th, 2026 at 3:12:12 PM GMT+00:00
model"moonshotai/kimi-k2.5-0127" 
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency320
moderation_latency(null)
generation_time27778
tokens_prompt197
tokens_completion984
native_tokens_prompt231
native_tokens_completion978
native_tokens_completion_images(null)
native_tokens_reasoning1031
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/" 
is_byokfalse
finish_reason"stop" 
native_finish_reason"stop" 
usage0.00230445
router(null)
provider_responses
0
status(null)
is_byokfalse
latency10227
endpoint_id"5d3540f2-832a-4fde-87b4-b628b65e974c" 
provider_name"Fireworks" 
model_permaslug"moonshotai/kimi-k2.5-0127" 
1
id"chatcmpl-RMTmYy6JjMNZO7LojDtKF627" 
status200
is_byokfalse
latency56
endpoint_id"a1659736-3cfc-47d9-b48f-303c93c57b11" 
provider_name"DeepInfra" 
model_permaslug"moonshotai/kimi-k2.5-0127" 
api_type"completions" 
id"gen-1770563492-6OgIYAnK6ZPYlEH4187C" 
upstream_id"chatcmpl-RMTmYy6JjMNZO7LojDtKF627" 
total_cost0.00230445
cache_discount(null)
upstream_inference_cost0
provider_name"DeepInfra" 

Evaluation details

Result Evaluator Details Meta Data
Fail Matches Regex
/Alice: ["“”]What a curious feeling! I must be shutting up like a telescope\.["“”]\s+Mouse: ["“”]Not like cats! Would you like cats if you were me\?["“”]\s+Alice: ["“”]Come, there['‘’]s no use in crying like that! I advise you to leave off this minute!["“”]/
n/a