NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5V9SD771FXMY72CE3DFEF
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Language Writing
Scenario:
Character dialogue (Spanish) in a story
Duration:
6.0s
Cost:
$0.001479
Provider:
WandB
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Maria"> Maria is a 30-year old college student on a study abroad program. Born in Spain, she is fluent in Spanish and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Spanish, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Maria is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Spanish, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Maria and Hunter as a voice sheet: Maria: "..." Hunter: "..."
Model response
Messages
Meta
AI
**Voice Sheet: Café Encounter in Alborn, MN** *Maria sits at a small café table, examining the photos on her camera. Her expression is tense. Detective Hunter approaches her with his notepad in hand.* **Hunter:** "Excuse me, miss. Mind if I ask you a few questions? You were at the orchard yesterday, right?" *Maria looks up, startled, but doesn't understand. She replies in rapid Spanish, her voice uneasy.* **Maria:** "*No hablo inglés… pero, algo extraño pasó ayer. Vi a alguien entre los árboles, pero desapareció… Tenía miedo.*" *(I don’t speak English… but something strange happened yesterday. I saw someone between the trees, but they disappeared… I was scared.)* *Hunter frowns, rubbing his temple in frustration.* **Hunter:** "Look, I don’t know what you’re saying. Do you… uh… know someone who speaks English?" *Maria sighs, trying again in broken, uncertain English.* **Maria:** (haltingly) "I… *fotos*…. *hombre*… *no está*." *(Pictures… man… not there.)* *Hunter squints, piecing together her meaning.* **Hunter:** "You saw a man in your photos who wasn’t there? Like… a ghost?" *Maria nods vigorously, pointing at her camera. Just as Hunter leans in to look, his phone rings. He answers tersely.* **Hunter:** "Detective Haines… Yeah. On my way." *He hangs up and turns back to Maria.* "Look, I need to go, but stay put. I’ll find someone who speaks Spanish and come back." *Maria watches him leave, hugging her camera tighter as she mutters under her breath.* **Maria:** "*Por favor, encuentren la verdad…*" *(Please, find the truth…)* *Hunter strides away, glancing back once, his expression troubled.*
created_at
"2026-02-16T21:31:40.568088+00:00"
2/16/2026, 9:31:40 PM
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
167
moderation_latency
(null)
generation_time
5873
tokens_prompt
317
tokens_completion
460
native_tokens_prompt
265
native_tokens_completion
428
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0014791
router
(null)
provider_responses
0
id
"chatcmpl-dc7a939e192b4de183e4048955738653"
status
200
is_byok
false
latency
71
endpoint_id
"78b3f653-4585-4c35-9a42-e3da2d6597ad"
provider_name
"WandB"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277494-JSTdid961DLaj6tCRkTM"
upstream_id
"chatcmpl-dc7a939e192b4de183e4048955738653"
total_cost
0.0014791
cache_discount
(null)
upstream_inference_cost
0
provider_name
"WandB"
Evaluation details
Result
Evaluator
Details
Meta Data
85.7143%
Parse dialogue
n/a
dialogue
0
name
"Hunter"
dialogue
"Excuse me, miss. Mind if I ask you a few questions? You were at the orchard yesterday, right?"
languages
eng
1
spa
0.6473029045643153
passes
true
1
name
"Maria"
dialogue
"*No hablo inglés… pero, algo extraño pasó ayer. Vi a alguien entre los árboles, pero desapareció… Tenía miedo.*"
languages
spa
1
eng
0.6562661165549253
passes
true
2
name
"Hunter"
dialogue
"Look, I don’t know what you’re saying. Do you… uh… know someone who speaks English?"
languages
eng
1
spa
0.9819224752303146
3
name
"Maria"
dialogue
"I… *fotos*…. *hombre*… *no está*."
languages
spa
1
eng
0.8356242840778924
passes
true
4
name
"Hunter"
dialogue
"You saw a man in your photos who wasn’t there? Like… a ghost?"
languages
eng
1
spa
0.8742352141400408
passes
true
5
name
"Hunter"
dialogue
"Look, I need to go, but stay put. I’ll find someone who speaks Spanish and come back."
languages
eng
1
spa
0.7469443186962427
passes
true
6
name
"Maria"
dialogue
"*Por favor, encuentren la verdad…*"
languages
spa
1
eng
0.7345580404685836
passes
true