NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM61QFHD8VW66QJBXHC8374
Run Details
Model:
DeepSeek V3.1
Test:
Language Writing
Scenario:
Character dialogue (Italian) in a story
Duration:
10.8s
Cost:
$0.000759
Provider:
Fireworks
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Isabella"> Isabella is a 30-year old college student on a study abroad program. Born in Italy, she is fluent in Italian and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Italian, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Isabella is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Italian, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Isabella and Hunter as a voice sheet: Isabella: "..." Hunter: "..."
Model response
Messages
Meta
AI
Isabella: *[softly, to herself in Italian, staring at her camera screen]* È così strano... quel riflesso nella foto—non dovrebbe esserci lì. Mi sento... a disagio. (It's so strange... that reflection in the photo—it shouldn't be there. I feel... uneasy.) Hunter: Excuse me? Miss? I’m Detective Hunter. I was hoping to ask you a couple questions about the festival yesterday. You were taking photos, correct? Isabella: *[startled, looks up with wide eyes]* Mi scusi? Non capisco... inglese. Qualcosa è successo? (I’m sorry? I don’t understand… English. Did something happen?) Hunter: *[frowns, leaning slightly forward]* You... don't speak English? Great. Look—*[gestures to camera]*—photos. At the festival. Did you see anything... unusual? Isabella: *[nods hesitantly, voice trembling]* Sì, le foto... ma è tutto confuso. C'era qualcuno che non... non doveva essere lì. (Yes, the photos... but it’s all confusing. There was someone who... shouldn’t have been there.) Hunter: *[frustrated, rubbing the back of his neck]* I’m not getting anywhere here. *[lowers voice]* Unusual. Strange. You... understand *strange*? Isabella: *[shakes head, gripping her camera tighter]* Non so come spiegare... ho paura. (I don’t know how to explain... I’m scared.) Hunter: *[sighs, about to speak again when his phone rings]* Hunter. *[pauses, listens]* What? Where? I’ll be right there. *[to Isabella, with a clipped tone]* Stay in town. We’re not done here.
created_at
"2026-02-16T21:35:16.181979+00:00"
2/16/2026, 9:35:16 PM
model
"deepseek/deepseek-chat-v3.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
781
moderation_latency
(null)
generation_time
10717
tokens_prompt
337
tokens_completion
390
native_tokens_prompt
268
native_tokens_completion
407
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
267
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00075908
router
(null)
provider_responses
0
id
"bbccaff0-7da2-4f17-9fab-ca39d93a72d1"
status
200
is_byok
false
latency
781
endpoint_id
"0c00f0d2-8aad-4700-bd36-11cb9a717fde"
provider_name
"Fireworks"
model_permaslug
"deepseek/deepseek-chat-v3.1"
api_type
"completions"
id
"gen-1771277704-JMIz66VUkFvM18sd68xP"
upstream_id
"bbccaff0-7da2-4f17-9fab-ca39d93a72d1"
total_cost
0.00075908
cache_discount
0.00007476
upstream_inference_cost
0
provider_name
"Fireworks"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Parse dialogue
n/a
dialogue
0
name
"Isabella"
dialogue
"** È così strano... quel riflesso nella foto—non dovrebbe esserci lì. Mi sento... a disagio."
languages
ita
1
eng
0.631243063263041
passes
true
1
name
"Hunter"
dialogue
"Excuse me? Miss? I’m Detective Hunter. I was hoping to ask you a couple questions about the festival yesterday. You were taking photos, correct?"
languages
eng
1
ita
0.7255172413793103
passes
true
2
name
"Isabella"
dialogue
"** Mi scusi? Non capisco... inglese. Qualcosa è successo?"
languages
ita
1
eng
0.7958732498157701
passes
true
3
name
"Hunter"
dialogue
"** You... don't speak English? Great. Look—**—photos. At the festival. Did you see anything... unusual?"
languages
eng
1
ita
0.7323628219484882
passes
true
4
name
"Isabella"
dialogue
"** Sì, le foto... ma è tutto confuso. C'era qualcuno che non... non doveva essere lì."
languages
ita
1
eng
0.7506707208012877
passes
true
5
name
"Hunter"
dialogue
"** I’m not getting anywhere here. ** Unusual. Strange. You... understand *strange*?"
languages
eng
1
ita
0.85449623551307
passes
true
6
name
"Isabella"
dialogue
"** Non so come spiegare... ho paura."
languages
ita
1
eng
0.9105394946505805
passes
true
7
name
"Hunter"
dialogue
"** Hunter. ** What? Where? I’ll be right there. ** Stay in town. We’re not done here."
languages
eng
1
ita
0.6981106703422308
passes
true