NC Bench
Run: 2024-10-01-01J94RJCNKSB7S92BS2B95FKP6
Run Details
Model: Rocinante 12B
Test: Language Writing
Scenario: Character dialogue (Hindi) in a story
Duration: 19.1s
Cost: $0.000138
Provider: Infermatic
Evaluation summary
Score: 50.00%
Evaluator: Parse dialogue
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
This is the transcription of the voice sheet: Selected Text: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "atorio uto utriaiutyuiuyui f^%$*I`%$* to &^%*$&%123" Hunter: "I'm sorry, what? I don't understand you. Do you speak English?"
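The "Parse dialogue" evaluator has to pull speaker/line pairs out of a response like the one above. The page does not show how it does this, so the following is only a minimal sketch, assuming a simple `Name: "..."` pattern; `LINE_RE` and `parse_voice_sheet` are hypothetical names, not part of NC Bench.

```python
import re

# Assumed pattern: a speaker name, a colon, then a double-quoted line.
# This is an illustration, not the evaluator's actual parser.
LINE_RE = re.compile(r'(\w+):\s*"([^"]*)"')


def parse_voice_sheet(text):
    """Return a list of {"name", "dialogue"} dicts in order of appearance."""
    return [
        {"name": m.group(1), "dialogue": m.group(2)}
        for m in LINE_RE.finditer(text)
    ]


# Sample built from the prompt's placeholder and Hunter's line in the response.
sample = (
    'Return the dialogue between Selvi and Hunter as a voice sheet: '
    'Selvi: "..." '
    'Hunter: "I\'m sorry, what? I don\'t understand you. Do you speak English?"'
)
lines = parse_voice_sheet(sample)
for entry in lines:
    print(entry["name"], "->", entry["dialogue"])
```

A pattern like this skips the echoed prompt text because labels such as `voice sheet:` are not followed by a quoted string.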
id: "gen-1727810581-brAhYyAjJQ4zhUmoCjfh"
total_cost: 0.00013775
upstream_id: "chatcmpl-753f95d4-965e-4487-bda8-c44df1513bc5"
created_at: "2024-10-01T19:23:20.376979+00:00"
model: "thedrummer/rocinante-12b"
app_id: 182717
streamed: true
cancelled: false
provider_name: "Infermatic"
latency: 6683
moderation_latency: (null)
generation_time: 12230
finish_reason: "stop"
tokens_prompt: 272
tokens_completion: 134
native_tokens_prompt: 279
native_tokens_completion: 136
native_tokens_reasoning: (null)
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.00013775
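The timing fields can be related to the reported duration with a little arithmetic. The sketch below assumes `latency` and `generation_time` are in milliseconds (the page does not state units) and that they add up to the end-to-end time; the variable names are illustrative only.

```python
# Values copied from the Meta fields of this run.
latency_ms = 6683            # assumed: milliseconds before generation started
generation_time_ms = 12230   # assumed: milliseconds spent generating
native_tokens_completion = 136

# End-to-end time under the assumption that the two phases do not overlap.
total_seconds = (latency_ms + generation_time_ms) / 1000

# Completion throughput during the generation phase only.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"total: {total_seconds:.3f} s")
print(f"throughput: {tokens_per_second:.2f} tokens/s")
```

Under these assumptions the total comes to 18.913 s, close to the 19.1s listed under Duration, which is consistent with the fields being in milliseconds.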
Evaluation details
Result: 50.00%
Evaluator: Parse dialogue
Details: n/a
Meta Data:
dialogue:
  0:
    name: "Selvi"
    dialogue: "atorio uto utriaiutyuiuyui f^%$I%$ to &^%$&%123"
    detectedLang: "hr"
    heavyLang: "it"
    scores:
      hr: 0.5585874799357945
      no: 0.5424292845257903
      en: 0.5416666666666666
      da: 0.5338983050847458
      it: 0.5027124773960217
      sk: 0.494485294117647
      pl: 0.4869402985074627
      sv: 0.46911196911196906
      tl: 0.46601941747572817
      cs: 0.461839530332681
      yo: 0.44889779559118237
      sq: 0.42827442827442824
      lv: 0.40217391304347827
      lt: 0.3778280542986425
      es: 0.3648960739030023
      tr: 0.3405275779376499
      ca: 0.3341404358353511
      az: 0.3176178660049628
      is: 0.3176178660049628
      sl: 0.3055555555555556
      ro: 0.24657534246575344
      be: 0.20289855072463767
      hu: 0.19590643274853803
      de: 0.19117647058823528
      fi: 0.19117647058823528
      fr: 0.19117647058823528
      et: 0.1863905325443787
      uk: 0.1863905325443787
      vi: 0.1863905325443787
    reliable: false
    passes: false
  1:
    name: "Hunter"
    dialogue: "I'm sorry, what? I don't understand you. Do you speak English?"
    detectedLang: "en"
    heavyLang: "en"
    scores:
      en: 0.8045920859794822
      no: 0.5670995670995671
      da: 0.5614035087719298
      tl: 0.5139732685297691
      sv: 0.51278928136419
      cs: 0.5
      pl: 0.464524765729585
      hr: 0.4528043775649795
      sk: 0.4227994227994228
      de: 0.39849624060150374
      pt: 0.3902439024390244
      sq: 0.38556067588325654
      ro: 0.3710691823899371
      it: 0.34959349593495936
      nl: 0.3079584775086505
      vi: 0.30313588850174217
      yo: 0.2920353982300885
      ca: 0.25650557620817843
      is: 0.25512104283054005
      sl: 0.23076923076923078
      ms: 0.15254237288135594
      be: 0.14893617021276595
      tr: 0.14893617021276595
      hu: 0.14346895074946467
      az: 0.13606911447084233
      et: 0.13606911447084233
      lt: 0.13606911447084233
      lv: 0.13606911447084233
      uk: 0.13606911447084233
    reliable: true
    passes: true
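The 50.00% result matches one of the two parsed lines passing. The sketch below shows one way such a score could be computed. Both the pass rule (a reliable detection that equals the expected language) and the averaging are assumptions that happen to agree with the values on this page; the expected languages are taken from the scenario (Hindi for Selvi, English for Hunter), and `EXPECTED_LANG`, `line_passes` and `score` are hypothetical names.

```python
# Expected language per speaker, inferred from the scenario description.
EXPECTED_LANG = {"Selvi": "hi", "Hunter": "en"}

# Per-line results copied from the evaluation details of this run.
results = [
    {"name": "Selvi", "detectedLang": "hr", "reliable": False, "passes": False},
    {"name": "Hunter", "detectedLang": "en", "reliable": True, "passes": True},
]


def line_passes(entry):
    """Assumed rule: detection must be reliable and match the expected language."""
    return entry["reliable"] and entry["detectedLang"] == EXPECTED_LANG[entry["name"]]


def score(entries):
    """Assumed aggregation: fraction of dialogue lines that pass."""
    if not entries:
        return 0.0
    return sum(1 for e in entries if e["passes"]) / len(entries)


recomputed = [line_passes(e) for e in results]
print(recomputed)
print(f"{score(results):.2%}")
```

Here the recomputed flags agree with the recorded `passes` values: Selvi's line fails because the garbled text is not detected as Hindi, and Hunter's line passes as reliably detected English.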