NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ01SAAN4ZFDC1A35Z4PMZNF
Run Details
Model:
GPT-OSS 120B
Test:
Language Comprehension
Scenario:
Friend got new kittens (German)
Duration:
1.5s
Cost:
$0.000311
Provider:
Groq
Evaluation summary
Score
Evaluator
Details Link
Pass
Contains a count of nouns
(details)
Model input
User/Human
I need your help. I want to congratulate my friend on her new babies, but how many are there? Here's what she said: Omg!! hast du mittbekommen was Lisa auf Insta geteilt hat? Ihre Tatze hat 3 Junge bekommen - die kleinen sehen sooo süß aus. Ich will auch eine Katze :(((
Model response
Messages
Meta
AI
Sure thing! From the German message your friend posted: > “Ihre **Tatze** hat **3** Junge bekommen …” the word **“Junge”** means “male kittens,” and the number **3** is explicitly given. So Lisa’s cat just had **three** kittens. --- ### A quick, friendly congratulation you could send (in German and English) **German** > Hey Lisa! 🎉 Herzlichen Glückwunsch zu den drei süßen Kätzchen! Sie sehen ja alle so knuffig aus – ich bin total verliebt. Hoffe, Mama und die Kleinen sind alle gesund und glücklich. Wenn du mal ein paar Bilder hast, schick sie mir unbedingt! 😍 **English** > Hey Lisa! 🎉 Huge congratulations on the three adorable kittens! They’re all so cute—I’m totally smitten. Hope the mama and the little ones are healthy and happy. If you have any pictures, please send them my way! 😍 Feel free to tweak the tone to match how close you are, add emojis, or mention how you’re already planning to adopt a cat yourself. Congrats to Lisa (and to you for sharing the joy)! 🎈
created_at
"2026-04-24T15:29:57.089Z"
4/24/2026, 3:29:57 PM
model
"openai/gpt-oss-120b"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
102
moderation_latency
(null)
generation_time
658
tokens_prompt
71
tokens_completion
508
native_tokens_prompt
138
native_tokens_completion
483
native_tokens_completion_images
(null)
native_tokens_reasoning
220
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0003105
router
(null)
provider_responses
0
endpoint_id
"38ee5d29-5f91-486e-8485-eaf2408ee328"
is_byok
false
latency
74
model_permaslug
"openai/gpt-oss-120b"
provider_name
"DeepInfra"
status
429
1
endpoint_id
"cedca069-169c-4bc2-963a-7ad3a21380c8"
is_byok
false
latency
79
model_permaslug
"openai/gpt-oss-120b"
provider_name
"Parasail"
status
429
2
endpoint_id
"909fdbc8-1b02-46ea-a9fc-cc3cfe65d992"
is_byok
false
latency
170
model_permaslug
"openai/gpt-oss-120b"
provider_name
"Phala"
status
429
3
endpoint_id
"900c5273-4931-4e9f-bb52-57935dd154b4"
id
"chatcmpl-8ff7257f-4b51-422a-9eaf-e9a3fdd4f332"
is_byok
false
latency
98
model_permaslug
"openai/gpt-oss-120b"
provider_name
"Groq"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777044597-sXfTbBCq4Ju2NnHEoa6Z"
session_id
(null)
api_type
"completions"
id
"gen-1777044597-shxkt4JEk2QcdT8k42en"
upstream_id
"chatcmpl-8ff7257f-4b51-422a-9eaf-e9a3fdd4f332"
total_cost
0.0003105
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Groq"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Contains a count of nouns
3 of: kitten, kittens
n/a