NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY7JPM4EVAQDX6KZ9JBFDXD
Run Details
Model:
Gemma 3 12B
Test:
N-Length Sentences
Scenario:
Write sentences with 10 words each
Duration:
3.5s
Cost:
$0.000018
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
89.63%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 10 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
Rain lashed the windows, a frantic, mournful, rhythmic patter outside. Lord Ashworth lay still upon the Persian rug below. A single crimson stain bloomed outward, darkening the intricate design. Inspector Davies surveyed the scene, his expression grim and thoughtful. The grand library felt cold despite the roaring fireplace within. Lady Ashworth wept quietly, clutching a delicate porcelain teacup. Servants whispered nervously, huddled together near the mahogany doorway. A priceless antique clock ticked slowly, marking the dreadful hour. Everything seemed ordinary moments before the shocking, brutal discovery. Something was amiss even beyond the obvious tragedy’s stark presence.
created_at
"2026-02-20T19:14:13.777Z"
2/20/2026, 7:14:13 PM
model
"google/gemma-3-12b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
241
moderation_latency
(null)
generation_time
3359
tokens_prompt
63
tokens_completion
173
native_tokens_prompt
46
native_tokens_completion
125
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00001809
router
(null)
provider_responses
0
endpoint_id
"5aef2a85-3d1c-4659-a522-98bbb5272c6a"
is_byok
false
latency
55
model_permaslug
"google/gemma-3-12b-it"
provider_name
"Crusoe"
status
429
1
endpoint_id
"eb06dc92-5a16-47ec-a776-6ef956457c47"
id
"chatcmpl-Rwk9SaBuprNgcZqHztTVCdth"
is_byok
false
latency
57
model_permaslug
"google/gemma-3-12b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771614853-xICKcGHJhCnC7sUdy6du"
upstream_id
"chatcmpl-Rwk9SaBuprNgcZqHztTVCdth"
total_cost
0.00001809
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
89.63%
Matches word count
n/a
neededClean
false
sentences
10
wordCounts
0
10
1
9
2
10
3
10
4
10
5
9
6
9
7
10
8
9
9
10