NC Bench
Run: 2026-02-20-01KHYB7RYCDDBZ3P2WF3DC1BP0
Run Details
Model: Gemma 3 4B
Test: N-Length Sentences
Scenario: Write sentences with 20 words each
Duration: 2.5s
Cost: $0.000016
Provider: DeepInfra
Evaluation summary
Score: 0.13%
Evaluator: Matches word count
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
AI
Rain lashed against the grand manor windows, creating an unsettling and dramatic atmosphere within the hall. Detective Inspector Davies surveyed the opulent room, noting the peculiar stillness of the assembled guests. A priceless antique clock ticked rhythmically, marking the passage of time with a morbid insistence. Lord Ashworth lay sprawled upon the Persian rug, a single crimson stain blooming on his waistcoat. The butler, Mr. Finch, nervously adjusted his spectacles, maintaining an unnervingly calm demeanor immediately. A priceless diamond necklace was conspicuously missing, suggesting a robbery gone horribly, terribly wrong. Several guests offered strained condolences, their faces reflecting a mixture of shock and veiled suspicion. The scent of lilies and something darker, metallic, hung heavily in the air, creating a suffocating effect. Investigating the scene carefully, Davies recognized the complexities of a deeply layered and intricate scheme.
Meta
created_at: "2026-02-20T20:18:10.005Z" (2/20/2026, 8:18:10 PM)
model: "google/gemma-3-4b-it"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 290
moderation_latency: (null)
generation_time: 2332
tokens_prompt: 63
tokens_completion: 243
native_tokens_prompt: 46
native_tokens_completion: 174
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00001576
router: (null)
provider_responses:
  0:
    endpoint_id: "d3de3bd4-81bc-48fb-924f-2a87b2a36e75"
    id: "chatcmpl-R2tIYQFhAccPb1qiqEwxkH5e"
    is_byok: false
    latency: 55
    model_permaslug: "google/gemma-3-4b-it"
    provider_name: "DeepInfra"
    status: 200
api_type: "completions"
id: "gen-1771618690-XEBH63MrtUSGYavXvony"
upstream_id: "chatcmpl-R2tIYQFhAccPb1qiqEwxkH5e"
total_cost: 0.00001576
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepInfra"
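The generation_time and native_tokens_completion values in the metadata above can be combined into a rough throughput figure. The sketch below assumes that generation_time is reported in milliseconds, which the page does not state explicitly (it is inferred from the 2.5s duration shown under Run Details). The function name tokens_per_second is hypothetical and is not part of NC Bench or the provider's API.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: float) -> float:
    """Return completion tokens generated per second.

    Assumes generation_time_ms is in milliseconds (an assumption,
    not something the run page confirms).
    """
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from this run's metadata.
rate = tokens_per_second(completion_tokens=174, generation_time_ms=2332)
print(f"{rate:.1f} tokens/s")
```

If the unit assumption is wrong, the figure scales accordingly, so treat it as an estimate rather than a reported metric.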
Evaluation details
Result: 0.13%
Evaluator: Matches word count
Details: n/a
Meta Data:
  neededClean: false
  sentences: 9
  wordCounts:
    0: 16
    1: 15
    2: 15
    3: 16
    4: 14
    5: 14
    6: 15
    7: 17
    8: 15
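The per-sentence word counts above can be approximated with a small splitter and counter. This is a minimal sketch and not the benchmark's actual evaluator, whose implementation and scoring formula are not shown on this page. The function names and the ABBREVIATIONS set are hypothetical; the set exists so that a title such as "Mr." is not treated as the end of a sentence.

```python
# Hypothetical abbreviation list; the real evaluator may handle these differently.
ABBREVIATIONS = {"mr.", "mrs.", "ms.", "dr."}


def split_sentences(text: str) -> list[str]:
    """Split text into sentences at tokens ending in '.', '!' or '?'."""
    sentences, current = [], []
    for token in text.split():
        current.append(token)
        ends_sentence = token[-1] in ".!?" and token.lower() not in ABBREVIATIONS
        if ends_sentence:
            sentences.append(" ".join(current))
            current = []
    if current:  # trailing text without closing punctuation
        sentences.append(" ".join(current))
    return sentences


def word_counts(text: str) -> list[int]:
    """Count whitespace-separated words in each sentence."""
    return [len(sentence.split()) for sentence in split_sentences(text)]


def mismatches(counts: list[int], target: int) -> list[int]:
    """Return the indices of sentences whose word count differs from target."""
    return [i for i, n in enumerate(counts) if n != target]


sample = (
    "The butler, Mr. Finch, nervously adjusted his spectacles, "
    "maintaining an unnervingly calm demeanor immediately. "
    "Rain lashed against the grand manor windows, creating an "
    "unsettling and dramatic atmosphere within the hall."
)
counts = word_counts(sample)
print(counts, mismatches(counts, target=20))
```

In this run every recorded count falls between 14 and 17, so none of the nine sentences reaches the requested 20 words.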