NC Bench
Run: 2026-02-19-01KHVPAWTTPMJNF63ZVDB29091
Run Details
Model: Gemini 3.1 Pro (Preview)
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 1m 15s
Cost: $0.102182
Provider: Google AI Studio
Evaluation summary
Matches word count: 100.00%
Dialogue to Total Word Ratio: 100.00%
Overall: 100.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"The target is extremely heavily guarded," Molly whispered, unrolling a crumpled blueprint of a backyard bouncy castle. "Are you really sure about this?" Bill asked, adjusting his tight black tactical sweater. "Those toddlers look absolutely ruthless. What if they throw sticky juice boxes at us?" He pointed a gloved finger toward the chaotic screaming crowd. "We drop in during the afternoon magic show," she replied, tapping a red marker against her chin. "That cheap magician is a total hack. When he pulls the silly rabbit out, we strike." A loud balloon suddenly popped in the distance, making them flinch. "Stealing wrapped presents from a tiny seven year old seems just slightly unprofessional," he muttered, nervously watching an entertainer twist balloons into animal shapes. "We are highly trained professional thieves, Bill," she hissed loudly, glaring at her anxious accomplice through the bushes. "The loot is piled strictly by the piƱata. We grab the large boxes and vanish." They stared at a towering mountain of bright shiny gift bags. "Do we get chocolate cake?" he asked with a hopeful expression spreading across his sweating face. "Only if you can quietly disable the roaming security clown," she flatly confirmed before checking her watch.
created_at
"2026-02-19T19:34:23.161Z"
2/19/2026, 7:34:23 PM
model
"google/gemini-3.1-pro-preview-20260219"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4152
moderation_latency
(null)
generation_time
74726
tokens_prompt
77
tokens_completion
4121
native_tokens_prompt
67
native_tokens_completion
8504
native_tokens_completion_images
0
native_tokens_reasoning
8247
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.102182
router
(null)
provider_responses
0
endpoint_id
"2abf5677-0826-45d9-8470-3d62c31d54a1"
is_byok
false
latency
4152
model_permaslug
"google/gemini-3.1-pro-preview-20260219"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771529663-pK09u2aKMBUS6nViUWdZ"
upstream_id
(null)
total_cost
0.102182
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
200
100.00%
Dialogue to Total Word Ratio
Ratio: 50.00%, Deviation: 0.00%
neededClean
false
wordsTotal
200
wordsDialogue
100
100.0000%
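The word-count and ratio figures above (words 200, wordsDialogue 100, Ratio 50.00%, Deviation 0.00%) can be approximated with a short Python sketch. NC Bench's actual evaluator is not shown on this page, so everything here is an assumption: words are taken to be whitespace-separated tokens, dialogue is taken to be any text between straight double quotes, and the function names are hypothetical. How the deviation maps to the percentage score is not stated in the source and is left out.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed definition of a word)."""
    return len(text.split())


def dialogue_words(text: str) -> int:
    """Count words inside straight double-quoted spans (assumed dialogue rule)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in spans)


def dialogue_ratio(text: str) -> float:
    """Fraction of all words that fall inside quoted dialogue."""
    total = count_words(text)
    return dialogue_words(text) / total if total else 0.0


def deviation(ratio: float, target: float = 0.5) -> float:
    """Absolute distance of the measured ratio from the target ratio."""
    return abs(ratio - target)


# Short illustrative sample, not the model response from this run.
sample = '"We strike at noon," Molly whispered to Bill.'
print(count_words(sample))     # 8
print(dialogue_words(sample))  # 4
print(dialogue_ratio(sample))  # 0.5
```

With the reported counts, the same arithmetic gives 100 / 200 = 0.5 and a deviation of 0.0 from the 50% target, matching the Details column.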