NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR90261RT0BJVHCCKNCESZ5F
Run Details
Model:
Gemma 4 31B (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 1s
Cost:
$0.000873
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
16.36%
Dialogue to Total Word Ratio
(details)
8.1812%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the grease-stained workbench, her finger stabbing a red circle around the living room's floral rug. She had spent three days surveilling the residence, charting the precise movement of the juice boxes and the exact duration of the musical chairs round. The map was a masterpiece of amateur espionage, complete with arrows indicating the flow of toddlers and a X marking the designated gift pile. Bill leaned in, smelling faintly of old ham and desperation, and squinted at the scribbles. "Are we really doing this, Molly? It's a six-year-old's birthday," Bill whispered. "It is a six-year-old with parents who shop at boutique toy stores in the city, Bill," Molly snapped, her eyes wide with intensity. "Those limited edition robotic dinosaurs fetch a fortune on the black market. We aren't stealing from a child; we are liberating overpriced plastic from a household of excess." Bill scratched his chin, looking skeptical. He shifted his weight, making the floorboards of the damp garage creak beneath his boots. He wasn't a man of grand ambitions, but stealing from a group of sugar-crazed first graders felt like a tactical nightmare that no amount of planning could fully mitigate. He imagined the sticky fingers and the high-pitched shrieking. "What about the cake?" he asked. "If we're already inside, can I snag a slice of the chocolate fudge?" Molly glared at him with the coldness of a professional who had once stolen a prized poodle from a governor's mansion. "Focus, Bill. The window of opportunity is precisely twelve minutes. While the children are distracted by the magician's disappearing rabbit trick, we breach the perimeter through the sliding glass door. You handle the bag; I handle the perimeter." Bill sighed, glancing at the costume laid out on the table. It was a neon-pink tutu and a glittery headband that looked like it had been salvaged from a bankrupt carnival. He felt a cold sweat break across his brow as he contemplated the social cost of the operation. "Tell me I don't have to wear the skirt," Bill pleaded. "The theme is 'Enchanted Forest,' and you are the mystical woodland sprite," Molly replied, her voice devoid of emotion. "It is the perfect camouflage. No one suspects a man in a tutu of being a professional heist artist." Bill looked at the tutu and then back at the map. He imagined the chaos of twenty children screaming in unison, a sensory overload that would likely leave him paralyzed in the middle of the living room. He wondered if the robotic dinosaurs were truly worth the inevitable humiliation. "Do I get a wand?" he asked quietly. Molly handed him a plastic star on a stick with a triumphant smirk. "Now we have a plan. Let's go make some toddlers very disappointed."
created_at
"2026-05-10T13:08:42.176Z"
5/10/2026, 1:08:42 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
170
moderation_latency
(null)
generation_time
61339
tokens_prompt
77
tokens_completion
2518
native_tokens_prompt
82
native_tokens_completion
2270
native_tokens_completion_images
(null)
native_tokens_reasoning
1809
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00087326
router
(null)
provider_responses
0
endpoint_id
"4149ab03-3b69-45a0-b94c-6c253b8641a1"
id
"chatcmpl-RPW1E74M9PYXyThzPmFsBdfl"
is_byok
false
latency
43
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778418522-hQRBjgzih9QbQu7haKGI"
session_id
(null)
api_type
"completions"
id
"gen-1778418522-VvDKkqvQA9O3QEo1dOav"
upstream_id
"chatcmpl-RPW1E74M9PYXyThzPmFsBdfl"
total_cost
0.00087326
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
464
16.36%
Dialogue to Total Word Ratio
Ratio: 36.02%, Deviation: 6.02%
neededClean
false
wordsTotal
472
wordsDialogue
170
8.1812%