Our posts

Our posts — Ideas behind the private inference cloud.

Thoughts about the AI industry, AI in general, inference economics, and the servescale.ai thesis: why enterprise AI needs a private inference cloud inside their control boundaries.

Michael Dvorkin

Infrastructure economics

Don’t Scale Up TL;DR

The room is the architecture. Enterprise inference depends on power, topology, cache locality, utilization, and disciplined cost control.

Read post →

Michael Dvorkin

Model economics

Don’t Scale Up Part 2: The Model Edition

Production AI needs right-sized models, routing, validators, tools, and escalation paths instead of sending every task to the largest model.

Read post →

Dante Malagrino

Enterprise platformization

The Inevitable Path of AI

AI is moving from developer experimentation to operational platforms owned by CIO and CTO organizations.

Read post →

Dante Malagrino

Inference market architecture

2026: The Bifurcation of the Inference Market

The inference market is splitting between rented-capacity velocity and enterprise sovereignty, governance, locality, and permanence.

Read post →

Hero image thumbnail for How You Can Turn 2025 AI Pilots into an Enterprise Platform

Dante Malagrino

AI platformization

Turn 2025 AI Pilots into an Enterprise Platform

Enterprise AI pilots need unified model access, governance, policy, infrastructure economics, and accountability to become durable platforms.

Read post →

Michael Dvorkin

Agentic AI governance

The Ode to Chaos

Agentic AI does not need fake guardrails. It needs anti-capture architecture, promise discipline, and systems that metabolize uncertainty.

Read post →

Michael Dvorkin

Power economics

Every Watt is Precious

A power-noir argument that AI economics are now shaped by scarce electricity, cooling limits, PUE, reliability, and energy per token.

Read post →

Michael Dvorkin

Energy strategy

The Energy Gambit

The AI revolution is becoming a physical contest for coolable, controllable megawatts, water, siting, and energy sovereignty.

Read post →

Michael Dvorkin

Neocloud economics

The Neocloud Doom Loop

Inference demand, power constraints, hardware depreciation, and token-margin pressure can turn GPU clouds against their own balance sheets.

Read post →

Michael Dvorkin

Agentic AI governance

Promise Theory, Agentic AI, and the Collapse of Command-and-Control

A systems argument for treating agentic AI trust as an ecology of promises, constraints, observability, and negotiated behavior.

Read post →

Michael Dvorkin

AI utility economics

Neocloud as AI Utility

AI infrastructure should be financed like power and shipped like product: disciplined, metered, reliable, and built for production utility.

Read post →

Dante Malagrino

Founder field report

Three Weeks in Puglia

A founder’s report from Italy on startup geography, local ambition, and what Silicon Valley thinking looks like far from Silicon Valley.

Read post →

Michael Dvorkin

Agentic AI trust

The Trust Is Broken

Identity-first trust is cracking under agentic AI. Production systems need explicit promises, context, verification, and observable behavior.

Read post →

Next article Coming soon!

Our posts — Ideas behind the private inference cloud.

Don’t Scale Up TL;DR

Don’t Scale Up Part 2: The Model Edition

The Inevitable Path of AI

2026: The Bifurcation of the Inference Market

Turn 2025 AI Pilots into an Enterprise Platform

The Ode to Chaos

Every Watt is Precious

The Energy Gambit

The Neocloud Doom Loop

Promise Theory, Agentic AI, and the Collapse of Command-and-Control

Neocloud as AI Utility

Three Weeks in Puglia

The Trust Is Broken

Watts → ROI. Optimized.