We Taught a Building to Think—And It Cut Energy Bills by 32%
Your HVAC System Is an Idiot
I don’t mean that as an insult. I mean it as a precise technical assessment.
That building management system you spent $400,000 installing? The one with the sleek dashboard and the impressive-sounding algorithms?
It runs on logic that would embarrass a mediocre thermostat from 1987.
Set temperature to 72°F when occupied. Lower to 65°F when empty.
Adjust based on... nothing, really. Just time of day and whatever the facilities manager programmed during installation.
Meanwhile, energy represents the single largest operating expense in commercial real estate. One of our largest clients alone spends $8.7 billion annually. Industry estimates suggest 15-30% of that expenditure is pure waste—not inefficiency that requires capital investment to fix, but waste caused by systems too stupid to respond to conditions they could easily sense.
The weather forecast says tomorrow will be unseasonably warm.
Your BMS doesn’t know.
The building next door just emptied for lunch and 400 people are about to flood your food court.
Your BMS doesn’t know.
Electricity prices spike 340% during peak demand hours.
Your BMS doesn’t care.
We built the Autonomous Energy Optimization System to make buildings think. Actually think. Not follow rules written by someone who left the company three years ago.
The DeepMind Proof Point
Before I explain what we built, let me tell you what convinced us the approach would work.
In 2016, DeepMind deployed reinforcement learning to optimize cooling in Google’s data centers. The results were staggering: a 40% reduction in the energy used for cooling, and a 15% reduction in overall Power Usage Effectiveness overhead. The system processed thousands of data points every five minutes and controlled cooling infrastructure autonomously—no human intervention required.
That wasn’t a pilot.
That was production deployment at scale, running one of the most demanding computing environments on the planet.
If reinforcement learning could optimize cooling for facilities where a temperature deviation of 2°C risks millions in equipment damage, it could certainly handle a commercial office building where the stakes are comfort rather than catastrophe.
The technology was proven.
The question was whether it could be packaged for environments without Google’s engineering resources.
What We Actually Built
The Autonomous Energy Optimization System implements what I call the Adaptive Building Intelligence Stack—four integrated subsystems that transform passive building management into active energy optimization.
Layer one: Deep Q-Networks for HVAC optimization. Reinforcement learning that treats building climate control as a continuous optimization problem. The system learns which actions (adjusting setpoints, modulating airflow, pre-cooling before occupancy) produce which outcomes (energy consumption, comfort scores, equipment stress) under which conditions (weather, occupancy, time-of-use pricing). Unlike rule-based systems, it discovers optimal strategies that no human would think to program.
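To make the state/action/outcome framing concrete, here is a minimal tabular Q-learning sketch standing in for the Deep Q-Network layer. The state bands, action names, and hyperparameters are illustrative assumptions, not the production system; a real deployment would replace the lookup table with a neural network over continuous sensor inputs.

```python
import random
from collections import defaultdict

# Illustrative discretized state: (outdoor_temp_band, occupancy_band, price_band)
ACTIONS = ["setpoint_down", "hold", "setpoint_up", "pre_cool"]

class HVACQAgent:
    """Tabular Q-learning stand-in for the DQN layer (sketch only)."""

    def __init__(self, alpha=0.1, gamma=0.95, epsilon=0.1):
        self.q = defaultdict(float)  # (state, action) -> estimated value
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

    def act(self, state):
        if random.random() < self.epsilon:  # explore occasionally
            return random.choice(ACTIONS)
        return max(ACTIONS, key=lambda a: self.q[(state, a)])  # else exploit

    def learn(self, state, action, reward, next_state):
        # Standard Q-learning temporal-difference update
        best_next = max(self.q[(next_state, a)] for a in ACTIONS)
        td_target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])
```

The point of the structure is the one made above: nothing in the agent encodes a human rule like “pre-cool at 5 AM.” Strategies emerge from the reward signal alone.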
Layer two: Weather forecast integration with predictive load management. The system ingests 72-hour weather forecasts and adjusts building operation proactively rather than reactively. If tomorrow will be 15 degrees warmer than today, pre-cool tonight when electricity costs half as much. If a cold front arrives Thursday, reduce heating Wednesday evening and let thermal mass carry the load.
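The pre-cooling logic described above reduces to a simple decision rule. This sketch uses invented threshold values (`warm_delta_f`, `price_ratio`) to show the shape of the check, not the tuned parameters of the deployed system.

```python
def plan_precooling(forecast_high_f, today_high_f, offpeak_price, peak_price,
                    warm_delta_f=10.0, price_ratio=0.6):
    """Decide whether to pre-cool overnight ahead of a warm day.

    Pre-cool only when tomorrow is meaningfully warmer than today AND
    off-peak electricity is cheap enough relative to the peak tariff.
    Thresholds are illustrative assumptions.
    """
    warmer_tomorrow = forecast_high_f - today_high_f >= warm_delta_f
    cheap_offpeak = offpeak_price <= price_ratio * peak_price
    return warmer_tomorrow and cheap_offpeak
```

A forecast high of 90°F against today’s 75°F, with off-peak power at $0.08/kWh versus a $0.20/kWh peak rate, would trigger overnight pre-cooling; a flat forecast would not.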
Layer three: Privacy-preserving occupancy sensing with edge AI. Computer vision models running on local hardware—not cloud—count people without identifying them. The system knows Conference Room B has 12 people, not that John from accounting is meeting with Sarah from legal. Occupancy data drives real-time HVAC adjustments and informs the reinforcement learning about actual building usage patterns.
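The count-only guarantee can be expressed directly in the edge pipeline’s data flow: detection runs locally, only an anonymous count survives, and the frame is discarded before anything is transmitted. The `detector` interface and `OccupancyReading` shape here are assumptions for illustration.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class OccupancyReading:
    zone: str
    count: int        # number of people only — no identities, no imagery
    timestamp: float

def process_frame(zone, frame, detector, now):
    """Run person detection on-device and keep only the count.

    `detector` is assumed to return one bounding box per detected person.
    The raw frame is dropped here; it never leaves the edge device.
    """
    count = len(detector(frame))
    del frame  # discard original imagery before any data leaves the device
    return OccupancyReading(zone=zone, count=count, timestamp=now)
```

Everything downstream — HVAC adjustment, RL training — consumes `OccupancyReading` objects, so there is structurally nothing identifying to leak.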
Layer four: LLM-powered policy generation. This is where Claude Sonnet 4 earns its keep. The system generates natural-language energy policies, automates carbon accounting reports, and translates complex optimization decisions into explanations that facilities managers can understand and audit. When the system decides to pre-cool at 3 AM, the policy layer explains why in terms humans can verify.
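A sketch of the explanation step, assuming a decision record shaped like the one below. The prompt-building function is runnable; the commented-out API call shows where the model fits, using the Anthropic Messages API with an assumed model id.

```python
def build_explanation_prompt(decision):
    """Turn a structured optimization decision into a plain-language request."""
    return (
        "Explain the following building-automation decision to a facilities "
        "manager in plain English, in two sentences or fewer:\n"
        f"- action: {decision['action']}\n"
        f"- reason: {decision['reason']}\n"
        f"- expected saving: {decision['saving_eur']} EUR"
    )

# Hypothetical call (requires the `anthropic` package and an API key):
# import anthropic
# client = anthropic.Anthropic()
# msg = client.messages.create(
#     model="claude-sonnet-4-20250514",  # assumed model id
#     max_tokens=200,
#     messages=[{"role": "user", "content": build_explanation_prompt(decision)}],
# )
```

Keeping the decision record structured and the prompt templated is what makes the explanations auditable: a facilities manager can trace every sentence back to a field the optimizer actually emitted.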
The Economics Nobody Wants to Discuss
Development cost us €110,000.
AI agent processing consumed 800 hours.
Infrastructure runs €15,000 monthly during operation.
Total pilot investment: €160,000 internal, €220,000 client pricing for a 12-week proof-of-concept.
Full deployment: €580,000 internal, €1.6 million client pricing depending on portfolio size and integration complexity. Plus €98,000 monthly subscription and—here’s the interesting part—an energy savings share model.
That savings share aligns our incentives with client outcomes in a way traditional software licensing doesn’t. If the system underperforms, we eat the shortfall.
If it outperforms, we share the upside.
The clients who understand value-based pricing love this structure.
The clients who just want to check a box for their board hate it.
We prefer the former.
What Happened When We Deployed
The pilot deployment covered a commercial real estate portfolio of 50 buildings totaling 2 million square feet. Mixed use—office towers, retail centers, a handful of light industrial facilities.
Year one results: 32% reduction in energy consumption. €8.4 million in direct savings. Carbon emissions dropped 4,200 tons annually, which matters increasingly for ESG reporting requirements that determine whether institutional investors will hold portfolio positions.
The system learned patterns that surprised everyone. One building’s HVAC operated most efficiently when pre-cooling started at 4:17 AM rather than the programmed 5:00 AM—something about thermal mass interaction with the eastern facade that no engineer had modeled. Another facility wasted 23% of its cooling energy fighting heat from server closets that weren’t on any official floor plan. The occupancy sensors revealed that Conference Room C on the 14th floor had been conditioned for 12 hours daily despite averaging 47 minutes of actual use.
These aren’t insights a rule-based system discovers. They emerge from learning algorithms that explore the state space of building operation with patience no human possesses.
The Honest Failures
Month two nearly killed the project.
The reinforcement learning agent, optimizing for energy efficiency without sufficient constraint on comfort, discovered that it could reduce consumption 8% by maintaining temperatures exactly at the edge of acceptable ranges. Technically within policy.
Practically miserable.
We received 340 comfort complaints in a single week.
We rebuilt the reward function to weight comfort violations exponentially rather than linearly. The new formulation made the agent treat comfort boundaries as walls rather than suggestions. Consumption savings dropped to 29% for two weeks while the system relearned, then recovered to 31% with near-zero complaints.
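The linear-versus-exponential distinction is easiest to see in a toy reward function. The weights and penalty base below are illustrative, not the production formulation, but they show why an exponential penalty turns comfort boundaries into walls: each additional degree of violation multiplies the penalty rather than adding to it.

```python
def reward(energy_kwh, comfort_violation_f, energy_weight=1.0, penalty_base=2.0):
    """Reward = negative energy cost minus an exponential comfort penalty.

    Zero violation costs nothing; each extra degree of violation doubles
    the penalty (with penalty_base=2), so skating the edge of the
    acceptable range stops being profitable. Values are illustrative.
    """
    if comfort_violation_f > 0:
        penalty = penalty_base ** comfort_violation_f - 1.0
    else:
        penalty = 0.0
    return -energy_weight * energy_kwh - penalty
```

With a linear penalty, a 3°F violation costs three times a 1°F violation; here it costs seven times as much, so the agent learns to stay well inside the band rather than hugging its edge.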
The occupancy sensors initially struggled with open-plan offices where people cluster unpredictably. Edge cases the training data didn’t cover. We added synthetic training examples and deployed model updates that improved accuracy from 78% to 94% over six weeks.
The LLM policy layer occasionally generated explanations that were technically accurate but incomprehensible to facilities staff. “Initiated pre-cooling sequence to exploit arbitrage opportunity in time-of-use tariff structure while maintaining thermal gradient below comfort threshold” means something to an engineer but nothing to someone trying to understand why the building got cold at 3 AM. We added a translation layer that renders explanations in conversational English.
Every one of these problems was solvable.
None of them appeared in the pilot plan.
That’s the nature of deploying AI systems in physical environments—the edge cases are where reality diverges from simulation.
The Hybrid Architecture Decision
Data sovereignty requirements shaped our deployment architecture significantly.
Building occupancy data stays on-premise. Always. The privacy-preserving sensors process video locally, extract counts, and discard the original imagery before anything leaves the edge device. No tenant data, no individual identification, no legal liability for occupancy surveillance.
Weather forecasts and grid pricing data come from cloud APIs—public information that creates no privacy exposure. The reinforcement learning models train on anonymized consumption patterns that can safely transit external infrastructure.
Claude Sonnet 4 for policy generation runs through Anthropic’s API, but we carefully control what context the model receives. Building identifiers are stripped. Tenant information is never included.
The model sees consumption patterns and optimization decisions, not anything that could identify specific facilities or their occupants.
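The identifier-stripping step can be sketched as a redaction pass applied to any text before it leaves the building network. The identifier format (`BLDG-###`) and the regex rules here are hypothetical; a real deployment would use a vetted redaction pipeline rather than two patterns.

```python
import re

# Assumed identifier formats for illustration only
BUILDING_ID = re.compile(r"\bBLDG-\d+\b")
TENANT_NAME = re.compile(r"\btenant:\s*\S+", re.IGNORECASE)

def strip_context(text):
    """Redact building identifiers and tenant references before any API call."""
    text = BUILDING_ID.sub("[building]", text)
    text = TENANT_NAME.sub("tenant: [redacted]", text)
    return text
```

The consumption figures and optimization decisions pass through untouched — which is exactly the division described above: the model gets the patterns, never the identities.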
This architecture added €25,000 to development costs and creates ongoing operational complexity. Worth it. Enterprise clients don’t sign contracts that put their tenant data on external servers.
What Actually Matters
The technology works. The economics work. The implementation complexity is manageable for teams that have done it before.
But here’s what actually determines success or failure: organizational readiness to let AI systems make decisions that humans used to make.
The facilities manager who has set HVAC schedules for 15 years doesn’t welcome a system that changes his setpoints every five minutes based on math he doesn’t understand. The CFO who approved the building management system five years ago doesn’t love explaining to the board why it’s being replaced by something with “autonomous” in the name.
Change management consumed more project resources than algorithm development.
Not because the people were wrong to question—healthy skepticism about AI systems is appropriate—but because building trust requires time and demonstrated performance that no amount of prior explanation accelerates.
We now budget 30% of implementation hours for stakeholder alignment. The projects that succeed are the ones where facilities leadership becomes advocates rather than obstacles. That transformation requires showing them the system works, letting them override decisions when they disagree, and proving that their expertise is augmented rather than replaced.
The Market Reality
DeepMind proved autonomous building optimization was possible.
Google captured the benefits internally.
The question for the rest of the market was whether anyone would productize the approach.
That productization is happening now. We’re one of several serious players, though I’d argue our hybrid architecture and LLM policy layer create differentiation that competitors lack.
The market will determine whether that differentiation matters.
What I know for certain: buildings will get smarter. Energy will get more expensive. The gap between optimized and unoptimized facilities will widen until the financial pressure becomes irresistible.
The question isn’t whether autonomous energy optimization happens.
The question is whether you’re capturing the savings or competing against someone who is.