skip to content
Stephen Van Tran
Table of Contents

The bottleneck is no longer the GPU

The AI race just moved from the model card to the substation.

Federal regulators voted on Thursday to force the grid to make room for the AI boom. The Federal Energy Regulatory Commission ordered six regional grid operators to speed the way data centers, factories, and other enormous power users connect to the transmission system, a move the Associated Press described as a unanimous push to help energy-hungry AI data centers reach the grid faster. This is not a niche utility docket. It is the moment Washington admitted that frontier AI is now a power-market problem.

The order gives the six regional operators a hard clock. As TechCrunch summarized it, grid operators have 30 days to report how much generation they can make available for large loads and 60 days to defend or revise tariffs for the biggest customers. That is regulatory language, but the strategy is blunt: do not let old interconnection queues decide whether the United States can deploy the compute needed for advanced AI.

The order lands because data centers have become the physical shadow of every AI announcement. A model release needs chips. Chips need racks. Racks need cooling. Cooling and inference need electricity all day, not only when the wind blows or a marketing deck says capacity exists. The Department of Energy has already warned that data centers could consume up to 9% of U.S. electricity generation by 2030, up from 4% of total load in 2023, in a clean-energy resource brief built around the same planning problem. The takeaway is simple: if power access is late, the model roadmap is late.

This changes how AI infrastructure should be valued. For two years, investors treated Nvidia allocation, cloud credits, and model talent as the scarce inputs. Those still matter. But the new scarce asset is a permitted, financed, grid-connected site with enough megawatts to host the next generation of training clusters and inference farms. EPRI’s 2026 data-center study projects that U.S. data centers could consume 9% to 17% of national electricity by 2030, up from roughly 4% to 5% today (EPRI). That range is wide because the future is still being wired, but even the low case is too large to treat as a marginal load.

The proprietary math is the signal. AP reports more than 4,000 data centers already operate in the United States, with another 3,000 planned or under construction. If only half of that 3,000-project pipeline reached commercial operation at an average 100 megawatts, the country would need roughly 150 gigawatts of additional connected load, before counting bigger gigawatt-scale AI campuses. Grid Strategies’ latest load-growth report says utilities already forecast 166 gigawatts of peak-load growth over the next five years, with about 90 gigawatts linked to data centers (Grid Strategies). The order is not a luxury lane. It is Washington trying to keep the pipeline from outrunning the road.

That matters for the software side of the market too. I wrote this week that frontier labs are becoming diplomatic actors as governments decide who gets access to strategic AI systems (internal). FERC’s order is the domestic version of the same thesis. Sovereignty is not only export controls, compute clusters, or model weights. It is whether a country can bring enough power online quickly enough to keep its model companies from turning into queue-management businesses.

The important inversion is that electricity is no longer a back-office input. It is becoming a visible feature of AI reliability, price, and geopolitical credibility. A lab that can promise customers stable capacity during peak demand has a different sales motion from one that must ration tokens because its next campus is stuck behind a substation study. That gap will show up first in enterprise service levels, then in model-roadmap confidence.

The first AI boom was cloud abstracted. The next one is utility exposed. When a chatbot answers a question, the user sees a sentence. The operator sees capacity reservations, transformer lead times, gas turbines, water permits, colocation contracts, and the politics of who pays for wires. FERC just made that buried stack visible.

Follow the electrons, find the moat

Compute wants to be digital. Power refuses.

The new order has two strategic parts. First, it asks grid operators to create cleaner, faster processes for studying and serving large loads. Second, it pressures them to prevent cost shifting, support co-location and behind-the-meter generation, create flexible transmission services, and study nearby generation for large users. The American Action Forum’s breakdown of the June 18 orders notes that FERC chose targeted show-cause orders instead of a slower national rulemaking, which could have taken years (AAF). The analytical takeaway is that speed is now part of industrial policy.

This is a new kind of AI moat. Model quality still compounds, but electricity access compounds differently. A company with a live 500-megawatt campus can ship capacity while a rival is still negotiating interconnection studies. A cloud provider with spare transformers can sell inference before a startup’s site host gets a substation upgrade. A lab with flexible loads can promise a utility it will curtail during grid stress, while a less mature buyer looks like a rigid burden. Power strategy becomes product strategy because latency, availability, and unit economics all move through the same breaker.

The timing also explains why AI companies are drifting toward self-supply. Bloom Energy’s 2026 Data Center Power Report says power availability has moved from a planning factor to a defining boundary on data-center growth, with onsite generation becoming part of long-term strategy rather than a temporary bridge (Bloom Energy). That is the quiet revolution inside the data-center boom. Operators are no longer asking only where they can lease space. They are asking where they can control energy.

The grid itself is already saturated with requests. Lawrence Berkeley National Laboratory’s interconnection queue tracker says that, as of the end of 2025, more than 2,060 gigawatts of generation and storage capacity was seeking connection to the grid, while only a fraction of requested capacity historically reaches commercial operation (Berkeley Lab). That is the painful asymmetry: new AI load can be announced in months, but the generation and transmission that serve it can spend years in queues. FERC is trying to speed the load side without magically solving the supply side.

The better operators will read that constraint as an invitation to redesign. Behind-the-meter generation, co-located power plants, flexible-load tariffs, and regional procurement discipline all become part of AI infrastructure planning. TechCrunch notes that FERC explicitly opened the door for grid operators to consider alternative transmission technologies, including solid-state transformers or superconducting transmission lines, and to be more accommodating to behind-the-meter power (TechCrunch). The near-term winner is not necessarily the company with the most adventurous model. It is the company with the least fictional power plan.

This is where the financing story gets interesting. A data center used to look like real estate with servers. In the AI era it looks more like a regulated infrastructure project with software margins attached. Utilities, grid operators, local governments, cloud buyers, chip suppliers, and model labs all sit inside the same risk stack. FirstEnergy recently asked FERC to require data centers to pay for transmission upgrades needed to bring them online rather than spreading those costs across existing customers (Utility Dive). That proposal is narrow, but the principle is broad: AI’s growth bill is now visible enough that every stakeholder wants to know whose name is on it.

The order also shifts leverage from abstract demand forecasts to concrete commitments. Utilities fear building wires for speculative projects that never materialize. Communities fear subsidizing loads that raise bills before they create local value. Data-center developers fear waiting years for power while competitors secure better sites. A good tariff can force deposits, milestones, curtailment terms, and cost transparency. A bad tariff can socialize risk while privatizing the AI upside.

The quantified takeaway is that the next frontier-model gap may be measured in months of energized capacity, not only benchmark points. If an AI campus takes five years to connect under old large-load processes and a fast-lane tariff cuts even 18 months from the timeline, that is 18 months of model training, inference revenue, customer onboarding, and operational data that a rival does not get. In software, 18 months is a product cycle. In AI infrastructure, it is a strategic epoch.

That is why the FERC move belongs in the same conversation as chip supply and model access. Nvidia still sells the shovel. Utilities sell the ground the shovel can stand on. Without the second, the first becomes a very expensive promise.

The grid can still say no

Fast lanes do not create roads.

The central weakness in FERC’s order is that it accelerates connection rules without creating generation capacity. AP makes the limit plain: the order can do little to address tightening energy supplies, rising bills, and blackout warnings as data-center construction outruns the pace of new power plants (AP). That is the first counterpoint to the bullish version of the story. Regulators can shorten paperwork. They cannot decree spare electrons into existence.

The second counterpoint is affordability. The public does not experience AI infrastructure as an elegant national strategy. It experiences it as a utility bill, a zoning meeting, a water fight, a noise complaint, or a field that becomes a fenced campus. The Belfer Center’s analysis of AI and the U.S. electric grid warns that insufficient regulation risks grid instability, higher consumer costs, reliance on high-emission energy sources, public backlash, and setbacks to climate goals (Belfer Center). The order tries to protect ratepayers, but ratepayer protection is where politics will concentrate.

The third counterpoint is forecasting error. Data-center demand is unusually opaque because tenants, workloads, chip densities, utilization rates, and site plans change quickly. The World Resources Institute argues that data-center electricity forecasts vary widely and that even respected estimates can diverge because the sector lacks transparent operating data (WRI). That opacity matters because grid infrastructure is expensive and durable. If utilities overbuild for speculative AI campuses, households may inherit stranded costs. If they underbuild, labs lose the capacity race.

The fourth counterpoint is local consent. A national regulator can signal that AI data centers are strategic. County commissions can still make projects slow, ugly, and politically expensive. AP notes a growing backlash against data centers over power, water, noise, pollution, and land use. This backlash will not behave like partisan climate politics. A conservative rural county that wants jobs may still reject a project that strains wells, raises rates, or changes land values. A progressive city that wants clean technology may still object to gas-fired backup power.

The fifth counterpoint is emissions. AI companies often promise clean power, but grid reality is regional and temporal. A hyperscale site can buy renewable credits while its incremental load still pulls from gas-heavy generation during local peaks. Recent research on U.S. hyperscale data centers estimated that 403 facilities operating between May 2024 and April 2025 consumed 68 to 99 terawatt-hours and were associated with 37 to 54 million metric tons of CO2 across load scenarios (arXiv). The analytical point is not that AI should stop. It is that electricity accounting will become a reputational risk when claims meet hourly grid data.

The sixth counterpoint is legal durability. FERC is carefully preserving state authority over retail rates, siting, and generation choices, because that is where a blunt federal takeover would invite litigation. AAF reads the order as a targeted preemption strategy meant to maximize legal durability while pressuring regional operators to act quickly. That is clever, but it also means the result will be uneven. PJM, MISO, SPP, CAISO, ISO New England, and NYISO will not produce one national AI-grid regime. They will produce six regional answers shaped by different politics, market designs, and resource mixes.

The final counterpoint is that fast connection can reward the wrong projects. If every data center frames itself as critical AI infrastructure, regulators must distinguish between genuinely strategic workloads and speculative land banks with transformer requests attached. A bad fast lane becomes a stampede. A good fast lane filters for committed capital, flexible demand, credible power supply, and local cost discipline. The technology industry is accustomed to moving first and explaining later. The grid cannot run on that habit.

This is the tension operators should not dodge. The United States needs more AI infrastructure if it wants to keep frontier models, advanced agents, defense systems, drug-discovery platforms, and scientific compute at home. It also needs an electric system that does not quietly transfer the cost of that ambition to households least able to arbitrage the upside. The FERC order is important because it brings the fight into the open.

What operators should wire before the breaker trips

The winning AI company will sound a little like a utility.

That does not mean labs should become power companies. It means the operating cadence has to change. AI leaders can no longer treat energy as a procurement line hidden below chips and cloud. They need board-level power strategy, regional siting discipline, tariff literacy, and a public story that can survive contact with local ratepayers. In the next phase, the credible AI operator will be able to explain not only what model it is training, but what grid it is touching.

The operator checklist starts with site truth. Do not count announced megawatts as available megawatts. Separate land control, interconnection status, utility service agreements, substation equipment, generation commitments, water rights, and community approvals. A data-center plan with three missing pieces is not a plan; it is a press release with concrete footings.

  • Map every AI workload to a power class: interruptible training, firm inference, latency-sensitive enterprise service, research batch work, and emergency failover.
  • Build a regional tariff view before signing leases, especially in PJM, MISO, SPP, CAISO, ISO-NE, and NYISO, because FERC’s 60-day clock will produce different market answers.
  • Treat curtailment as a product feature where possible. Flexible training loads can be cheaper, easier to interconnect, and politically more defensible than rigid all-hours demand.
  • Reserve transformer, switchgear, and skilled-labor capacity as carefully as GPU capacity. The bottleneck is not always the chip.
  • Publish power claims with enough specificity to withstand scrutiny: region, timeline, load profile, emissions method, and who pays for upgrades.
  • Model community risk before incentives. A tax break is useless if local opposition turns the site into a two-year hearing.

The investor checklist is equally blunt. Stop underwriting AI capacity from fundraising headlines alone. Ask which megawatts are energized, which are contracted, which are speculative, and which depend on tariffs that may change within months. Ask whether the company can shift workloads across regions when one grid tightens. Ask whether its gross margin assumes cheap firm power that does not exist. The balance sheet may say software. The risk profile may say infrastructure.

The policy checklist is harder. Regulators should reward flexible loads, transparent cost allocation, and credible local benefits. They should punish phantom queues, vague demand forecasts, and projects that externalize grid costs while advertising national-security urgency. AI infrastructure is strategic, but strategic does not mean exempt from arithmetic. The grid is a public platform. It should not become a private subsidy machine for whoever can say “frontier model” loudest.

There is a better path. The United States can build more generation, modernize transmission, accelerate clean firm power, make interconnection less absurd, and let data-center operators pay fairly for the capacity they need. It can also require the AI sector to bring flexibility to the grid rather than only demand. Training jobs can move in time. Some inference workloads can move in place. Backup systems can become grid assets instead of isolated insurance. The best AI infrastructure will not merely consume power. It will help the system manage power.

FERC’s order should therefore be read as a starting pistol, not a finish line. The easy interpretation is that AI data centers just won a fast lane. The more serious interpretation is that AI has become large enough to force a redesign of American electricity governance. That is what happens when an industry graduates from app layer to load forecast.

For frontier labs, the message is severe and useful. Models no longer scale in a vacuum. The next breakthrough may depend on a county hearing, a transformer delivery, a capacity auction, or a tariff filing with a name only energy lawyers recognize. The AI era promised intelligence on demand. Now demand is asking the grid for permission.

In other news

OpenAI raids Google’s model bench - OpenAI added Noam Shazeer, a Gemini co-lead and Character.AI founder, after Google had reportedly spent $2.7 billion to bring him and parts of Character.AI back in 2024 (TechCrunch). The move turns AI talent into IPO signaling, not just research hiring.

Amazon eyes Trainium outside AWS - AWS is discussing direct sales of its Trainium AI chips to other data centers, while Andy Jassy has framed the chip business as a potential $50 billion annual run-rate opportunity (TechCrunch). The strategic wrinkle is that selling chips could weaken AWS’s own capacity advantage if supply stays tight.

Snap spins out its AI video team - Snap is turning an internal generative AI video group into Dotmo, a separate company focused on models for interactive gaming experiences (TechCrunch). That is a cost-control story and a reminder that consumer AI features still need independent economics.

Midjourney moves toward medical imaging - Midjourney Medical says it is working on full-body ultrasound scans, a striking pivot from image generation toward diagnostic hardware and clinical workflows (The Verge). The claim is bold enough that validation, regulation, and liability will matter more than demo quality.

Sanders files the AI wealth-fund bill - Sen. Bernie Sanders introduced the American AI Sovereign Wealth Fund Act, proposing a one-time 50% stock tax on the largest AI companies to create a roughly $7 trillion public fund (Sanders Senate). Even if the bill stalls, it shows that AI upside sharing is moving from think tank thought experiment to legislative text.