skip to content
Stephen Van Tran
Table of Contents

The shock that arrived through a different door

The AI industry has spent eighteen months bracing for the wrong kind of disruption. The fear was always a benchmark surprise out of a Chinese lab, an EU rule that throttled training data, a hyperscaler that swung the price of inference. The shock that actually landed this spring came through a much older door — an oil-and-gas facility on the Qatari coast, a closed waterway in the Persian Gulf, and a noble gas that costs a few dollars a liter when it is available and is currently not. Per CNBC’s May 19 dispatch on how the Iran war is exposing weak spots in the AI supply chain, the conflict has now run long enough that the second- and third-order impacts on component costs, vendor margins, and overall AI data center economics are no longer hypothetical. TSMC and SK Hynix have both told investors that supply has held — for now. Almost everyone else in the chain is rerouting, repricing, or rationing.

The thesis of this post is narrow and concrete. The Iran war is not an exogenous shock to AI; it is an endogenous tax. Every cluster of H100, Blackwell, and Rubin-class GPUs being installed this year passes through a manufacturing chain that depends on helium, neon, krypton, and shipping lanes that the war has degraded. Per CNBC’s coverage of the Iran missile strike on Qatar’s Ras Laffan complex, Qatar’s Ras Laffan industrial complex — the source of roughly 30 to 38 percent of the world’s helium — was knocked offline by Iranian missile strikes in mid-March, and the Strait of Hormuz has been effectively closed to commercial shipping since March 4. Per an OilPrice analysis of the helium shock, up to 35 percent of global helium supply is now offline and roughly one-third of the world’s cryogenic helium ISO containers are stranded in or around Qatar with no viable maritime route to market. Spot helium prices have surged 70 to 100 percent and long-term contracts have moved up by as much as 40 percent. None of this is in the GPU sticker price yet. It will be.

The stakes are bigger than a single quarter of slimmer Nvidia margins. The AI capex story of 2026 — Nvidia at a multi-trillion-dollar market cap, hyperscaler spend tracking toward $400 billion combined, and the AI infrastructure trade pulling roughly a third of the S&P 500’s earnings upside with it — assumes that the supply side of the equation is essentially price-elastic. Order more wafers, pay for more reticles, light up more fabs. That assumption breaks the moment a critical input has no substitute and a damaged supplier that may need years to rebuild. Per The American Prospect’s analysis of how the Iran war threatens the AI economy, the supply-chain choke points exposed by the conflict are exactly the kind that geopolitical-risk models tend to underweight: not the leading-edge lithography monopolies that everyone watches, but the boring industrial gases and shipping lanes that move silently underneath them.

The market has begun to price some of this in. Per the Carnegie Endowment’s framing of the Iran war as a semiconductor problem, South Korea’s stock market dropped 18 percent in four trading days in early March, wiping out more than $500 billion in market value, with Samsung and SK Hynix each shedding more than 20 percent in two sessions. Per a Motley Fool retrospective on the damage already done, even an immediate ceasefire would not erase the physical damage at Ras Laffan; recovery estimates have stretched to three to five years. The market is correct to take the equity hit. The mistake would be to assume the worst is in the rear-view mirror. The AI infrastructure trade is now path-dependent on a war that nobody on the cap table has any leverage over, and the cost of that dependency is only beginning to show up on income statements.

Helium, Hormuz, and the wafer math

The mechanics here repay close attention because the AI press cycle has, predictably, undersold them. The choke points are not glamorous. They are gas plants and tanker routes and a memory subsystem that nobody outside semiconductor manufacturing thinks about until it breaks.

Helium first. Per Tom’s Hardware’s reporting on the global helium shortage, helium is used in advanced semiconductor fabs to cool silicon wafers during chemical etching, to purify clean-room atmospheres, and to detect microscopic leaks in the ultra-clean environments where advanced chips are made. There is no substitute. EUV lithography systems, which produce every leading-edge AI chip below 7nm, depend on helium for thermal management and ambient control inside the scanner. Per TrendForce’s analysis of the helium crunch on South Korean memory makers, Samsung and SK Hynix produce roughly two-thirds of the world’s memory chips — including the high-bandwidth memory that every Nvidia AI GPU depends on — and South Korea sources a significant share of its industrial helium from Qatar and the broader Gulf. When the gas tightens, the HBM tightens. When HBM tightens, the Nvidia GPU production curve does not stay flat.

The other gases matter too. Per a J2 Sourcing rundown of the global helium crisis and adjacent inputs, the chip industry’s exposure to disruption is not limited to helium — neon, krypton, and certain fluorine-based etching gases all see price pressure when energy costs spike and shipping lanes close. The Russia-Ukraine war already taught the industry what happens when neon supply gets concentrated; the current war has stress-tested helium in much the same way. Per Digitimes’ note on the chipmakers’ scramble, spot helium prices were reported up roughly 50 percent within weeks of the strikes; the more recent OilPrice and supplier data suggest those gains have widened materially. Per an analysis of supply chain disruption in 2026 published by Carra Globe, even fabs with on-site helium recovery systems — which capture and reuse 80 to 90 percent of the gas in each cycle — eventually need top-up volume that has to come from somewhere, and most of those somewheres are now constrained.

Then there is the strait. The Strait of Hormuz carries roughly 20 percent of global oil consumption and a non-trivial share of LNG flows. Per Fortune’s reporting on Asia’s AI playbook getting a Hormuz-shaped reality check, South Korea imports roughly 70 percent of its crude oil through the strait, with fossil fuels accounting for more than 78 percent of the country’s energy mix. Samsung and SK Hynix fabs are among the most power-hungry industrial operations on earth. When energy prices spike — South Korean industrial power tariffs have risen sharply since March — the marginal cost of every wafer rises with them. Per the Carnegie Endowment’s deeper look at the Korea-Hormuz energy linkage, the Korean fab cluster is the single most concentrated piece of memory-and-HBM capacity on the planet, and it sits at the end of a fuel supply line that runs through the world’s most contested waterway.

The downstream effects are starting to show in chemical and component pricing. Per a TechInsights update on the helium disconnect, gas suppliers are quietly putting force-majeure clauses on the table and rerouting where they can, but the cryogenic logistics for helium are particularly unforgiving: helium ships in specialty ISO containers that have to be moved by sea, and the global fleet of those containers is limited. Per an Entrepreneur explainer on what a helium shortage means for chip fabs, the most exposed subsegments are the leading-edge nodes — 5nm, 3nm, and the 2nm chips that Nvidia’s next-generation roadmap depends on — because their tolerance for atmospheric contamination is the tightest. The further down the node curve the industry pushes, the more sensitive it becomes to exactly the kind of input it has the least control over.

The economic math is unflattering. Per a Motley Fool note on helium-driven reshoring, a prolonged shortage would drive up helium prices and could force TSMC to slow production, which would in turn affect Nvidia’s ability to launch new products or sustain margins. TSMC’s on-site recycling recovers 80 to 90 percent of the helium it uses, the company has diversified supplier contracts, and it maintains inventories. Those are the right buffers. They are still buffers — they buy time, not immunity. The wafer-cost arithmetic for the rest of the chain is harsher: at current spot pricing, an additional $30 to $80 per advanced-node wafer is plausible for non-TSMC fabs without recovery infrastructure, and HBM yields are sensitive to gas purity in ways that compound through the memory packaging step. Multiply that across the roughly 12 to 15 million advanced wafers expected to flow through the AI supply chain in 2026, and the rough order of magnitude is hundreds of millions of dollars in input-cost drag — a price the buyer pays even if the war ends tomorrow. My estimate, stitched across the price-surge data points in the OilPrice piece and the Digitimes scramble report, is that the helium tax alone runs $0.5 to $1.2 billion in AI infrastructure cost for the calendar year. That is not catastrophic on its own. It is also not the only tax in this bill.

The geopolitical overlay sharpens the point. Per Tom’s Hardware’s coverage of Iran’s direct strike threats against major US tech firms, Iran has publicly threatened Nvidia, Microsoft, Apple, Google, and roughly a dozen other US tech companies in retaliation for the broader conflict. The threats may be largely posturing — Iran’s reach into US corporate facilities is constrained — but the fact that they were issued at all has tightened security postures, insurance pricing, and corporate-travel calculus across the industry. The cost of operating an AI infrastructure footprint in mid-2026 is therefore not just helium spot prices and Korean industrial power tariffs; it is the cumulative friction of running a multi-continent supply chain under conditions of active geopolitical hostility. That friction shows up in capex schedules, in insurance line items, and ultimately in the unit economics of every model serving cluster being stood up this year.

Reasons the panic curve might flatten

The reasonable counter to all of this starts with the boring observation that the chip industry has weathered shocks before and the AI demand curve has, so far, swallowed every cost increment thrown at it. There is a serious case that the price action is overstated and the operational pain is manageable. Three threads of that argument deserve careful airing.

The first is that TSMC’s buffers are real and probably underappreciated. Per the Globe and Mail’s republication of the Motley Fool’s reshoring analysis, TSMC has spent the last decade engineering exactly this contingency. Its on-site helium recovery captures 80 to 90 percent of usage per cycle, its supplier contracts are diversified across Qatar, Algeria, and US sources, and its inventory practice is conservative even by Taiwanese-industrial standards. The company has publicly told investors that supply chain disruptions have not yet impacted financial results, and the Q1 2026 earnings print supported that claim. If TSMC holds the line on advanced-node output, Nvidia’s GPU production curve stays largely intact, and the AI capex story rides through the shock without the catastrophic margin compression that the helium spot-price screens imply. The right way to read TSMC’s posture is not that the problem is fake — it is that the most important link in the chain is also the most prepared.

The second counterpoint is that helium price elasticity at the chip-fab level is lower than the spot market suggests. Per a J2 Sourcing analysis of helium contract dynamics in the semiconductor chain, most fabs purchase helium under long-term contracts whose prices move on a quarterly or annual schedule, not in real time. Spot prices can spike 100 percent without the average fab seeing more than a 20 to 40 percent input-cost increase, because the contract market absorbs the variance. Combined with on-site recovery, that means the realized cost-of-goods impact in 2026 is materially smaller than the screaming headlines suggest. The path the bears need is a multi-quarter sustained price level, not a temporary spike, and that requires either the war to drag on through year-end or the Ras Laffan facility to remain offline beyond the most aggressive recovery scenarios. Both are plausible. Neither is guaranteed.

The third and most consequential counterpoint is that the demand side of the AI equation has so much slack that buyers will simply absorb the cost. Per a Morningstar piece on Iran war risks to AI chip supply and critical minerals, the projected 2026 to 2027 demand for AI accelerators is so far in excess of confirmed supply — by some estimates, two to three times confirmed wafer allocations — that any price increase short of a doubling would still leave buyers competing aggressively for output. In that demand environment, helium-induced wafer-cost inflation gets passed straight through to hyperscalers, then to enterprise customers, then to the tokens consumed by every downstream AI product. The risk is not that AI capex collapses; the risk is that the price of inference rises faster than the productivity gains rise to absorb it. Counterintuitively, that is a manageable problem at the system level even if it is a brutal one for individual margin lines.

The skeptic case also has to acknowledge the secular trends working against it. Per a Carra Globe analysis of how the helium crisis is hitting chip fabs, the industry was already on a long-term helium tightening curve before the war — driven by federal helium reserve drawdowns in the US, geological constraints on new production, and rising demand from medical imaging and other uses. The war did not invent the helium problem; it accelerated and exposed one that the chip industry had been quietly absorbing. That cuts both ways. The optimistic read is that the industry’s structural adaptation — on-site recovery, contract diversification, helium-light process development — was already in motion and will continue to compound. The pessimistic read is that the war has telescoped a decade’s worth of helium-supply risk into a single year, and the industry’s existing adaptation is not yet fast enough to cover the gap.

Reading all three counterpoints together, the most honest synthesis is this: the helium tax on AI is real but probably ranges from “annoying” to “painful,” not “catastrophic.” The price of GPUs and inference will move up. The pace of capacity additions will get marginally compressed. The leading-edge node roadmap will not snap. The companies that get hurt most are not the marquee names — Nvidia, TSMC, Microsoft, Google — but the smaller fabs, the second-tier memory makers, and the AI infrastructure projects in regions with the thinnest helium-supply diversification. Per The Motley Fool’s broader take on Nvidia’s exposure, even an extended conflict scenario leaves Nvidia’s earnings trajectory largely intact, with the principal effect being on customer mix and margin variance rather than headline revenue. The bear case requires the war to widen materially or the helium recovery curve to stall well past 2027. Those are real risks. They are not the base case.

Playing AI capex in a Hormuz-shaped world

Where this leads is straightforward to articulate and hard to execute. The AI industry has discovered, mid-buildout, that its physical supply chain is more fragile than the financial supply chain that funds it. Capital is fungible; helium and HBM are not. The strategic response will play out in three places: industrial-gas reshoring, leading-edge node geographic dispersion, and the unglamorous work of inventory and contract hardening across every link from gas plant to GPU.

The most visible response will be reshoring. Per the Motley Fool’s reshoring prediction, the helium crunch is already pulling forward the case for North American and European industrial-gas independence — and, by extension, for the leading-edge fab construction that depends on local gas access. Arizona, Ohio, and the European Chips Act program become structurally more attractive in a world where Hormuz can close on a quarter’s notice. The Korean fab cluster, conversely, faces a hard look from its largest customers about geographic concentration risk. Expect Samsung’s Texas fab and SK Hynix’s American footprint to get the kind of executive attention this year that they did not get last year. The compounding effect over five years could be one of the biggest secular shifts in chip-supply geography since the original rise of Taiwan and Korea as fabrication hubs.

The second response is technical: helium-light process engineering, deeper on-site recovery, and selective node-level innovation that reduces gas dependence. Those are not glamorous lines on a slide deck, but they are where the durable advantage will accumulate. Per the Semiconductors Insight summary of the Iran war’s semiconductor impact, the foundries with the most aggressive recovery and substitution programs are now the ones with the lowest cost-curve exposure to the helium crunch — and by extension the most pricing power with Nvidia and the hyperscalers. The reward for boring industrial discipline has never been higher.

The third response is financial — and it is the one that AI operators outside the chip industry actually have some leverage over. Long-dated capacity contracts, multi-source gas commitments, region-diversified compute, and explicit contingency budgeting for input-cost inflation are the levers. The companies that have been quietly building those into their 2026 to 2028 plans look smart this morning. The companies that have not are exposed to a surprise that has, in fact, already happened — they just haven’t priced it yet.

A short operator checklist for anyone whose 2026 P&L depends on AI compute:

  • Stress-test your training and inference budgets against a 15 to 30 percent rise in unit GPU and inference prices through 2026 — this is the band implied by current helium pricing and South Korean energy tariffs, before any further conflict escalation. The cost is not theoretical; it is being absorbed somewhere in your stack already.
  • Reread your cloud and chip contracts for force-majeure language, supply-allocation triggers, and price-adjustment mechanisms. Many AI-era contracts were drafted in a pre-Hormuz world. The carve-outs may be wider than you remember.
  • Map your geographic concentration risk for both training and inference. If more than 60 percent of your compute footprint sits in a single national jurisdiction or relies on a single chip-manufacturing geography, that is now a board-level item, not an IT item.
  • Push your model selection and inference-stack decisions toward efficiency. Per the analysis I laid out in my May 13 piece on Google’s first AI-built zero-day and the security surface area of frontier models, the marginal value of a smaller, more capable model is higher than ever — and that math improves further when GPU input costs rise.
  • Reconsider on-prem and hybrid deployments where data gravity supports it. The May 19 OpenAI–Dell Codex partnership is the new template — bringing model capability to enterprise infrastructure rather than the reverse — and the Iran war makes the case for hybrid topology stronger, not weaker.
  • Track helium spot pricing and Korean industrial power tariffs as leading indicators. They are the early-warning system for the next leg of AI input-cost inflation, and they move weeks before the GPU price increases that follow them.
  • Build a real Bill of Materials for your AI infrastructure that includes not just chips and electricity but the industrial gases, water, and shipping lanes underneath them. If you cannot describe your second- and third-tier supply chain, you do not actually understand your cost structure.

The deeper takeaway is the most uncomfortable one. The AI capex thesis was built on the assumption that the world would remain stable enough for the physical supply chain to scale at the pace the financial markets demanded. That assumption has broken in 2026 in a way that was not on the bingo card a year ago. The companies that survive the next phase will be the ones that internalize a simpler, older truth: every AI model is, in the end, a piece of physical infrastructure. The chips are made of silicon. The silicon is etched in helium. The helium comes from Qatar. And Qatar, this year, has a problem.

In other news

Amazon ships AI-generated podcasts inside Alexa+ — Amazon rolled out Alexa Podcasts on May 18, giving Alexa+ subscribers the ability to generate full podcast-style episodes on any topic in minutes via AI-narrated co-hosts; the feature draws on agreements with AP, Reuters, The Washington Post, and 200-plus local outlets, per TechCrunch’s coverage of the launch. The move sharpens Alexa+‘s positioning against Google’s Notebook LM and OpenAI’s audio products, with Amazon leveraging Prime distribution as its wedge.

OpenAI–Dell Codex partnership lands Codex on-prem — OpenAI and Dell Technologies announced on May 18 a partnership to deploy Codex across hybrid and on-premises enterprise environments via the Dell AI Data Platform and Dell AI Factory; OpenAI said Codex now has more than 4 million weekly developers, per the official OpenAI announcement. The deal is OpenAI’s first explicit hybrid-and-on-prem distribution play and a direct response to Anthropic’s enterprise traction.

Mozilla expands Firefox’s Shake to Summarize to Android — Per Mozilla’s official blog post on the expansion, Firefox’s Shake to Summarize landed on Android on May 19 in English, with more languages coming. The Mistral-powered cloud feature summarizes any page under 5,000 words on a shake gesture, extending a tool that earned a special mention in TIME’s Best Inventions of 2025.

xAI’s Memphis Colossus 2 enters Phase 2 buildout — Per a Tech Startups roundup of the day’s tech news, xAI continues to expand its Memphis data-center footprint as part of the Colossus 2 buildout supporting Grok and the broader xAI product line; the project is on track to be among the largest single-site AI training facilities in the US once complete.

Anthropic’s Project Glasswing pulls in industry’s biggest defenders — Per Anthropic’s Project Glasswing page, the program now lists AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks as launch partners, with $100 million in Claude credits behind the effort to harden critical software against the same class of zero-day discovery capability Anthropic has demonstrated internally.