skip to content
Stephen Van Tran
Table of Contents

The API was never the endgame

OpenAI spent three years selling intelligence by the token. Now it wants to sell intelligence by the employee. On May 12 the company announced the OpenAI Deployment Company, a majority-owned subsidiary backed by more than $4 billion in initial capital and valued at roughly $14 billion post-money, whose entire purpose is to plant OpenAI engineers inside the largest organizations on the planet and rebuild their workflows around GPT. The model is not licensing. It is not SaaS. It is embedding — Forward Deployed Engineers sitting at your desk, reading your Jira tickets, auditing your data pipelines, and wiring AI into the processes that actually run your business. If that sounds like what Palantir has done for two decades, it should. Per Axios’s reporting on the deal, this is OpenAI’s explicit attempt to replicate the Palantir model at a scale and speed that Palantir itself never managed in its early years.

The thesis is straightforward and aggressive. API revenue is a commodity play. Every foundation-model lab can sell tokens, and margins are compressing as open-weight alternatives from Meta, Mistral, and the Chinese labs close the capability gap on standard benchmarks. The real margin is in deployment — the messy, expensive, human-intensive work of taking a model from a demo environment into a production system that handles payroll, underwrites insurance policies, or routes ten million customer-service interactions a month. OpenAI has watched roughly 90 percent of enterprise AI use cases die in pilot purgatory, per a McKinsey analysis of enterprise AI scaling challenges, and has concluded that the bottleneck is not model capability but organizational absorption. DeployCo is their answer: if the enterprise cannot come to the model, the model’s engineers will come to the enterprise.

The stakes dwarf the subsidiary’s own valuation. OpenAI hit $25 billion in annualized revenue by February 2026, roughly $2 billion a month, with more than nine million paying business users and coverage across 92 percent of the Fortune 500. But most of that revenue is seat-based ChatGPT subscriptions and API calls — transactional, defensible only as long as OpenAI maintains a model-quality edge, and vulnerable the moment a competitor ships a model that is good enough for the same tasks at half the price. DeployCo is the moat-deepening play. Once your critical workflows are co-developed with OpenAI engineers, switching costs stop being about API endpoints and start being about institutional knowledge, retrained staff, and rewritten systems of record. Sam Altman is not selling software. He is selling dependency.

The first concrete asset is the acquisition of Tomoro, an applied AI consulting firm headquartered in London with offices in Edinburgh, Manchester, Singapore, Sydney, and Melbourne. Tomoro brings roughly 150 experienced Forward Deployed Engineers and Deployment Specialists to DeployCo on day one — professionals who have already delivered production AI systems for Fidelity International, Virgin Atlantic, Tesco, the NBA, Red Bull, and Supercell, where Tomoro shipped an in-game support agent serving 110 million users in just twelve weeks. That client list is the pitch deck. It tells every Fortune 500 CFO that these engineers have done this before, at scale, in production, with real revenue at stake.

$4 billion to sit in your chair

The capital structure of DeployCo is worth close scrutiny because it reveals what OpenAI actually thinks this business is — and who it thinks will lose.

The $4 billion raise launched at a $10 billion pre-money valuation, reaching roughly $14 billion post-money once the capital is in. Per PitchBook’s analysis of the deal, OpenAI’s own contribution is a $500 million equity commitment at close with an option to inject another $1 billion, ensuring majority control. Investors receive a guaranteed minimum 17.5 percent return with profits capped beyond that threshold — a structure more typical of infrastructure private equity than of a software startup, signaling that OpenAI and its partners view deployment revenue as steady, recurring, and largely derisked by the stickiness of embedded engineering relationships.

The partnership roster reads like a who’s-who of global capital. TPG leads the round, with Advent International, Bain Capital, and Brookfield as co-lead founding partners. B Capital, BBVA, Emergence Capital, Goanna, Goldman Sachs, SoftBank Corp., Warburg Pincus, and WCAS fill out the founding investor bench. But the names that matter most for understanding the strategic logic are three that sit uncomfortably among the rest: Bain & Company, Capgemini, and McKinsey & Company. These are the legacy management consultancies whose enterprise integration work DeployCo most directly threatens. The charitable reading is that they are buying optionality — paying for a seat at the table so they can shape how the new model interacts with their existing client relationships. The harsher reading, and the one the market seems to prefer, is that OpenAI persuaded them to help fund their own displacement.

The Palantir comparison is not hyperbolic. Palantir Technologies invented the Forward Deployed Engineer model — embedding software engineers at client sites to customize its Gotham and Foundry platforms for specific operational environments. Per CNBC’s coverage of Palantir’s Q1 2026 earnings, that model has now propelled Palantir to a $350 billion market capitalization, Q1 revenue of $1.63 billion growing 85 percent year over year (the fastest rate since its 2020 direct listing), and full-year 2026 revenue guidance of $7.65 billion with U.S. commercial growth tracking at 120 percent. Revenue per employee has reached $1.5 million annualized. These are the numbers OpenAI is chasing. But there is a critical difference in starting position. Palantir spent its first decade selling almost exclusively to intelligence agencies and defense departments — customers with enormous budgets, high tolerance for bespoke work, and near-zero price sensitivity. OpenAI is starting with commercial enterprise clients who compare consulting fees against quarterly earnings targets and who have, in many cases, already been burned by AI pilots that promised transformation and delivered a chatbot.

The global AI consulting services market sits at roughly $14 billion in 2026 and is projected to grow at a 27.3 percent compound annual rate through 2035, per industry estimates. Accenture alone tripled its advanced AI revenue to $2.7 billion in fiscal 2025 and committed $3 billion to expanding its Data & AI practice, planning to double its AI workforce to 80,000 specialists. Those are the incumbents. OpenAI is not entering a greenfield market. It is entering a market that already has entrenched players with deep client relationships, tens of thousands of deployed consultants, and the institutional knowledge that comes from years of sitting in the same chairs that OpenAI’s Forward Deployed Engineers now want to occupy. The bet is that model expertise — knowing how to coax production-grade reliability out of frontier systems — is more valuable than domain expertise, and that the incumbents cannot acquire it fast enough to defend their positions.

The early data is suggestive. Per OpenAI’s blog post announcing the launch, DeployCo’s investment and consulting partners collectively sponsor more than 2,000 businesses worldwide, and its consulting and integrator partners work with many thousands more. Those relationships are the distribution channel. The question is whether OpenAI can convert channel access into long-term deployment contracts before the consultancies realize that the product they are distributing is a substitute for their own teams. Meanwhile, OpenAI has also been striking multiyear deals with Accenture, BCG, Capgemini, and McKinsey through its Frontier program, giving these firms early access to new models and fine-tuning capabilities in exchange for enterprise distribution. The genius of the strategy is that the consultancies are simultaneously DeployCo investors, Frontier partners, and future competitive casualties. OpenAI has them surrounded.

Three consultancies are funding their own obituary

The counterarguments to the DeployCo thesis are serious, and the most persuasive ones come not from OpenAI’s competitors but from the structural challenges of the business itself.

Start with talent. The Forward Deployed Engineer model is legendarily hard to scale. Palantir spent twenty years building a culture that attracts a specific kind of engineer — someone who can write production code, navigate enterprise politics, and translate between C-suite strategy and database schemas. The company’s revenue per employee of $1.5 million reflects both the quality of those hires and the difficulty of replicating them. OpenAI is attempting to bootstrap that talent pool overnight by acquiring Tomoro’s 150 engineers and presumably hiring aggressively from there. But the talent market for engineers who understand both frontier AI systems and enterprise deployment is one of the tightest labor markets in technology. Every major lab, every hyperscaler, and every well-funded startup is competing for the same candidates. DeployCo’s ability to recruit at the pace its valuation implies — scaling from 150 people to thousands within a year or two — is far from guaranteed.

Then there is the vendor lock-in problem, viewed from the customer’s side. DeployCo engineers will build systems on OpenAI models. Those systems will be optimized for GPT-class architectures, trained on OpenAI-specific APIs, and integrated with OpenAI’s tool-use and agentic frameworks. The moment an enterprise signs a DeployCo contract, it is making a multi-year bet on a single model provider in a market where the competitive landscape shifts quarterly. Anthropic’s Claude, Google’s Gemini, and an increasingly capable tier of open-weight models offer enterprises real alternatives, and the smart procurement teams know it. The risk is that sophisticated buyers view DeployCo not as a deployment partner but as a lock-in vector, and either avoid it entirely or insist on model-agnostic architectures that defeat the purpose of the embedded engineer model.

Venture capitalist Chamath Palihapitiya offered the most vivid critique, warning on his podcast that consultancies like PwC and Accenture are letting the “fox into the hen house” by deeply integrating OpenAI and Anthropic into their operations. His argument is that the model providers have every incentive to learn the consulting playbook, automate the repeatable parts, and then offer the automated version directly to the end client at a fraction of the consultancy’s billing rate. The consultancies counter that their domain expertise — understanding the regulatory nuances of insurance underwriting, the compliance requirements of pharmaceutical trials, the operational logistics of global supply chains — cannot be captured in a model. That may be true today. It is unlikely to be true in three years, especially if the model providers are spending those three years embedded inside the very organizations where that domain expertise lives.

There is also a governance concern that the AI safety community has raised. Per reporting from AI Business, critics argue that DeployCo risks replicating at enterprise scale AI systems embedded in consequential workflows without adequate human oversight. The Forward Deployed Engineer model works because the engineer is the oversight — a human in the loop who understands both the model’s capabilities and the business context. But as DeployCo scales, the ratio of engineers to deployed systems will inevitably decline, and the pressure to automate the oversight itself will grow. The question is not whether OpenAI can deploy AI responsibly at scale. It is whether a consulting subsidiary with capped profits and a guaranteed return structure has the right incentive architecture to prioritize safety over speed.

Finally, the competitive response is already underway. Anthropic’s expanded strategic alliance with PwC, announced on May 14, takes the opposite approach — rather than building a competing consulting subsidiary, Anthropic is embedding Claude Code and Cowork inside PwC’s existing 30,000-person U.S. workforce, certifying those professionals on Anthropic technology, and letting PwC’s institutional relationships do the distribution. The early results are striking: insurance underwriting cycles compressed from ten weeks to ten days, and delivery improvements of up to 70 percent across production deployments. If Anthropic can achieve comparable deployment outcomes through partner augmentation rather than direct competition, DeployCo’s value proposition — that only the model provider’s own engineers can deploy the model effectively — starts to look like an expensive assumption rather than a proven thesis.

The operator’s playbook for the deployment era

The DeployCo launch marks a structural shift in how AI companies think about revenue. For three years, the industry operated on a simple model: build the smartest model, charge per token, and let the market figure out deployment. That model is not dead, but it is no longer sufficient. The marginal cost of intelligence is falling faster than anyone predicted — open-weight models like Llama 4 and Mistral’s latest releases have compressed the quality gap on standard tasks to a margin that many enterprises cannot distinguish in production. The value is migrating from the model layer to the deployment layer, and the companies that control deployment will control the next cycle of AI economics.

For enterprise technology leaders evaluating their position, the calculus has changed materially. The market now offers two distinct deployment models. The first is the Palantir-OpenAI model: the vendor’s own engineers embed inside your organization, build on the vendor’s stack, and deliver systems optimized for that vendor’s models and tools. The upside is speed and depth — nobody knows GPT better than OpenAI’s engineers. The downside is dependency. The second model is the Anthropic-PwC model: the vendor trains and certifies your existing partners and internal teams, providing tools like Claude Code and Cowork that augment human capability rather than replacing the humans who deploy it. The upside is flexibility and institutional continuity. The downside is that your deployment quality is bounded by the capability of your existing workforce, not the vendor’s best engineers.

Neither model is obviously superior, and the right choice depends on organizational maturity, risk tolerance, and strategic intent. But the decision itself is now unavoidable. The era of running a few pilots, presenting the results at a board meeting, and deferring the deployment question to next quarter is over. DeployCo’s investors — TPG, Goldman Sachs, SoftBank — are not patient capital by nature, and the $4 billion they have committed will be deployed aggressively. Enterprises that do not have a deployment strategy will have one imposed on them by vendors and partners who do.

Here is what operators should be evaluating now. First, audit your pilot portfolio. If more than half of your AI initiatives are still in proof-of-concept after six months, you have a deployment problem, not a model problem, and DeployCo or a competitor will diagnose it for you — at their price, on their terms. Second, assess your switching costs. Any deployment engagement that uses vendor-specific APIs, fine-tuned models, or proprietary agentic frameworks creates lock-in. Insist on architecture reviews that document abstraction layers, model-swapping protocols, and data portability guarantees before signing a multi-year deployment contract. Third, evaluate the Anthropic-PwC alternative. If your organization has strong internal engineering talent and established consulting relationships, the partner-augmentation model may deliver comparable outcomes at lower strategic risk. The PwC alliance has already demonstrated 70 percent delivery improvements with Claude-certified internal teams, suggesting that the embedded-engineer model is not the only path to production AI.

Fourth, negotiate governance terms before they become afterthoughts. Any embedded-engineer engagement should include explicit agreements on data access boundaries, model evaluation cadences, escalation paths for high-stakes automated decisions, and clear ownership of intellectual property generated during the deployment. These are the terms that define whether an AI deployment partner is an asset or a liability, and they are far easier to negotiate before the engineers are sitting in your building than after your quarterly earnings depend on the systems they built.

The biggest risk in the deployment era is not choosing the wrong vendor. It is waiting. The consulting market for AI deployment is consolidating rapidly — Accenture has committed $3 billion, PwC is certifying 30,000 Claude professionals, and OpenAI has launched a $14 billion subsidiary. The enterprises that move first will secure the best engineering talent, negotiate the most favorable terms, and build the institutional muscle for AI-native operations before their competitors even finish evaluating proposals. The enterprises that wait will find themselves choosing from the leftovers of a talent market that was already tight, signing contracts on terms dictated by vendors with full order books, and deploying AI into organizations that have spent another year reinforcing the habits and processes that AI is supposed to replace.

The numbers tell the story of urgency. Palantir proved the FDE model works — $350 billion in market cap and revenue per employee of $1.5 million. OpenAI has the capital, the model advantage, and now the consulting army to pursue that same trajectory at enterprise scale. Anthropic has the partner network and the augmentation thesis. Google has the cloud distribution and $750 million in partner financing. The deployment layer is being claimed in real time, and the window for enterprises to shape those claims on favorable terms is measured in quarters, not years.

DeployCo is not the future of AI. It is the future of how AI gets from a research lab into the operations that run the economy. That distinction matters. The model is the engine. Deployment is the road. OpenAI just spent $14 billion to own the road — and it invited the incumbents to help pave it.

In other news

Anthropic acquires Stainless for over $300M — Anthropic announced the acquisition of Stainless, a developer-tools startup whose SDK generators are used by OpenAI, Google, and Cloudflare, among hundreds of others. Anthropic will wind down all hosted Stainless products, effectively pulling a key infrastructure supplier out of its competitors’ hands — the fourth acquisition in six months after Bun, Vercept, and Coefficient Bio.

Snap and Perplexity end $400M AI search partnership — Snap confirmed in its Q1 earnings that it amicably ended its $400 million cash-and-equity deal with Perplexity before a broad rollout, with Snap’s sales guidance now assuming no revenue contribution from the partnership. The companies reportedly could not align on whether the integration should drive Snapchat engagement or Perplexity subscriptions.

OpenAI opens ChatGPT ads to everyone — OpenAI launched a self-serve Ads Manager for ChatGPT on May 5, removing the $50,000 minimum spend that gated its managed pilot and adding CPC bidding alongside CPM. The company is targeting $2.5 billion in ad revenue for 2026 and $100 billion by 2030, with early pilot brands including Best Buy, Target, and Albertsons.

Claude for Small Business ships 15 agentic workflows — Anthropic launched Claude for Small Business on May 13 with connectors to QuickBooks, PayPal, HubSpot, Canva, Docusign, and Google Workspace, turning Claude Cowork into a lightweight operations suite that handles payroll planning, invoice chasing, contract review, and cash-flow monitoring — no IT support required.

OpenAI and Dell bring Codex on-premises — OpenAI and Dell Technologies announced a partnership to deploy Codex on Dell’s AI Data Platform and AI Factory for hybrid and on-premises enterprise environments, citing more than four million developers using Codex weekly and growing demand for AI agents that can access data behind corporate firewalls.