Goldman Sachs Analysis Finds AI Inference Costs Comparable to Corporate Payroll Expenses
TL;DR
A Goldman Sachs analysis has drawn attention to a striking milestone: at several major technology companies, AI inference costs — the expense of running trained models in production — now rival or exceed total payroll spending. While per-token inference costs have plummeted by as much as 280x since 2022, total enterprise AI spending surged 320% in 2025 alone due to exploding usage, raising questions about whether the payroll comparison signals genuine economic transformation or reflects an accounting artifact in an industry still lacking standardized reporting.
A Goldman Sachs research note circulating among institutional investors has crystallized a trend that CFOs across the S&P 500 are grappling with: at a growing number of companies, the cost of running AI models in production — known as inference — is approaching the same order of magnitude as their total payroll expenses. The finding has sparked debate about whether corporate America is entering a structural shift in how it allocates spending between human labor and machine intelligence.
The Numbers Behind the Claim
The core of the Goldman Sachs analysis rests on a comparison that would have been unthinkable three years ago. At major AI-native companies, compute expenses — encompassing both training and inference — now consume 54% to 62% of total operating costs, while staff spending accounts for less than 25% . At Anthropic, for instance, total compute spending in 2025 reached $6.8 billion (42% for R&D compute, 28% for inference), compared to $2.9 billion for staff — meaning the company spent more than twice as much on running and training models as it did on paying people . Minimax and Z.ai show similar patterns, with compute representing 57% and 62% of costs respectively .
These ratios are most extreme at AI-first companies, but the trend is spreading. Enterprise generative AI spending jumped from $11.5 billion in 2024 to $37 billion in 2025 — a 320% increase — and is projected to reach $55 billion in 2026 . The average enterprise AI budget has grown from $1.2 million per year in 2024 to $7 million in 2026, with 45% of organizations now spending over $100,000 per month on AI . Goldman Sachs projects that AI companies collectively may invest more than $500 billion in infrastructure during 2026 alone .
The Inference Cost Paradox
The most counterintuitive aspect of this story is that inference costs per unit have collapsed even as total spending has skyrocketed. The cost of querying a model at GPT-3.5 performance levels fell from $20 per million tokens in November 2022 to $0.07 by October 2024 — a more than 280-fold reduction in roughly 18 months . Epoch AI's analysis of price trends across multiple benchmarks found median annual decline rates of 50x, with post-January 2024 declines accelerating to 200x per year .
This is a textbook illustration of Jevons' Paradox — the 19th-century observation that making a resource cheaper increases rather than decreases total consumption. As inference became 1,000 times cheaper, companies deployed it in customer service automation, real-time data analysis, personalized content generation, and continuous agentic AI systems running around the clock . The per-unit savings were overwhelmed by demand that the lower prices unlocked.
Goldman Sachs CIO Marco Argenti has warned about "token sticker shock" — the moment when executives realize that despite dramatically lower unit costs, their aggregate AI bills are growing faster than almost any other budget line item . The 2026 turning point, where inference spending now exceeds training spending for the first time — accounting for 55% of the $37.5 billion in AI cloud infrastructure spending — confirms that production deployment, not research, is now the primary cost driver .
Which Industries Hit the Threshold First
The payroll-parity claim holds most cleanly in technology and financial services. Among S&P 500 companies, AI-related stocks have accounted for 75% of index returns, 80% of earnings growth, and 90% of capital spending growth since ChatGPT launched in November 2022 . These are also the sectors spending most aggressively on inference.
But adoption remains uneven. As of March 2026, fewer than 19% of U.S. establishments have adopted AI, though this is expected to rise to 22.3% within six months . Large firms (250+ employees) show a 35.3% adoption rate, while small firms lag at 21.5% . Goldman Sachs itself estimates that 80% of companies are not yet using AI meaningfully .
This means the "inference costs rival payroll" comparison applies to a relatively small vanguard of companies — primarily hyperscalers, AI labs, and major financial institutions — rather than the median S&P 500 firm. For most enterprises, AI spend remains a single-digit percentage of operating costs. The question is how quickly the rest of corporate America follows the leaders into production-scale deployment.
What Counts as 'Inference Cost'
One significant complication in the Goldman Sachs analysis is the absence of standardized accounting treatment for AI inference expenses. The costs bundled under "inference" can include cloud compute fees paid to providers like AWS, Azure, or Google Cloud; amortized capital expenditures on proprietary GPU clusters; software licensing fees for model access; and energy costs for running data centers .
Companies report these expenses inconsistently. Some classify AI compute under R&D; others fold it into cost of goods sold or general technology spending. Only 51% of organizations can confidently measure their AI return on investment, creating what one analysis calls an "accountability gap" in spending that continues to accelerate .
This matters because the payroll comparison depends on what you include. A narrow definition — just cloud API fees — produces a smaller number. A broader definition — encompassing capital depreciation on GPU clusters, energy, cooling, and internal engineering time spent on model optimization — can dwarf the narrow figure. Without consistent reporting standards, cross-company comparisons remain imprecise.
The Labor Displacement Question
If inference costs are functionally replacing payroll line items, the implied question is whether workers are being displaced. The evidence is mixed but directional.
In a survey of 1,000 U.S. business leaders, 39% reported conducting layoffs in 2025, with 58% expecting further layoffs in 2026 . Corporate America cut more than 1.17 million jobs in the first 11 months of 2025 — a 54% increase from 2024 . High-salary roles are being targeted first for immediate payroll savings, with workers lacking AI-related skills facing the highest risk .
Specific categories face acute pressure: 80% of customer service roles are projected to be automated, potentially displacing 2.24 million out of 2.8 million U.S. jobs in that sector . Paralegals face an 80% automation risk by 2026, and legal researchers face 65% by 2027 . Mid-layer management — including HR, talent acquisition, and compliance roles — is being disproportionately targeted .
Total nonfarm payrolls stood at 158.6 million as of March 2026, up only 0.2% year-over-year , while average hourly earnings rose 3.5% to $37.38 . The labor market remains broadly stable, but the composition is shifting. Goldman Sachs' own research warns that AI-driven job losses could have "lasting costs," with displaced workers facing pay cuts and slower career growth .
Historical precedent offers a partial guide. ATMs did not eliminate bank tellers — the number of tellers per branch fell, but the lower cost per branch led to more branches being opened, and tellers shifted toward sales and relationship roles . ERP systems eliminated many data-entry positions in the 1990s but created new categories of work in systems administration and business analysis. The net employment effect of prior automation waves has generally been positive over 10-to-20-year horizons, though the transition period imposes real costs on displaced workers .
Women face disproportionate exposure: 79% of employed women are concentrated in occupations at high risk of AI automation, compared to 58% of men — a 1.4x differential .
Goldman Sachs' Conflicts of Interest
Goldman Sachs occupies an unusual position in this analysis. The firm is simultaneously a producer of research about AI infrastructure spending, a major financial advisor to the hyperscalers driving that spending, and an investor in AI infrastructure itself. Goldman's stock soared 56% in 2025, driven in part by AI-related deal flow .
Goldman formally discloses these conflicts: "Goldman Sachs does and seeks to do business with companies covered in its research reports. As a result, investors should be aware that the firm may have a conflict of interest that could affect the objectivity of this report" . The firm's business model creates a structural incentive to normalize large AI capital expenditures — every upward revision to AI capex estimates generates advisory fees, trading commissions, and asset management opportunities .
This does not mean the analysis is wrong. But it does mean the framing — presenting inference costs reaching payroll parity as a natural and inevitable progression rather than a potentially unsustainable spending pattern — serves Goldman's institutional interests. When a bank that profits from AI infrastructure deals publishes research arguing that massive AI infrastructure spending is rational, the audience should weigh that context.
Goldman is also transforming its own operations with AI through its "OneGS 3.0" initiative, embedding generative and predictive AI across the firm . CFO Denis Coleman has described it as a multi-year overhaul of Goldman's operating model . The firm is both analyst and subject in this story.
The International Picture
The payroll-parity claim is most applicable to U.S. companies, where AI infrastructure investment is concentrated. The international picture differs substantially.
China has created a structurally lower cost environment through aggressive state subsidies. Chinese provinces like Gansu, Guizhou, and Inner Mongolia have slashed cloud providers' power bills by as much as 50% for AI workloads . Electricity in China's western provinces costs as little as $0.05 per kilowatt-hour, compared to $0.25 in Beijing and Shanghai or $0.40 in parts of the U.S., and power makes up roughly 35% of inference costs . Chinese AI labs like Kimi offer inference at one-quarter the price of comparable U.S. models . Beijing is also directly subsidizing API access and model licensing .
The European Union faces the opposite problem. EU fiscal rules constrain member states to deficits below 3% of GDP, limiting the kind of public investment that could subsidize AI infrastructure . The EU produced only about three large foundation models in 2025, compared to roughly forty in the U.S. and fifteen in China . European companies face higher compliance costs under the EU AI Act's risk-based regulatory framework . The payroll-parity dynamic is largely absent in Europe — not because companies are spending less on AI relative to workers, but because they are spending less on AI altogether.
These differences suggest that the Goldman Sachs analysis describes a phenomenon concentrated in the U.S. and, in modified form, in China — not a universal global trend.
Tax, Governance, and the Balance Sheet
If AI inference is functionally substituting for labor, the implications extend beyond operating budgets into corporate tax strategy, governance, and shareholder relations.
Payroll spending carries significant tax overhead — employer-side Social Security and Medicare contributions, unemployment insurance, workers' compensation, and benefits costs. Compute spending carries none of these. A dollar shifted from payroll to inference is worth more to the company on a post-tax basis, creating a structural incentive to automate even when the pre-tax costs are roughly equal .
Proposals to address this imbalance are emerging. The "Transition Economy Act," a bipartisan framework under discussion in Congress, would fund direct transfers to displaced workers through a tax on AI inference compute . Whether such legislation gains traction depends on how quickly the payroll-to-compute substitution becomes visible in employment statistics.
At the board level, the shift is beginning to register. Companies now spend an average of 36% of their digital budgets on AI, with digital budgets themselves growing from 7.5% of revenue in 2024 to 13.7% in 2025 . Some compensation committees are starting to evaluate AI spending alongside headcount in workforce planning, though formal treatment of AI as a payroll-substitute line item remains rare.
What the Productivity Data Actually Shows
For the payroll comparison to be economically meaningful — rather than merely arithmetically interesting — AI inference spending needs to generate productivity gains that justify the outlay. The early evidence is encouraging but incomplete.
Enterprise workers using AI tools report recovering 40 to 60 minutes per day, according to Goldman Sachs' own survey data . Academic studies show a 23% average productivity uplift, while individual company reports indicate gains closer to 33% . Seventy-five percent of AI-equipped workers say they can now complete tasks they previously could not do at all .
But only 51% of organizations can confidently measure AI return on investment . The gap between perceived productivity gains and measurable financial returns remains wide. And the agentic AI systems that are driving the fastest inference cost growth — autonomous agents running 24/7 — are too new to have established clear ROI benchmarks.
The Bottom Line
The Goldman Sachs analysis identifies a real and accelerating trend: at AI-intensive companies, compute costs have reached and in some cases surpassed payroll as a share of operating expenses. The comparison is most stark at AI-native firms like Anthropic, where compute consumes 70% of costs versus 30% for staff , and it is spreading into financial services, technology, and customer-facing industries as deployment scales.
But the framing requires several caveats. The comparison applies to a small fraction of companies today — fewer than one in five U.S. establishments have adopted AI at all . The "inference cost" category lacks standardized accounting treatment, making precise comparisons unreliable. The Jevons' Paradox dynamic — plummeting unit costs driving exploding total spend — means the trend could reverse if usage growth slows or if further cost reductions outpace demand expansion. And Goldman's own financial interests in normalizing large AI capital expenditures should inform how much weight readers place on the analysis.
What is not in dispute is the direction. Enterprise AI spending tripled in a single year . Inference now exceeds training as a share of AI infrastructure budgets . Companies are beginning to treat AI compute as a functional substitute for certain categories of labor. Whether that substitution produces net economic gains or net displacement — and over what time horizon — is the central economic question of this decade.
Related Stories
Treasury Secretary Summons Bank CEOs to Discuss Cyber Risks Posed by Anthropic AI Model
Trump Officials Reported to Be Encouraging Banks to Test Anthropic AI Model
Goldman Pitches Hedge Funds on Corporate Loan Shorting Strategies
Wall Street Warns Iran Conflict Will Cause Prolonged Energy Crisis
Anthropic's Claude Managed Agents Launch Rattles Banks and Triggers SaaS Stock Selloff
Sources (20)
- [1]Compute accounts for the majority of expenses of AI companiesepoch.ai
Across three major AI companies, compute expenses represent 54-62% of total costs, while staff spending accounts for less than 25% of operating expenses.
- [2]The Inference Cost Paradox: Why Generative AI Spending Surged 320% in 2025arturmarkus.com
Enterprise generative AI spending grew from $11.5B in 2024 to $37B in 2025 despite per-token costs dropping 1,000x, with average monthly AI budgets rising 36%.
- [3]Why AI Companies May Invest More than $500 Billion in 2026goldmansachs.com
Goldman Sachs projects AI companies may collectively invest more than $500 billion in infrastructure during 2026.
- [4]Research and Development - The 2025 AI Index Reporthai.stanford.edu
GPT-3.5-level performance cost $20 per million tokens in Nov 2022 and fell to $0.07 by Oct 2024 — a 280-fold reduction.
- [5]LLM inference prices have fallen rapidly but unequally across tasksepoch.ai
Price decline rates range from 9x to 900x per year depending on benchmark, with a median of 50x and post-2024 rates accelerating to 200x per year.
- [6]How Goldman Is Scaling AI to Transform Its Business Operationsnasdaq.com
Goldman Sachs CIO Marco Argenti warns of 'token sticker shock' as the firm embeds generative AI across workflows through its OneGS 3.0 initiative.
- [7]Where is AI Spending Going in 2026?mktclarity.com
AI cloud infrastructure spending projected at $37.5B in 2026, with 55% going to inference — exceeding training for the first time. Companies spend 36% of digital budgets on AI.
- [8]Big Tech AI Spending: $300B Capex Race in 2026tech-insider.org
AI-related stocks account for 75% of S&P 500 returns, 80% of earnings growth, and 90% of capital spending growth since ChatGPT launched.
- [9]AI is saving workers up to an hour a day — but Goldman Sachs says 80% of companies aren't using it yetfortune.com
Fewer than 19% of U.S. establishments have adopted AI as of March 2026. Workers using AI recover 40-60 minutes daily with 23% average productivity uplift.
- [10]AI Costs: A Complete Breakdown (2026)nerdleveltech.com
Comprehensive breakdown of AI inference cost components including cloud compute, GPU clusters, licensing fees, and energy costs.
- [11]Nearly 4 in 10 companies will replace workers with AI by 2026hrdive.com
39% of business leaders conducted layoffs in 2025, with 1.17 million jobs cut in the first 11 months — a 54% increase from 2024.
- [12]77 AI Job Replacement Statistics 2026demandsage.com
80% of customer service roles projected for automation; paralegals face 80% automation risk by 2026. Women are 1.4x more exposed to displacement than men.
- [13]Total Nonfarm Payrolls - FREDfred.stlouisfed.org
Total nonfarm payrolls: 158,637K as of March 2026, up 0.2% year-over-year.
- [14]Average Hourly Earnings - Bureau of Labor Statisticsdata.bls.gov
Average hourly earnings for private sector workers: $37.38 as of March 2026, up 3.5% year-over-year.
- [15]AI-Driven Job Losses Could Have 'Lasting Costs,' Goldman Sachs Warnsbenzinga.com
Goldman Sachs warns displaced workers face pay cuts and slower career growth from AI-driven job losses.
- [16]Goldman Sachs Soars 56% in 2025: The Dow's Surprising AI Infrastructure Powerhousedisruptionbanking.com
Goldman's stock rose 56% in 2025 driven by AI deal flow. The firm formally discloses conflicts of interest in AI research reports.
- [17]China could be the 'big winner' in the AI racefortune.com
Chinese provinces slash cloud power bills by 50% for AI workloads; electricity costs as low as $0.05/kWh vs $0.40 in parts of the U.S.
- [18]AI power play: Can Europe catch up with the US and China?euronews.com
The EU produced only about 3 large foundation models in 2025 vs 40 in the U.S. and 15 in China; fiscal rules constrain AI infrastructure investment.
- [19]AI Dilemma: Regulation in China, EU & US - Comparative Analysispernot-leplay.com
EU shows highest regulatory strictness with risk-based framework; US maintains flexible approach; China prioritizes state control with comprehensive monitoring.
- [20]The future of tax policy: A public finance framework for the age of AIbrookings.edu
Discussion of proposed frameworks for taxing AI compute, including the Transition Economy Act for funding transfers to displaced workers.
Sign in to dig deeper into this story
Sign In