Two symmetrical steel railway bridges at night with train light trail on the left. Stars on the sky.

Unlocking agentic value: a new investment discipline for the agentic era

Related topics

As the agentic era reshapes the AI economics of enterprise technology, organizations have an opportunity to govern run-rate exposure as a growth investment, protecting operating budgets and margin while compounding enterprise value.


In brief
  • Agentic AI is shifting enterprise AI from fixed software/labor costs to variable compute use, where token costs are only part of the total cost. 
  • Organizations need agentic FinOps to manage total agent costs, including infrastructure, governance, org change, failure recovery, and regulatory risk. 
  • Leaders should treat agents as growth investments, with clear ownership, cost controls, and value metrics so scaling is based on measurable return.

The loudest conversation about AI right now is about what it costs. The more important one is about what it earns. Effective governance treats that spend as a growth investment, directing it toward measurable enterprise value.

Token costs are the most visible line item in an AI budget and are often the first sign that the economics of agentic AI are changing. They are not the full cost, but they matter because they are where inference intensity, model choice, orchestration complexity and upstream compute scarcity first surface, often only after the work is done.

The issue is becoming more important as token pricing is ultimately tied to a constrained physical supply chain: chips, power, data centers, cloud capacity and model provider infrastructure. Enterprises may experience that constraint in the form of token bills, usage caps, rate limits, model access restrictions or unexpected repricing. For many companies pushing AI adoption, the last few months have been a shock to the budget.

Current pricing may also understate the long-term economics of agentic AI if some portion of compute cost is being absorbed, subsidized or strategically priced by upstream providers. As agentic use cases become more valuable and more compute-intensive, those costs are unlikely to remain invisible indefinitely. Someone will ultimately pay for the compute that agents consume. Many providers are already shifting from subscription-based fees to consumption-based models, ending the days of vendor-subsidized pricing.


Stay ahead of the agentic AI cost curve.

Receive notification when the next edition of the EY Total Cost of Agents series is released. 


But while tokens are the story today, the real economics of running agentic workflows at scale include infrastructure, operations, people, risk and the hidden cost of engineering around AI’s limitations. Optimizing tokens without understanding total cost of ownership is like managing a factory by watching the electricity bill.

The story previously presented to many boards was straightforward: an expensive predictable line of human labor traded for a cheaper predictable line of software. But an agent is not a license. It lives in an operating stack of compute, models, data pipelines, governance, oversight and redesign work, each of which compounds rapidly. As a result, the tokens AI consumes, serving as the metered proxy for much of that operating stack, become a highly variable operating cost.

Consider a customer service AI assistant built two years ago to answer product questions. A chat that once cost $0.04 may now involve tool retrieval, planning and subagents that turn it into a $1.20 orchestration. Additional costs — such as knowledge-base updates, agent evaluation and human-collaboration design — may not appear on the model vendor’s invoice but still surface to the enterprise.

cost per chat
$0.04

 

>

cost per orchestration
$1.20

When a CFO asks what one agent costs, they are often shown a vendor invoice that captures only part of the total financial exposure.


The challenge is that agentic AI shifts enterprise AI from a fixed-cost labor comparison to a dynamic compute consumption model. Boards and CFOs are therefore looking for investment cases that capture the full operating picture, including usage, orchestration, governance, change, risk controls and remediation.

This also changes how AI is budgeted and managed. CFOs need visibility into consumption across use cases. CTOs need to understand inference volume, model mix and retrieval load. CEOs need to assess not only labor savings, but the ongoing technology run rate required to deliver them.

According to a Gartner poll, more than 40% of agentic AI projects will be canceled by the end of 2027 due to escalating costs, unclear business value or inadequate risk controls. 1 These outcomes are not inevitable. They are more likely where organizations scale agents before establishing agentic FinOps to manage cost, risk, usage and value together.

40% of agentic projects are predicted to be canceled by the end of 2027.

How to break down the cost of an agent

Most companies only include costs 1-3 in their agentic investments and business cases, with costs 4-7 often emerging later in the lifecycle as agents scale.


The practice of agentic FinOps does three things at once:

  • It allows enterprises to estimate the costs, including lag. 
  • It assigns an owner to each row before the spend, not after the invoice. 
  • It gives leadership the basis to decide which initiatives earn the right to scale based on the value they unlock against their fully loaded cost. 

The key challenge: The total cost of an agent, at most organizations, is structurally invisible until designed for visibility. This is not because the costs are hard to capture, but because they surface irregularly across uncoordinated budgets.

The table below is our working framework for agentic FinOps, illustrating how agent costs are organized and evaluated.

 CostWhere it hidesHow it movesWhen you knowBudget owner
1Tokens and API callsInvoice — model vendorHighly variable based on usageAt the end of the month after the work is doneCIO/CTO BU Leader
2Subscriptions and licensesInvoice — multiple vendors; visible but fragmentedFairly predictable, based often on license count or tiered consumption patternsFairly constant and predictableCPO/CIO
3Platform infrastructureCloud bill – often filed under infrastructureStep-fixed by tier, variable on top; rarely goes downBefore spend, can be estimated within a rangeCTO/CIO
4Governance burdenRisk and compliance budget – often headcountCompounding, every agent widens the surface; limited economies of scale.Baseline is scopable; compounding over quartersCRO/CAE
5Organizational changeWorkforce budget, HR, L&D and consultingFront-loaded per workflowInitial cost is plannable; recurrence is triggered by someone else’s roadmapCOO/CHRO
6Expected failure and recoveryNowhere – until the incidentZero until it isn’t. Probabilistic moves with scale of agentsAfter the damage is doneCFO
7Potential AI taxes for agentsDoesn’t exist yet; regulatory signals onlyUnknownWhen/if regulation landsCRO/GCO/CFO

What leaders can do now

The pace of the market does not allow for a year of observation. Three key priorities will help put most enterprises on sharper footing within a single planning cycle.

1. Appoint a Head of Agent Economics

Centralize accountability under a single executive. Agentic AI costs cut across budgets, so enterprises need clear ownership for model usage, cost leakage and value realization. Whether led by a Head of Agent Economics or Agent FinOps Lead, the focus should be visibility across the seven line items of AI and cloud spend, with total spend management treated as both a KPI and a capital allocation decision.

2. Install agentic circuit breakers before scale

Benchmark current operations on a per-task or per-outcome basis to establish a defensible view of what good and bad look like. Today it is almost impossible to answer what a unit of agentic work consumes or what stops it when the agent runs away. Then, install hard kill switches: spend ceilings, call-volume caps and automatic shutoffs at the agent, workflow and BU level. Without circuit breakers, the bill is only visible after the damage is done.

3. Embed full TCO in the business case upfront

Incorporate both capital expenditures (CapEx) and operating expenditures (OpEx) into the business case from the outset. Most agentic initiatives underestimate cost by isolating model pricing from the broader operating envelope. Capture infrastructure, orchestration, monitoring and human-in-the-loop costs to create a transparent view of total cost of ownership (TCO) and force a direct comparison against the enterprise value expected from agents.

The capital cycle where agents get weighed instead of counted

The era that exploration is giving way to is one that rewards discipline. Investment numbers are only meaningful when tied to value. Every agent should have a value metric on day one: output, revenue, productivity, speed, quality, risk reduction or decision improvement it is expected to produce. No agent should be approved, scaled or renewed without a clear measure of what it produces per dollar.

That requirement moves the discussion from spending to return. It gives the Head of Agent Economics a practical basis to shift investment away from agents that consume resources and toward agents that create compounding value over time.

The next AI cycle will not be won by companies with the most agents, the best models or the largest spend. It will be won by those who treat agent capacity as capital and allocate it deliberately, like any other investment driving growth, margin and enterprise value.

What’s next in the EY Total Cost of Agents series

This whitepaper is the first edition of the EY Total Cost of Agents series. Other papers in this series will examine:

  • The path to break-even and the pivot from productivity gains to growth 
  • The strategic management of compute as a critical supply chain asset 
  • The policy and regulatory landscape that will shape how AI is governed, priced and consumed in the years ahead.

Download the full whitepaper

Download the report to explore key insights from the first article in our Total Cost of Agents series and learn how organizations can better understand the full cost implications of agentic AI.

Summary 

Agentic AI is shifting enterprise technology economics from predictable software or labor costs to variable, consumption based compute. While token costs are most visible, they represent only a fraction of total cost, which also includes infrastructure, governance, organizational change, risk and potential regulatory impacts. These costs are often fragmented and become visible only after scaling. The paper argues organizations must adopt “Agent FinOps” to manage total cost of ownership, assign accountability and link spend to measurable value. Leaders should implement cost controls, embed full TCO in business cases and treat agent capacity as a capital investment to drive sustainable growth and enterprise value.

AI Token Cost FAQs

About this article

Authors

Related articles

Responsible AI monitoring

As AI evolves from prediction to autonomous action, businesses need a framework for effective AI monitoring across governance, risk and performance.

How agentic AI should reinvent work beyond automation

AI ROI will not come from automation alone. Explore how agentic AI helps organizations reinvent work, processes and roles to unlock new value.

Onshore, offshore, agent: welcome to the modern, three-pronged workforce

AI is becoming the third pillar of the workforce. Discover how to balance human and machine collaboration for long-term growth.


    Contact us