While agentic AI shares the common risks of all AI systems, its capacity for independent execution and complex goal pursuit creates three distinct categories of risk. Organizations that understand these categories, and how to control them, will be able to deploy agentic AI tools responsibly at speed and at scale. By adopting effective governance strategies, organizations can prevent compliance breaches, security incidents and failures that cascade across interconnected systems.
What are the three agentic AI risk categories?
These three categories of risk are uniquely associated with agentic AI:
1. Lack of human oversight and accountability
When AI systems operate with minimal human intervention, harmful or unethical outcomes are more likely to slip through undetected. Furthermore, agentic AI can execute tasks through opaque processes, making it difficult for stakeholders to understand how or why the system made certain decisions or took certain actions.
Agentic AI also increases the risk of “automation bias”, where human operators place excessive trust in agentic AI decisions and fail to critically evaluate outputs. Additionally, it is not always clear who is ultimately responsible when agentic AI makes a consequential decision: is it the developer who trained it, the business leader who deployed it, or the team that used it but did not intervene?
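A common control for this category is a human-in-the-loop approval gate: the agent must pause and obtain explicit sign-off before executing consequential actions, which also creates a clear accountability record. The sketch below is a minimal, hypothetical illustration in Python; the class, function names and risk threshold are assumptions for illustration, not part of any specific agent framework.

```python
# Minimal sketch of a human-in-the-loop approval gate for an agent's actions.
# All names (Action, requires_human_approval, RISK_THRESHOLD) are hypothetical
# illustrations, not a reference to any particular agent framework.
from dataclasses import dataclass

RISK_THRESHOLD = 0.7  # assumed cutoff above which a human must approve


@dataclass
class Action:
    description: str
    risk_score: float  # e.g. estimated financial or compliance impact, 0-1


def requires_human_approval(action: Action) -> bool:
    """Route consequential actions to a human reviewer instead of auto-executing."""
    return action.risk_score >= RISK_THRESHOLD


def execute(action: Action) -> None:
    if requires_human_approval(action):
        # Hold the action and wait for explicit sign-off; recording who approved
        # it answers the accountability question raised above.
        print(f"PENDING HUMAN APPROVAL: {action.description}")
    else:
        print(f"EXECUTED: {action.description}")


execute(Action("Renew low-value software subscription", risk_score=0.2))
execute(Action("Terminate supplier contract", risk_score=0.9))
```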
2. AI goal misalignment
Goal misalignment occurs when an agentic AI system pursues the wrong objectives, something its capacity for independent execution makes possible. Examples of goal misalignment include:
- Goal drift — where system objectives shift over time, creating misalignment with original intentions.
- Secondary uses — where agentic systems are designed for one purpose and then repurposed for another without appropriate re-evaluation. For example, a procurement optimization agent is used as a supplier evaluation agent without a human considering whether its training and constraints still apply.
- Reward hacking — where agents exploit loopholes in their reward structure. For example, a fraud detection system might flag every transaction as suspicious to maximize its “detection rate” (see the sketch after this list).
- Emergent behaviors and veiled objectives — systems develop unexpected behaviors or pursue hidden objectives that the system designers never anticipated.
- Algorithmic determinism — overreliance on rigid agent decision-making without accounting for changing environments, shifting data, new parameters, or nuanced human judgment.
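To make reward hacking concrete, the sketch below shows, under assumed definitions, how a naively specified reward for a fraud detection agent is maximized by flagging every transaction, and how adding a false-positive penalty closes that loophole. The data, metric names and penalty weight are illustrative assumptions only.

```python
# Illustrative sketch of reward hacking: a fraud-detection "agent" maximizes a
# naive reward (share of fraud flagged) by flagging everything. The data and
# reward definitions are assumptions made purely for illustration.

transactions = [
    {"id": 1, "is_fraud": False},
    {"id": 2, "is_fraud": True},
    {"id": 3, "is_fraud": False},
    {"id": 4, "is_fraud": False},
]


def naive_reward(flags: list[bool]) -> float:
    """Reward = share of fraudulent transactions flagged (detection rate only)."""
    frauds = sum(1 for t in transactions if t["is_fraud"])
    caught = sum(1 for t, f in zip(transactions, flags) if t["is_fraud"] and f)
    return caught / frauds


def balanced_reward(flags: list[bool], fp_penalty: float = 0.5) -> float:
    """Reward that also penalizes false positives, so flagging everything no longer pays."""
    frauds = sum(1 for t in transactions if t["is_fraud"])
    caught = sum(1 for t, f in zip(transactions, flags) if t["is_fraud"] and f)
    false_positives = sum(1 for t, f in zip(transactions, flags) if not t["is_fraud"] and f)
    return caught / frauds - fp_penalty * false_positives / len(transactions)


# Degenerate policy: flag every transaction.
flag_everything = [True] * len(transactions)
print(naive_reward(flag_everything))     # 1.0 -- a "perfect" detection rate
print(balanced_reward(flag_everything))  # lower score: the loophole no longer pays off
```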
3. Error amplification
When agentic systems interact with each other or operate at scale, existing problems can multiply, creating potentially catastrophic risks. For example, destabilizing feedback loops arise when one system's output becomes another's input, creating cycles that amplify errors or undesirable behaviors.
Another serious amplification-related risk is known as the “cascade of failures.” In this case, interconnected agentic systems create chain reactions where one malfunction triggers failures in others, causing widespread disruption.
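A toy simulation can make both failure modes concrete. In the hypothetical sketch below, two agents feed each other's outputs with a gain factor above 1, so a small initial error compounds each cycle; once the error exceeds a downstream tolerance, that agent fails and the failure propagates. The agent roles, gain and tolerance values are illustrative assumptions, not a model of any real deployment.

```python
# Toy sketch: a destabilizing feedback loop between two agents, followed by a
# cascade once the compounded error exceeds a downstream tolerance. The gain,
# tolerance, and agent roles are assumptions chosen purely for illustration.

GAIN = 1.2        # each agent slightly over-reacts to the other's output
TOLERANCE = 0.05  # error level beyond which the downstream agent fails outright


def pricing_agent(demand_error: float) -> float:
    """Adjusts prices based on the demand signal it receives, over-correcting slightly."""
    return GAIN * demand_error


def inventory_agent(price_error: float) -> float:
    """Adjusts stock levels based on the price signal it receives."""
    return GAIN * price_error


error = 0.01  # a small initial mistake in one input signal
for cycle in range(1, 10):
    error = inventory_agent(pricing_agent(error))  # each output feeds the other's input
    print(f"cycle {cycle}: compounded error = {error:.4f}")
    if error > TOLERANCE:
        # A hypothetical downstream agent can no longer cope, so its failure
        # cascades to every system that depends on it.
        print("downstream logistics agent failed -> cascading disruption")
        break
```

Because the gain is above 1, a 1% mistake grows by roughly 44% per cycle in this sketch, crossing the tolerance within a few cycles instead of dying out.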