EY digital lab report view

How data quality makes the difference in successful AI applications

Related topics

Discover the potential of AI by mastering data quality: unlock the power of AI with high-quality insights.


In brief

  • Data quality, trust, and security are critical for reliable AI outcomes.

  • Most organizations lack the essential data infrastructure needed to effectively implement Agentic AI.

  • Without a trustworthy data foundation, Agentic AI initiatives risk failure, bias, and reputational damage.

Few organizations truly have their enterprise-level data in order. Our recent research shows that 36% of CIOs report their data platform infrastructure is inadequately prepared. According to Databricks, only 22% of organizations have sufficiently built their data infrastructure to implement AI effectively.We live in an era where data is generated in unprecedented volumes—billions of bytes every second across platforms and devices. This explosion of information presents enormous opportunities, but also significant challenges for both businesses and individuals.

AI promises improved automation, higher productivity, and lower costs. But the increasing volume, velocity, and variety of data introduces new obstacles. Regulations like the EU AI Act add pressure on organizations to manage their data properly. That’s why a solid data platform is essential. But how do you combine internal data with relevant external information from the internet? Where does an AI agent source its data, and what criteria guide its decisions? The output of one agent influences the actions of the next—a chain of decisions that must be carefully managed to ensure reliable results.

Without strong data quality and infrastructure, deploying AI effectively—especially autonomous agents—is a challenge. The key lies in building data products and agents that perform well, comply with regulations, and minimize risks such as errors, hallucinations, and bias.

The key to successful AI implementation lies not just in the technology itself, but in the quality of the data

Data: The fuel for AI success

In today’s digital world, AI is a hot topic attracting companies across sectors. Its promise to automate processes, reduce costs, and generate insights drives growing interest. But the key to successful AI implementation isn’t just the technology—it’s the quality of the data that feeds these systems. Simply put: garbage in is garbage out.

Our research explores the critical role of data in the AI revolution and offers insights for organizations looking to benefit from this technology. EY asked 500 CIOs about their biggest concerns when adopting AI, and 36% cited insufficient data and infrastructure platforms.

A fundamental challenge

These insights highlight a critical business challenge: how do you use data effectively in practice? As Gartner notes, the real differentiator today isn’t the large language models (LLMs) or AI models you build, but the quality and completeness of your data.

What CIOs are saying

  • 21% struggle to meet regulations or face increased ethical and legal risks.
  • 36% lack the right data and infrastructure platforms to implement generative AI.
  • 31% are concerned about rising cyber and data privacy risks (e.g., data leaks, data poisoning).
  • 5% risk reputational damage due to inaccurate data.
  • 4% lack the right skills and capabilities; employees worry about job security.
  • 2% don’t know, as ROI remains unclear.

What can go wrong?

Real-world examples show how things can go seriously wrong. Headlines speak for themselves:

  • An airline was held liable after a chatbot gave passengers incorrect advice.
  • Zillow’s failed AI project revealed the difficulty of valuing real estate with AI.
  • New York City defended an AI chatbot that advised entrepreneurs to break laws.
  • A major healthcare algorithm showed racial bias.
  • Amazon scrapped a secret AI recruitment tool that discriminated against women.
AI is only as good as the quality of the data you feed it

It’s about trust, reliability, and security

Data—and how we differentiate ourselves with it—is the core. But as AI matures and scales, it brings challenges in data management, security, infrastructure, and governance/MLOps. To trust AI in decision-making, it must be fed accurate and reliable information.

Quantity vs. quality: The data dilemma

Organizations must ask themselves a crucial question: do we have access to all the data needed to achieve optimal results with AI? Many models are trained on publicly available data, but the real opportunity lies in effectively combining external data with internal business data. That integration is what sets organizations apart.

To trust AI in decision-making, it must be fed accurate and reliable information. Data quality is essential: complete, accurate, and error-free. But security is also increasingly important. As AI becomes more widespread, the risk of malicious actors exploiting faulty or manipulated data grows. Organizations must invest not only in combining internal and external data sources, but also in securing them. Only then can AI be deployed in a way that is trustworthy, responsible, and distinctive.

To trust AI in decision-making, it must be fed accurate and reliable information.

Robust Data Infrastructure

Data is the fuel for AI. Organizations must organize and manage their data effectively to reap the benefits of AI. That means investing not only in AI technology, but also in a robust data infrastructure. Combining internal data with external sources is a key differentiator for organizations aiming to stand out.

 

Consequences of poor data

One of the biggest challenges for companies is the lack of consistent and reliable data. Many still operate in data silos, leading to inefficiency and poor decision-making. This lack of integration and quality can result in suboptimal AI performance and even failed AI initiatives.

 

Poor data can have serious consequences, such as bad decisions and reputational damage. Think of scenarios where chatbots are hacked and leak sensitive information, or where unreliable data leads to inappropriate decisions. Companies must be aware of these risks and take steps to secure their data. That includes implementing data quality controls, security measures, and governance structures to ensure the data used for AI applications is reliable and safe.

 

Organizations must prepare for the AI revolution by optimizing their data and ensuring it is trustworthy and secure to survive.


What is an AI-ready data platform?

An AI-ready data platform aligns with the organizational data strategy, delivers discoverable and trusted data on an auditable and scalable data architecture that mitigates risk by emphasizing security and compliance through data governance to support modern AI use cases.


Take the 15 minute assessment and understand your current AI capabilities

To build an AI-driven organization for the future, it is essential to understand your current AI capabilities. As the potential of AI continues to grow, enterprises will need to transform; understanding the current state of the business and quickly identifying opportunities for improvement will be crucial differentiators.

The EY.ai Maturity Model is designed to help organizations visualize their current level of Generative AI (GenAI) maturity across seven dimensions and provides recommended actions to progress to the next level. The assessment takes just 15 minutes to complete and delivers a personalized output report.

An EY-team can also work with you to evaluate your current levels of GenAI adoption and equip your leadership with a clear understanding of your current capabilities versus your desired future state.



EY AI Week: Harness the Power of AI

From November 3rd to November 7th it is time for EY AI Week. EY AI Week is a hybrid program, online and in-person, featuring perspectives from industry experts and visionary thinkers, along with hands-on sessions.

EY AI Week

Summary

To succeed with GenAI, organizations must align data quality, governance, and infrastructure. Trustworthy AI starts with secure, well-integrated, and reliable data.

Related articles

How responsible AI can unlock your competitive edge

Discover how closing the AI confidence gap can boost adoption and create competitive edge. Explore three key actions for responsible AI leadership.

Four steps to embed data ethics into your data risk control environment

Data privacy concerns are pushing organizations to consider data ethics in data use decisions. Learn how to incorporate data ethics in the process. Learn more.

EU AI Act Roadmap: What does the AI act mean for your organization?

The EU AI Act is coming soon. What does this mean and what steps should you take now?


    About this article

    Authors