EY helps clients create long-term value for all stakeholders. Enabled by data and technology, our services and solutions provide trust through assurance and help clients transform, grow and operate.
At EY, our purpose is building a better working world. The insights and services we provide help to create long-term value for clients, people and society, and to build trust in the capital markets.
Healthcare payers struggle with maintaining accurate provider directories, with studies showing 80%+ of provider entries contain errors like incorrect addresses, phone numbers, professional details, license details. Manual validation processes are time-intensive, requiring staff to call providers, verify credentials, and update multiple systems. This creates frustration among members when they can't reach providers, regulatory compliance risks, and wasted operational resources. A simplified AI solution focused on automating basic provider data validation and directory updates can demonstrate significant value while being feasible for hackathon development using publicly available data sources.
Business Problem
Current Challenges:
Provider directories contain 40-80% inaccurate contact information causing member frustration and access issues
Manual verification processes requiring staff to call hundreds of providers monthly for basic updates
Multiple data entry points creating inconsistencies between online directories, mobile apps, and printed materials
Regulatory requirements demanding frequent provider data updates with limited automation capabilities
Time-consuming credential verification processes delaying provider network additions by weeks or months
Member complaints about outdated provider information leading to unsuccessful appointment attempts
Desired Outcomes
Automate provider data validation through intelligent web scraping and API calls
Reduce manual verification time through AI assistance
Achieve target provider contact information accuracy through continuous automated validation
Create unified provider data management reducing inconsistencies across member-facing platforms
Demonstrate reduction in provider directory maintenance costs through intelligent automation
Goal
Develop a simplified Agentic AI system that automates basic provider data validation using publicly available sources, demonstrates intelligent data quality improvement, and showcases the potential for full-scale provider data management automation with synthetic and public data sources.
Provider data includes
Demographics: Name, contact information
Professional Details: Specialties, licenses, certifications
Network Affiliations: Insurance networks, affiliations with other providers or groups
Location and Facilities: Addresses, medical imaging facilities
Key Deliverable (Demo)
Demonstration Scenario: Automated validation and updating of 200 provider profiles using publicly available data sources.
Input: Sample provider dataset with names, addresses, phone numbers, specialties, and basic credential information. Must include scanned pdf (unstructured data) formats.
Process: AI agent automatically validates contact information via web scraping, checks credentials against public databases, identifies inconsistencies, and flags providers needing manual review
Output: User interface to show updated provider profiles with confidence scores, actions status reports, prioritized list of providers requiring human attention, and generate communication email.
Timeline: Complete validation cycle in under 30 minutes versus traditional manual work
Agentic AI Roles (Suggested / Illustrative)
Data Validation Agent:
Performs automated web scraping of provider practice websites to verify current contact information and services
Cross-references provider information against public databases including NPI registry and state licensing boards
Conducts intelligent phone number and address validation using publicly available verification services
Generates confidence scores for each data element based on source reliability and cross-validation results
Information Enrichment Agent:
Searches public sources for additional provider information including education, board certifications, and specialties
Analyzes provider websites and online profiles for updated practice information and service offerings
Identifies potential network gaps by analyzing geographic distribution and specialty coverage
Creates standardized provider profiles with enriched data from multiple public sources
Quality Assurance Agent:
Compares provider information across multiple sources to identify discrepancies and inconsistencies
Flags providers with suspicious or potentially fraudulent information for manual review
Tracks data quality metrics and generates reports on validation success rates and common error patterns
Prioritizes providers for manual verification based on member impact and data confidence levels
Directory Management Agent:
Generates updated provider directory entries in multiple formats (web, mobile app, PDF)
Creates automated alerts for providers requiring immediate attention or manual verification
Produces summary reports showing validation results, data quality improvements, and recommended actions
Manages workflow queues for human reviewers with prioritized tasks and supporting documentation
Data and System Assumptions
Publicly Available Data Sources:
NPI Registry (CMS): Free API access for provider basic information, credentials, and practice locations
State Medical Board Websites: Public license verification and disciplinary action information (can be scraped)
Hospital/Health System Websites: Provider directory pages with current practice information and contact details
Google My Business/Maps API: Practice location verification, phone numbers, and patient review data
Medicare Provider Utilization Database: Public claims data showing provider specialties and practice patterns
Synthetic Data Generation:
Provider Profile Generator: Create realistic provider datasets with names, addresses, specialties, and credential information
Validation Scenario Creator: Generate common data quality issues like outdated phone numbers, moved practices, and credential changes
Member Impact Simulator: Create synthetic member complaint data related to provider directory accuracy
Network Coverage Generator: Generate geographic and specialty distribution data for network adequacy analysis