Testing and verification of AI outputs
Two core concepts underpin the credibility of AI-generated insights: accuracy and explainability.
- Accuracy reflects AI’s ability to produce consistent and correct results. Investigators should apply reperformance techniques to verify that the system arrives at the same outcome under similar conditions. This not only validates the tool's effectiveness but also demonstrates to external stakeholders that results were not arbitrary or the product of a “black box.”
- Explainability applies to subjective or judgment-oriented assessments — particularly with decision-making often seen in Agentic AI — such as prioritizing documents, identifying thematic patterns, or suggesting relationships between data points. Here, investigators must be able to follow the model’s rationale, understand why it reached a conclusion and assess whether the logic is reasonable within the broader fact pattern. This ability to test and challenge the AI’s reasoning is essential to establishing that human judgment, not automation, guided the investigation and interpretation of results.
AI should enhance professional expertise — not replace it. Investigators remain responsible for interpreting outputs, assessing their reasonableness, adjusting inputs or parameters and documenting key decisions. They also oversee and verify the accuracy and explainability of AI generated results. Throughout the process, investigators must stay actively engaged: testing outputs, validating that they align with the broader fact pattern and ensuring no model operates with unchecked autonomy. Maintaining clear records of how AI supported the investigation, how outputs were validated and where human judgment guided the analysis is essential to demonstrating reliability and sufficiency during external review.
Consistent with these principles, courts have made clear that uncritical or unchecked reliance on AI — particularly without independent verification, transparency and documented human judgment — can expose organizations to serious legal and evidentiary consequences. Recent federal decisions demonstrate that AI related failures are treated no differently than other lapses in professional responsibility.1 Where parties have relied on AI-generated outputs without meaningful review, courts have imposed significant sanctions, awarded attorneys’ fees and costs and warned that such conduct can undermine the integrity of proceedings and jeopardize case outcomes. These decisions underscore a central expectation: AI may assist professional work, but responsibility for accuracy, reasonableness and reliability remains firmly with the humans deploying it.
Protecting privilege
Further, investigators must exercise caution both in the degree to which they rely on AI and in the nature of the information they disclose to generative AI tools. Courts have clarified that the use of AI tools does not, standing alone, create or preserve attorney‑client privilege or work‑product protection. In particular, disclosures to publicly available AI platforms may undermine confidentiality and result in the waiver of otherwise applicable protections. Recent decisions emphasize that recognized privileges depend on a confidential, fiduciary relationship with a licensed professional and do not extend to interactions with non‑human, publicly accessible systems.2 Accordingly, investigators and counsel must exercise care in determining what information is shared with AI tools and must ensure that AI is used in a manner consistent with privilege, confidentiality obligations and professional oversight.