Four steps to implement a large language model successfully

Organizations are eager to adopt LLMs, but navigating potential pitfalls can be challenging.


In brief
  • Companies are pondering how best to prepare for adopting an LLM.
  • Implementation issues include a lack of tools or platforms to develop models, as well as a shortage of artificial intelligence (AI) skills.
  • A comprehensive framework that considers the use case, cost of ownership, AI readiness and hosting architecture can help overcome these obstacles.

Generative AI (GenAI) is revolutionizing industries and reformulating how organizations engage with customers, design products and streamline operations. Large language models (LLMs) are GenAI models that are trained on an enormous amount of text data to understand and generate human-like language, which means they can help organizations process and synthesize data more quickly, uncover patterns and generate valuable insights. However, many companies are grappling with how to get ready to adopt LLMs.

[Chart: Top factors hindering the adoption of AI, such as an LLM. Source: IBM Global AI Adoption Index 2022, IBM Corporation, May 2022]

How a comprehensive AI and LLM framework helps prepare companies

A thorough AI framework that evaluates readiness and addresses potential issues before investing can help organizations get on the right path. For example, some private equity firms are experimenting with LLMs to analyze market trends and patterns, manage documents and automate some functions. They are also considering how GenAI may impact their investing strategy. The following four-step analysis can assist an organization in deciding whether to build its own LLM or work with a partner to facilitate an LLM implementation.

1) Define the use case for adopting an LLM

There is a lot of hype around GenAI and all that it can do. Although it’s a powerful technology, it may not be suitable for some problems and could prove costly if deployed without a clearly defined use case. Use cases related to lower-level customer support, content creation and document analysis tend to be best suited for GenAI experimentation.

Once businesses have zeroed in on the right set of use cases, it is beneficial to start experimenting with one of the pretrained enterprise-scale LLMs, such as OpenAI’s GPT-4. Using a state-of-the-art pretrained model can lead to multiple operational efficiencies by:
  • Streamlining hybrid and multi-cloud management, which enables teams to communicate with cloud infrastructure using natural language queries
  • Simplifying tasks such as monitoring, troubleshooting and maintaining multi-cloud deployments
  • Facilitating the automation of testing by analyzing input materials, such as requirements documents and user stories, to generate structured test cases that can be executed automatically or manually and to create realistic test data (see the sketch after this list)
  • Accelerating software development by automating code reviews and suggesting optimizations; advanced natural language understanding enables the model to analyze code quality and provide actionable feedback, reducing the time required for manual code inspections and improvements
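
To make the testing use case concrete, here is a minimal sketch that asks a pretrained model to turn a user story into structured test cases. It assumes the OpenAI Python SDK (v1+) and an OPENAI_API_KEY environment variable; the model name, prompt wording and the generate_test_cases helper are illustrative choices, not a prescribed setup.

```python
# Minimal sketch: generating structured test cases from a user story
# with a pretrained LLM via the OpenAI Python SDK (v1+).
# Assumes OPENAI_API_KEY is set; model name and prompt are illustrative.
from openai import OpenAI

client = OpenAI()

def generate_test_cases(user_story: str) -> str:
    """Ask the model to draft structured test cases for a user story."""
    response = client.chat.completions.create(
        model="gpt-4",  # any enterprise-scale pretrained model would do
        messages=[
            {"role": "system",
             "content": "You are a QA engineer. Return numbered test cases "
                        "with steps, test data and expected results."},
            {"role": "user", "content": f"User story:\n{user_story}"},
        ],
        temperature=0.2,  # favor consistent, reviewable output
    )
    return response.choices[0].message.content

if __name__ == "__main__":
    story = ("As a customer, I want to reset my password via email "
             "so that I can regain access to my account.")
    print(generate_test_cases(story))
```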

Still, in certain scenarios where pretrained models fail to meet accuracy goals, companies may opt to train or fine-tune a model on proprietary data to improve overall performance.
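
Where a hosted provider supports it, fine-tuning can be as simple as uploading examples and launching a job. The sketch below uses the OpenAI fine-tuning endpoints; the proprietary_examples.jsonl file name and the base model are chosen purely for illustration, and other vendors expose similar workflows.

```python
# Minimal sketch: fine-tuning a hosted model on proprietary data
# via the OpenAI Python SDK (v1+). File name and model are illustrative.
from openai import OpenAI

client = OpenAI()

# 1) Upload proprietary training examples (JSONL of chat-formatted records).
training_file = client.files.create(
    file=open("proprietary_examples.jsonl", "rb"),
    purpose="fine-tune",
)

# 2) Launch the fine-tuning job against a base model.
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",  # base model; availability varies by provider
)

# 3) Track the job and use the resulting model ID once it completes.
print("Fine-tuning job started:", job.id)
```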

 

2) Evaluate AI readiness

The next step is to assess the organization’s AI and machine learning (ML) readiness across three categories: AI capabilities, data and data practices, and analytics capabilities. Skipping this step risks going in unprepared and failing to achieve the project’s goals.

To implement an LLM, it’s also helpful for companies to focus on acquiring or building AI skill sets, including prompt engineering, retrieval-augmented generation (RAG), vector databases and ethical AI practices (a minimal RAG sketch follows the list below). To accomplish this, businesses can:

  • Provide continuous training and upskilling
  • Access professionals with experience in AI development
  • Create a systematic approach to embed ethical AI practices in every aspect of implementation
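
To make the RAG and vector database skill sets concrete, the sketch below embeds a handful of documents, retrieves the closest match by cosine similarity and grounds the model’s answer in it. The embedding and chat model names, the sample documents and the tiny in-memory index are illustrative assumptions; a production system would use a dedicated vector database.

```python
# Minimal RAG sketch: embed documents, retrieve the closest chunk,
# and ground the model's answer in it. Model names and the in-memory
# "index" are illustrative assumptions, not a recommended stack.
import numpy as np
from openai import OpenAI

client = OpenAI()

DOCS = [
    "Our refund policy allows returns within 30 days of purchase.",
    "Premium support is available 24/7 for enterprise customers.",
    "Invoices are issued on the first business day of each month.",
]

def embed(texts):
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

doc_vectors = embed(DOCS)

def answer(question: str) -> str:
    q_vec = embed([question])[0]
    # Cosine similarity against every stored chunk; keep the best match.
    sims = doc_vectors @ q_vec / (
        np.linalg.norm(doc_vectors, axis=1) * np.linalg.norm(q_vec)
    )
    context = DOCS[int(np.argmax(sims))]
    resp = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system",
             "content": "Answer using only the provided context."},
            {"role": "user", "content": f"Context: {context}\n\nQ: {question}"},
        ],
    )
    return resp.choices[0].message.content

print(answer("How long do customers have to return a product?"))
```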

Next, it’s important for a company to analyze its data readiness by determining the following (a toy ingestion sketch appears after the list):

  • The appropriate data analytics and learning pipeline architecture
  • The selection of automated tools that import data from various sources to one target location
  • The level of security in the analytics environment
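
As a toy illustration of importing data from various sources to one target location, the sketch below normalizes records from a hypothetical CSV file and JSON file into a single SQLite table; the file names and schema are invented for the example.

```python
# Toy ingestion sketch: pull records from two hypothetical sources
# (CSV and JSON) and land them in one SQLite target table.
import csv
import json
import sqlite3

conn = sqlite3.connect("analytics.db")
conn.execute(
    "CREATE TABLE IF NOT EXISTS documents (source TEXT, doc_id TEXT, body TEXT)"
)

def load_csv(path: str):
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            yield ("csv", row["id"], row["text"])

def load_json(path: str):
    with open(path) as f:
        for record in json.load(f):
            yield ("json", record["id"], record["text"])

for source in (load_csv("support_tickets.csv"), load_json("contracts.json")):
    conn.executemany("INSERT INTO documents VALUES (?, ?, ?)", source)

conn.commit()
print(conn.execute("SELECT COUNT(*) FROM documents").fetchone()[0], "rows loaded")
```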

Lastly, it’s beneficial for companies to build and fine-tune analytics capabilities, including models and algorithms, to improve performance (a simple validation sketch follows the list). This includes:

  • Rigorous model validation and testing
  • Iterative improvements based on real-world feedback
  • Integration of AI insights into decision-making processes for measurable impact
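
As one way to approach rigorous model validation, the sketch below scores an LLM-backed answering function against a small golden set; the test questions and the answer_fn stand-in are placeholders for whatever system is under evaluation.

```python
# Minimal validation sketch: score a model-backed function against a
# golden set and report accuracy. `answer_fn` and the cases are placeholders.
from typing import Callable

GOLDEN_SET = [
    ("What is the return window?", "30 days"),
    ("When are invoices issued?", "first business day"),
]

def evaluate(answer_fn: Callable[[str], str]) -> float:
    """Fraction of golden answers whose key phrase appears in the output."""
    hits = 0
    for question, expected in GOLDEN_SET:
        output = answer_fn(question)
        if expected.lower() in output.lower():
            hits += 1
    return hits / len(GOLDEN_SET)

# Example with a trivial stand-in model; swap in a real LLM call here.
score = evaluate(lambda q: "Returns are accepted within 30 days.")
print(f"accuracy: {score:.0%}")  # 50% on this two-item golden set
```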

3) Choose the appropriate hosting architecture

Selecting the right architecture for an organization’s unique needs encompasses three vital layers: the pre-processing, middleware and post-processing layers, each described below.


The pre-processing layer in an LLM architecture serves a critical role in handling data. Its responsibilities include collecting and consolidating structured and unstructured data into a container and employing optical character recognition (OCR) to convert non-text inputs into text. It is also responsible for ranking relevant chunks of content and selecting those to send to the model within the token limit, where a token is a fundamental unit of text that a language model reads and processes and the limit is the maximum length of the prompt. Furthermore, it may apply custom detection of personally identifiable information (PII) and mask it to protect sensitive information.
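
The sketch below illustrates two of these pre-processing responsibilities: masking simple PII patterns and greedily packing the highest-ranked chunks into a token budget. Counting tokens as whitespace-separated words and the regular expressions shown are simplifying assumptions; a production system would use a real tokenizer and a vetted PII detector.

```python
# Pre-processing sketch: mask simple PII patterns, then pack the
# highest-ranked chunks into a token budget. Word-count "tokens" and
# the regexes are simplifying assumptions, not production-grade tools.
import re

EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")
SSN = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")

def mask_pii(text: str) -> str:
    return SSN.sub("[SSN]", EMAIL.sub("[EMAIL]", text))

def pack_chunks(scored_chunks, token_limit: int):
    """Greedily keep the best-scoring chunks that fit the prompt budget."""
    selected, used = [], 0
    for score, chunk in sorted(scored_chunks, reverse=True):
        tokens = len(chunk.split())  # crude proxy for a real tokenizer
        if used + tokens <= token_limit:
            selected.append(mask_pii(chunk))
            used += tokens
    return selected

chunks = [
    (0.9, "Contact john.doe@example.com about the Q3 contract renewal."),
    (0.4, "Unrelated marketing boilerplate that scores poorly."),
    (0.7, "Customer SSN 123-45-6789 appears in the intake form."),
]
print(pack_chunks(chunks, token_limit=20))
```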

The middleware layer facilitates interaction between the operating system and various applications. It supports a wide range of programming languages, including Python, .NET and Java, enabling compatibility and smooth communication across different platforms.

The post-processing layer refines the LLM’s output by using prompt engineering to frame queries and by offering a fine-tuning application programming interface (API) for customization on domain-specific data (such as curating financial data for training). It then consolidates and evaluates the results for correctness, addressing bias and drift with targeted mitigation strategies, to improve output consistency, understandability and quality.
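
As one simple way to consolidate and evaluate results in this layer, the sketch below samples the model several times and keeps the most common normalized answer, which can improve output consistency; the sample count and model name are illustrative.

```python
# Post-processing sketch: sample the model several times and keep the
# most frequent answer to improve consistency. Sample count is arbitrary.
from collections import Counter
from openai import OpenAI

client = OpenAI()

def consistent_answer(prompt: str, samples: int = 5) -> str:
    answers = []
    for _ in range(samples):
        resp = client.chat.completions.create(
            model="gpt-4",
            messages=[{"role": "user", "content": prompt}],
            temperature=0.7,  # diversity across samples
        )
        answers.append(resp.choices[0].message.content.strip().lower())
    # Majority vote over normalized answers.
    return Counter(answers).most_common(1)[0][0]

print(consistent_answer("In one word, which month has 28 or 29 days?"))
```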

4) Assess cost, data ownership options and resources

The choice of an LLM implementation approach impacts the complexity and costs, including those associated with:

  • Training
  • Data collection, ingestion and cleansing
  • Hiring data scientists
  • Maintaining the model in production

The selection also greatly affects how much control a company retains over its proprietary data. Proprietary data matters because it can differentiate a company’s product in ways competitors cannot easily replicate, potentially creating a competitive advantage. In addition, proprietary data can be crucial for addressing narrow, business-specific use cases.

 

Also, there are regulatory and ethical reasons for retaining control. For example, depending on the data being stored and processed, regulators may require secure storage and auditability. In addition, uncontrolled language models may generate misleading or inaccurate advice; implementing control measures can help prevent the spread of false information and the harm it poses, for instance, to individuals seeking medical guidance.

Typically, there are three ways to implement an LLM: via an API, platform as a service (PaaS) or self-hosting. Each presents different considerations.

 

Off-the-shelf model via API

Using an API can relieve a company of maintaining both a sizable team of data scientists and the language model itself, which involves handling updates, bug fixes and improvements. Using an API shifts much of this maintenance burden to the provider, allowing the company to focus on its core functionality. In addition, an API can enable on-demand access to the LLM, which is essential for applications that require immediate responses to user queries or interactions.

When a company uses an LLM API, it typically shares data with the API provider. It’s important to review and understand the data usage policies and terms of service to confirm they align with the company’s privacy and compliance requirements. Data ownership also depends on the provider’s terms and conditions: in many cases, companies retain ownership of their data but grant the provider certain usage rights for processing it. It’s beneficial for companies to clarify data ownership in their provider contracts before investing.

 

PaaS

PaaS gives companies access to an LLM as part of a broader platform offering and allows them to operate the model without managing the underlying application infrastructure, middleware or hardware. This approach lets companies control their data, allows domain specificity and model customization during deployment, and can minimize time to value and cost compared with self-hosting. However, companies may incur higher model costs associated with purchasing the rights to build on top of the LLM using their own data. Auditability of the data and the ability to provide comprehensive explanations for results can also pose challenges, since PaaS providers don’t expose the underlying data. In addition, PaaS can result in a greater total cost of ownership for the LLM and can be more complex than using an API.

 

Self-hosting an LLM

This is the most expensive approach because it means building the entire model from scratch and requires mature data processes to fully train, operationalize and deploy an LLM. Furthermore, upgrading the underlying model in a self-hosted implementation is considerably more intensive than a typical software upgrade. On the other hand, it provides maximum control, since the company owns the LLM, along with the ability to customize extensively.

No matter what stage a company is at in defining its GenAI strategy, the EY-Parthenon Software Strategy Group can help transform its vision into reality with frameworks to help identify the right approach for any organization’s unique circumstances.

Conclusion

Advances in deep learning networks are foreshadowing a productivity revolution, which is spurring companies to keep up with the adoption of GenAI technologies. When embarking on an AI initiative that includes an LLM implementation, companies can better inform their decisions by employing a comprehensive AI implementation framework. Taking this approach can help businesses prepare by analyzing their purpose, goals, costs and readiness factors, including regulatory compliance and ethical safeguards.

Lianda Luo, Austin Chen, Naina Wodon, Andrea Lamas-Nino and Ioannis Wallingford of Ernst & Young LLP also contributed to this article.

Summary 

As companies increasingly focus on adopting LLMs, using a comprehensive framework that evaluates readiness and addresses potential issues before investing can help organizations overcome implementation challenges.

