Build a self-service digital assistant using Amazon Lex and Knowledge …

Organizations strive to implement efficient, scalable, cost-effective, and automated customer support solutions without compromising the customer experience. Generative artificial intelligence (AI)-powered chatbots play a crucial role in delivering human-like interactions by providing responses from a knowledge base without the involvement of live agents. These chatbots can handle generic inquiries efficiently, freeing live agents to focus on more complex tasks.
Amazon Lex provides advanced conversational interfaces using voice and text channels. Its natural language understanding capabilities identify user intent more accurately and fulfill that intent faster.
Amazon Bedrock simplifies the process of developing and scaling generative AI applications powered by large language models (LLMs) and other foundation models (FMs). It offers access to a diverse range of FMs from leading providers such as Anthropic, AI21 Labs, Cohere, and Stability AI, as well as Amazon’s proprietary Amazon Titan models. Additionally, Knowledge Bases for Amazon Bedrock empowers you to develop applications that harness the power of Retrieval Augmented Generation (RAG), an approach where retrieving relevant information from data sources enhances the model’s ability to generate contextually appropriate and informed responses.
The generative AI capability of QnAIntent in Amazon Lex lets you securely connect FMs to company data for RAG. QnAIntent provides an interface to use enterprise data and FMs on Amazon Bedrock to generate relevant, accurate, and contextual responses. You can use QnAIntent with new or existing Amazon Lex bots to automate FAQs through text and voice channels, such as Amazon Connect.
With this capability, you no longer need to create variations of intents, sample utterances, slots, and prompts to predict and handle a wide range of FAQs. You can simply connect QnAIntent to company knowledge sources and the bot can immediately handle questions using the allowed content.
In this post, we demonstrate how you can build a chatbot with QnAIntent that connects to a knowledge base in Amazon Bedrock (powered by Amazon OpenSearch Serverless as a vector database) to deliver rich, self-service, conversational experiences for your customers.
Solution overview
The solution uses Amazon Lex, Amazon Simple Storage Service (Amazon S3), and Amazon Bedrock in the following steps:

Users interact with the chatbot through a prebuilt Amazon Lex web UI.
Each user request is processed by Amazon Lex to determine user intent through a process called intent recognition.
Amazon Lex provides the built-in generative AI feature QnAIntent, which can be directly attached to a knowledge base to fulfill user requests.
Knowledge Bases for Amazon Bedrock uses the Amazon Titan embeddings model to convert the user query to a vector and queries the knowledge base to find the chunks that are semantically similar to the user query. The user prompt is then augmented with the results returned from the knowledge base as additional context and sent to the LLM to generate a response.
The generated response is returned through QnAIntent and sent back to the user in the chat application through Amazon Lex.
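
The retrieval and augmentation described in these steps can be illustrated with a toy sketch. It is not what Knowledge Bases for Amazon Bedrock runs internally: the embed function is a hypothetical word-count stand-in for the Amazon Titan embeddings model, the CHUNKS list holds made-up placeholder text, and the prompt wording is our own assumption. It only shows the shape of the flow: embed the query, rank chunks by similarity, and prepend the best matches to the prompt.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding: word counts (a stand-in for a real embedding model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, top_k=2):
    """Return the top_k chunks most similar to the query."""
    query_vector = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(query_vector, embed(c)), reverse=True)
    return ranked[:top_k]


def augment(query, context_chunks):
    """Build the augmented prompt that would be sent to the LLM."""
    context = "\n".join(f"- {chunk}" for chunk in context_chunks)
    return (
        "Use only the context below to answer.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


# Placeholder chunks, invented for illustration only.
CHUNKS = [
    "the bot answers questions from the knowledge base",
    "the web ui is deployed with a cloudformation template",
    "documents are stored in an s3 bucket",
]
```

Calling augment(query, retrieve(query, CHUNKS)) yields the text a generation model would receive; in the actual solution, QnAIntent and the knowledge base handle both steps for you.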

The following diagram illustrates the solution architecture and workflow.

In the following sections, we look at the key components of the solution in more detail and the high-level steps to implement the solution:

Create a knowledge base in Amazon Bedrock for OpenSearch Serverless.
Create an Amazon Lex bot.
Create a new generative AI-powered intent in Amazon Lex using the built-in QnAIntent and point it to the knowledge base.
Deploy the sample Amazon Lex web UI available in the GitHub repo. Use the provided AWS CloudFormation template in your preferred AWS Region and configure the bot.

Prerequisites
To implement this solution, you need the following:

An AWS account with privileges to create AWS Identity and Access Management (IAM) roles and policies. For more information, see Overview of access management: Permissions and policies.
Familiarity with AWS services such as Amazon S3, Amazon Lex, Amazon OpenSearch Service, and Amazon Bedrock.
Access enabled for the Amazon Titan Embeddings G1 – Text model and Anthropic Claude 3 Haiku on Amazon Bedrock. For instructions, see Model access.
A data source in Amazon S3. For this post, we use Amazon shareholder docs (Amazon Shareholder letters – 2023 & 2022) as a data source to hydrate the knowledge base.

Create a knowledge base
To create a new knowledge base in Amazon Bedrock, complete the following steps. For more information, refer to Create a knowledge base.

On the Amazon Bedrock console, choose Knowledge bases in the navigation pane.
Choose Create knowledge base.
On the Provide knowledge base details page, enter a knowledge base name, IAM permissions, and tags.
Choose Next.
For Data source name, Amazon Bedrock prepopulates the auto-generated data source name; however, you can change it to your requirements.
Keep the data source location as the same AWS account and choose Browse S3.
Select the S3 bucket where you uploaded the Amazon shareholder documents and choose Choose. This populates the S3 URI field.
Choose Next.
Select the embedding model to vectorize the documents. For this post, we select Titan Embeddings G1 – Text v1.2.
Select Quick create a new vector store to create a default vector store with OpenSearch Serverless.
Choose Next.
Review the configurations and create your knowledge base. After the knowledge base is successfully created, you should see a knowledge base ID, which you need when creating the Amazon Lex bot.
Choose Sync to index the documents.
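
If you prefer to trigger the sync from a script instead of the console, the following is a minimal sketch using the start_ingestion_job and get_ingestion_job operations of the boto3 bedrock-agent client. The function name, the polling interval, the poll limit, and the set of terminal statuses are our assumptions; the knowledge base ID and data source ID come from the resources you created above.

```python
import time

# Statuses after which we stop polling (assumed set; check the API reference).
TERMINAL_STATUSES = {"COMPLETE", "FAILED", "STOPPED"}


def sync_knowledge_base(client, knowledge_base_id, data_source_id,
                        poll_seconds=10, max_polls=60, sleep=time.sleep):
    """Start an ingestion job and poll until it reaches a terminal status.

    `client` is expected to behave like boto3.client("bedrock-agent").
    Returns the last observed status string.
    """
    job = client.start_ingestion_job(
        knowledgeBaseId=knowledge_base_id,
        dataSourceId=data_source_id,
    )["ingestionJob"]
    job_id = job["ingestionJobId"]
    status = job["status"]

    polls = 0
    while status not in TERMINAL_STATUSES and polls < max_polls:
        sleep(poll_seconds)
        job = client.get_ingestion_job(
            knowledgeBaseId=knowledge_base_id,
            dataSourceId=data_source_id,
            ingestionJobId=job_id,
        )["ingestionJob"]
        status = job["status"]
        polls += 1
    return status


# Example usage (requires boto3 and AWS credentials; IDs are placeholders):
#   import boto3
#   client = boto3.client("bedrock-agent")
#   print(sync_knowledge_base(client, "<knowledge-base-id>", "<data-source-id>"))
```

Passing the client and the sleep function as arguments keeps the polling logic easy to exercise without calling AWS.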

Create an Amazon Lex bot
Complete the following steps to create your bot:

On the Amazon Lex console, choose Bots in the navigation pane.
Choose Create bot.
For Creation method, select Create a blank bot.
For Bot name, enter a name (for example, FAQBot).
For Runtime role, select Create a new IAM role with basic Amazon Lex permissions to access other services on your behalf.
Configure the remaining settings based on your requirements and choose Next.
On the Add language to bot page, you can choose from the supported languages. For this post, we choose English (US).
Choose Done. After the bot is successfully created, you’re redirected to create a new intent.
Add utterances for the new intent and choose Save intent.
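
The same bot can be created programmatically. The following is a rough sketch using the create_bot and create_bot_locale operations of the boto3 lexv2-models client. Unlike the console flow, the API does not create the runtime role for you, so the sketch assumes an existing IAM role ARN; the function name, session timeout, and confidence threshold are also illustrative assumptions.

```python
def create_faq_bot(client, bot_name, role_arn, locale_id="en_US"):
    """Create a blank Lex V2 bot and add one locale to its draft version.

    `client` is expected to behave like boto3.client("lexv2-models").
    `role_arn` must point to an existing IAM role that Amazon Lex can assume.
    Returns the new bot ID.
    """
    bot = client.create_bot(
        botName=bot_name,
        roleArn=role_arn,
        dataPrivacy={"childDirected": False},  # assumed; set per your use case
        idleSessionTTLInSeconds=300,           # assumed session timeout
    )
    bot_id = bot["botId"]

    # Bot creation is asynchronous; wait until the bot is available
    # before adding a locale to it.
    client.get_waiter("bot_available").wait(botId=bot_id)

    client.create_bot_locale(
        botId=bot_id,
        botVersion="DRAFT",
        localeId=locale_id,
        nluIntentConfidenceThreshold=0.4,      # assumed threshold
    )
    return bot_id


# Example usage (requires boto3 and AWS credentials; the ARN is a placeholder):
#   import boto3
#   lex = boto3.client("lexv2-models")
#   bot_id = create_faq_bot(lex, "FAQBot", "<runtime-role-arn>")
```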

Add QnAIntent to your intent
Complete the following steps to add QnAIntent:

On the Amazon Lex console, navigate to the intent you created.
On the Add intent dropdown menu, choose Use built-in intent.
For Built-in intent, choose AMAZON.QnAIntent – GenAI feature.
For Intent name, enter a name (for example, QnABotIntent).
Choose Add. After you add the QnAIntent, you’re redirected to configure the knowledge base.
For Select model, choose Anthropic and Claude 3 Haiku.
For Choose a knowledge store, select Knowledge base for Amazon Bedrock and enter your knowledge base ID.
Choose Save intent.
After you save the intent, choose Build to build the bot. You should see a Successfully built message when the build is complete. You can now test the bot on the Amazon Lex console.
Choose Test to launch a draft version of your bot in a chat window within the console.
Enter questions to get responses.
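
For readers who script their bots, the following sketch mirrors the console steps above with boto3: create_intent and build_bot_locale on the lexv2-models client, then recognize_text on the lexv2-runtime client to send a test question. Note that the API takes the knowledge base and model as ARNs rather than the ID entered in the console. The function names and defaults are our own, and TSTALIASID is the alias Amazon Lex uses for the draft test version. In practice you would wait for the build to finish before sending questions.

```python
import uuid


def qna_intent_request(bot_id, knowledge_base_arn, model_arn,
                       intent_name="QnABotIntent", locale_id="en_US"):
    """Build the create_intent arguments for a QnAIntent backed by a
    knowledge base in Amazon Bedrock."""
    return {
        "intentName": intent_name,
        "parentIntentSignature": "AMAZON.QnAIntent",
        "botId": bot_id,
        "botVersion": "DRAFT",
        "localeId": locale_id,
        "qnAIntentConfiguration": {
            "dataSourceConfiguration": {
                "bedrockKnowledgeStoreConfiguration": {
                    "bedrockKnowledgeBaseArn": knowledge_base_arn,
                }
            },
            "bedrockModelConfiguration": {"modelArn": model_arn},
        },
    }


def add_qna_intent_and_build(models_client, bot_id, knowledge_base_arn,
                             model_arn, locale_id="en_US"):
    """Create the QnAIntent and start a build of the draft locale.

    `models_client` is expected to behave like boto3.client("lexv2-models").
    Returns the new intent ID.
    """
    request = qna_intent_request(bot_id, knowledge_base_arn, model_arn,
                                 locale_id=locale_id)
    intent = models_client.create_intent(**request)
    models_client.build_bot_locale(botId=bot_id, botVersion="DRAFT",
                                   localeId=locale_id)
    return intent["intentId"]


def ask_bot(runtime_client, bot_id, text, bot_alias_id="TSTALIASID",
            locale_id="en_US", session_id=None):
    """Send one question to the bot and return the text of its replies.

    `runtime_client` is expected to behave like boto3.client("lexv2-runtime").
    """
    response = runtime_client.recognize_text(
        botId=bot_id,
        botAliasId=bot_alias_id,
        localeId=locale_id,
        sessionId=session_id or str(uuid.uuid4()),
        text=text,
    )
    return [m["content"] for m in response.get("messages", []) if "content" in m]
```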

Deploy the Amazon Lex web UI
The Amazon Lex web UI is a prebuilt fully featured web client for Amazon Lex chatbots. It eliminates the heavy lifting of recreating a chat UI from scratch. You can quickly deploy its features and minimize time to value for your chatbot-powered applications. Complete the following steps to deploy the UI:

Follow the instructions in the GitHub repo.
Before you deploy the CloudFormation template, update the LexV2BotId and LexV2BotAliasId values in the template based on the chatbot you created in your account.
After the CloudFormation stack is deployed successfully, copy the WebAppUrl value from the stack Outputs tab.
Navigate to the web UI to test the solution in your browser.
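
If you deploy the stack from a script rather than the console, the parameter overrides and the output lookup might look like the following sketch. The LexV2BotId and LexV2BotAliasId parameter keys and the WebAppUrl output key come from the steps above; the function names, the template URL placeholder, and the capabilities list are assumptions, so check the GitHub repo for what the template actually requires.

```python
def web_ui_stack_request(stack_name, template_url, bot_id, bot_alias_id):
    """Build arguments for CloudFormation create_stack with the bot
    parameters overridden. `template_url` is whatever URL the repo provides."""
    return {
        "StackName": stack_name,
        "TemplateURL": template_url,
        "Parameters": [
            {"ParameterKey": "LexV2BotId", "ParameterValue": bot_id},
            {"ParameterKey": "LexV2BotAliasId", "ParameterValue": bot_alias_id},
        ],
        # Assumed capabilities; the template may need a different set.
        "Capabilities": ["CAPABILITY_NAMED_IAM", "CAPABILITY_AUTO_EXPAND"],
    }


def web_app_url(describe_stacks_response):
    """Extract the WebAppUrl output from a describe_stacks response,
    or return None if the output is not present yet."""
    stacks = describe_stacks_response.get("Stacks", [])
    if not stacks:
        return None
    for output in stacks[0].get("Outputs", []):
        if output.get("OutputKey") == "WebAppUrl":
            return output.get("OutputValue")
    return None


# Example usage (requires boto3 and AWS credentials; values are placeholders):
#   import boto3
#   cfn = boto3.client("cloudformation")
#   cfn.create_stack(**web_ui_stack_request(
#       "lex-web-ui", "<template-url>", "<bot-id>", "<bot-alias-id>"))
#   cfn.get_waiter("stack_create_complete").wait(StackName="lex-web-ui")
#   print(web_app_url(cfn.describe_stacks(StackName="lex-web-ui")))
```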

Clean up
To avoid incurring unnecessary future charges, clean up the resources you created as part of this solution:

Delete the Amazon Bedrock knowledge base and the data in the S3 bucket if you created one specifically for this solution.
Delete the Amazon Lex bot you created.
Delete the CloudFormation stack.
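
The cleanup can also be scripted. The following is a minimal sketch, assuming three boto3 clients (bedrock-agent, lexv2-models, and cloudformation) and the IDs of the resources created in this post; the function name is hypothetical. It does not remove the objects in the S3 bucket or the OpenSearch Serverless vector store, so check those separately.

```python
def clean_up(bedrock_agent, lex_models, cloudformation,
             knowledge_base_id, bot_id, stack_name):
    """Delete the knowledge base, the Lex bot, and the web UI stack.

    The clients are expected to behave like boto3.client("bedrock-agent"),
    boto3.client("lexv2-models"), and boto3.client("cloudformation").
    Returns the names of the delete calls issued, in order.
    """
    issued = []

    bedrock_agent.delete_knowledge_base(knowledgeBaseId=knowledge_base_id)
    issued.append("delete_knowledge_base")

    # skipResourceInUseCheck=True deletes the bot even if an alias is in use.
    lex_models.delete_bot(botId=bot_id, skipResourceInUseCheck=True)
    issued.append("delete_bot")

    cloudformation.delete_stack(StackName=stack_name)
    issued.append("delete_stack")

    return issued


# Example usage (requires boto3 and AWS credentials; values are placeholders):
#   import boto3
#   clean_up(boto3.client("bedrock-agent"), boto3.client("lexv2-models"),
#            boto3.client("cloudformation"),
#            "<knowledge-base-id>", "<bot-id>", "<stack-name>")
```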

Conclusion
In this post, we discussed the significance of generative AI-powered chatbots in customer support systems. We then provided an overview of the new Amazon Lex feature, QnAIntent, designed to connect FMs to your company data. Finally, we demonstrated a practical use case of setting up a Q&A chatbot to analyze Amazon shareholder documents. This implementation not only provides prompt and consistent customer service, but also empowers live agents to dedicate their expertise to resolving more complex issues.
Stay up to date with the latest advancements in generative AI and start building on AWS. If you’re seeking assistance on how to begin, check out the Generative AI Innovation Center.

About the Authors
Supriya Puragundla is a Senior Solutions Architect at AWS. She has over 15 years of IT experience in software development, design, and architecture. She helps key customer accounts on their data, generative AI, and AI/ML journeys. She is passionate about data-driven AI, and her areas of depth are ML and generative AI.
Manjula Nagineni is a Senior Solutions Architect with AWS based in New York. She works with major financial service institutions, architecting and modernizing their large-scale applications while adopting AWS Cloud services. She is passionate about designing cloud-centered big data workloads. She has over 20 years of IT experience in software development, analytics, and architecture across multiple domains such as finance, retail, and telecom.
Mani Khanuja is a Tech Lead – Generative AI Specialists, author of the book Applied Machine Learning and High Performance Computing on AWS, and a member of the Board of Directors for Women in Manufacturing Education Foundation. She leads machine learning projects in various domains such as computer vision, natural language processing, and generative AI. She speaks at internal and external conferences such as AWS re:Invent, Women in Manufacturing West, YouTube webinars, and GHC 23. In her free time, she likes to go for long runs along the beach.
