building qa app with aws bedrock kendra s3 and streamlit


Introduction

This repository contains a practical example of how to build a Q&A app that answers questions about your private documents using AWS GenAI services.

Previously, I built a Q&A app using Azure OpenAI GPT-4 and Pinecone (you can find it here).
In this repository I will build exactly the same app, but using only AWS services (plus Streamlit for the UI).

To be more precise, the app has the following architecture:

[Figure: architecture diagram]

And it uses the following technologies:

AWS Bedrock
AWS Kendra
AWS S3
Streamlit

Content

This repository contains the following app:

A Streamlit app, which allows us to query the data stored in AWS Kendra using one of the available AWS Bedrock LLMs.

How the Q&A app works

The private documents are stored in an S3 bucket.
The Kendra index is configured to use an S3 connector. The index checks the S3 bucket every N minutes for new content. If new content is found in the bucket, it is automatically parsed and stored in the Kendra index.
When a user runs a query through the Streamlit app, the app follows these steps:
- Retrieves the relevant information for the given query from Kendra.
- Assembles the prompt.
- Sends the prompt to one of the available Bedrock LLMs and receives the answer.
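The three steps above can be sketched in plain Python. This is a minimal illustration, not the code in app.py: the names `build_prompt` and `answer_question`, the prompt wording, and the idea of passing the retriever and the LLM as callables are all assumptions made for the example.

```python
from typing import Callable, List


def build_prompt(question: str, passages: List[str]) -> str:
    """Combine the retrieved passages and the user question into one prompt."""
    # Number each passage so the model can tell the excerpts apart.
    context = "\n\n".join(f"[{i}] {p}" for i, p in enumerate(passages, start=1))
    return (
        "Answer the question using only the context below. "
        "If the context is not enough, say you don't know.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )


def answer_question(
    question: str,
    retrieve: Callable[[str], List[str]],  # hypothetical: wraps the Kendra lookup
    generate: Callable[[str], str],        # hypothetical: wraps the Bedrock call
) -> str:
    passages = retrieve(question)              # step 1: retrieve from Kendra
    prompt = build_prompt(question, passages)  # step 2: assemble the prompt
    return generate(prompt)                    # step 3: send the prompt to the LLM
```

Keeping retrieval and generation behind plain callables makes the flow easy to try with stub functions before any AWS resource exists.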

Jupyter notebooks

If you prefer to set up the RAG pattern step by step instead of using the Streamlit app, there is a Jupyter notebook (/notebooks/rag-with-langchain.ipynb) that lets you do exactly that.
The Streamlit app makes heavy use of the LangChain library to implement the RAG pattern. If you prefer not to use any third-party libraries and to set up the RAG pattern with the boto3 library alone, a second Jupyter notebook (/notebooks/rag-with-only-boto3.ipynb) shows how.
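To give an idea of what the boto3-only route involves, here is a minimal sketch. It is not taken from the notebook: the helper names, the `top_k` default, the generation settings, and the choice of the `amazon.titan-text-express-v1` model ID are assumptions. The clients are passed in as arguments and would normally be created with `boto3.client("kendra", ...)` and `boto3.client("bedrock-runtime", ...)`.

```python
import json


def retrieve_passages(kendra_client, index_id, query, top_k=3):
    """Return the text of the top passages Kendra finds for the query."""
    response = kendra_client.retrieve(
        IndexId=index_id, QueryText=query, PageSize=top_k
    )
    return [item["Content"] for item in response.get("ResultItems", [])]


def invoke_titan(bedrock_runtime_client, prompt,
                 model_id="amazon.titan-text-express-v1"):  # assumed model ID
    """Send a prompt to an Amazon Titan text model and return the generated text."""
    # Request and response shapes below follow the Titan text format;
    # other Bedrock providers expect different JSON bodies.
    body = json.dumps({
        "inputText": prompt,
        "textGenerationConfig": {"maxTokenCount": 512, "temperature": 0.0},
    })
    response = bedrock_runtime_client.invoke_model(
        modelId=model_id,
        body=body,
        contentType="application/json",
        accept="application/json",
    )
    payload = json.loads(response["body"].read())
    return payload["results"][0]["outputText"]
```

The passages returned by `retrieve_passages` would be joined into the prompt before calling `invoke_titan`.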

AWS Infrastructure

In the /infra folder, you'll find a set of Terraform files that create every AWS service the app needs to work.

These Terraform files will create the following resources:

An S3 bucket for housing our private docs.
A Kendra index with an S3 connector.
An IAM role with the required permissions to make everything work.

Prerequisites

There are a few prerequisites that you should be aware of before attempting to run the application.

AWS Bedrock third-party LLMs

By default, in Bedrock you will have access only to the Amazon Titan LLM. To use any of the third-party LLMs (the Anthropic and AI21 Labs models), you must request access separately.

The "Model Access" section gives you an overview of which LLMs you have access to and which ones you do not.

To use every feature of this application, you must have access to all of the AWS Bedrock third-party LLMs.
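As a complement to the console view, the models Bedrock offers in a region can also be listed from code. The sketch below is an assumption-laden helper, not part of the repository: `models_by_provider` is a hypothetical name, and the client is expected to be a `boto3.client("bedrock", ...)` control-plane client. Note that the listing shows what the region offers, which is not necessarily what your account has been granted access to.

```python
from collections import defaultdict


def models_by_provider(bedrock_client):
    """Group the foundation model IDs returned by Bedrock by provider name."""
    response = bedrock_client.list_foundation_models()
    grouped = defaultdict(list)
    for summary in response.get("modelSummaries", []):
        grouped[summary["providerName"]].append(summary["modelId"])
    # Sort the IDs so the output is stable between calls.
    return {provider: sorted(ids) for provider, ids in grouped.items()}
```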

boto3 credentials

Within the app, boto3 is set up to retrieve the AWS credentials from the default profile of the AWS config file on your local machine.
Feel free to adjust this to suit your preferences, whether that means using environment variables, passing credentials as parameters, or any other suitable method.

For an overview of the multiple approaches for configuring boto3 credentials, go to the following link:

https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html
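As an illustration of switching between these approaches, the hypothetical helper below builds the keyword arguments for `boto3.Session`: explicit keys from the environment when they are present, otherwise a named profile. The function name and the fallback order are assumptions for this sketch; boto3 already resolves credentials through its own chain, so the helper only makes the choice explicit.

```python
import os


def session_kwargs(env=None):
    """Build keyword arguments for boto3.Session from an environment mapping."""
    env = os.environ if env is None else env
    if env.get("AWS_ACCESS_KEY_ID") and env.get("AWS_SECRET_ACCESS_KEY"):
        kwargs = {
            "aws_access_key_id": env["AWS_ACCESS_KEY_ID"],
            "aws_secret_access_key": env["AWS_SECRET_ACCESS_KEY"],
        }
        # Temporary credentials also carry a session token.
        if env.get("AWS_SESSION_TOKEN"):
            kwargs["aws_session_token"] = env["AWS_SESSION_TOKEN"]
        return kwargs
    # No explicit keys: fall back to a profile from the AWS config file.
    return {"profile_name": env.get("AWS_PROFILE", "default")}
```

A session would then be created with `boto3.Session(**session_kwargs())`, and the Kendra and Bedrock clients obtained from it via `session.client(...)`.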

How to run the app

Before trying to run the app, read the AWS Infrastructure and Prerequisites sections.

Run the app locally

Set the required environment variables

The repository has a .env file that contains the environment variables that the app requires to run successfully:

KENDRA_INDEX='<kendra-index>'
AWS_BEDROCK_REGION='<bedrock-region>'
AWS_KENDRA_REGION='<region-where-kendra-index-is-deployed>'

Change the values accordingly.
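One way the app could read and validate these variables at startup is sketched below. `load_settings` and `REQUIRED_VARS` are hypothetical names, and the sketch assumes the values are already present in the process environment (for example, exported in the shell or loaded from the .env file by a separate tool).

```python
import os

# The three variables listed in the .env file.
REQUIRED_VARS = ("KENDRA_INDEX", "AWS_BEDROCK_REGION", "AWS_KENDRA_REGION")


def load_settings(env=None):
    """Return the required settings, failing fast if any is missing or empty."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: env[name] for name in REQUIRED_VARS}
```

Failing at startup with the list of missing names is easier to diagnose than a boto3 error raised later in the middle of a query.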

Restore dependencies

pip install -r requirements.txt

Run the app

When you install Streamlit, a command-line (CLI) tool gets installed as well. The purpose of this tool is to run Streamlit apps.
To execute the app, just run the following command:

streamlit run app.py

Run the app using Docker

This repository has a Dockerfile in case you prefer to run the app in a container.

Build the image

docker build -t aws-rag-app .

Run it

docker run -p 5050:5050 \
        -e KENDRA_INDEX="<kendra-index>" \
        -e AWS_BEDROCK_REGION="<bedrock-region>" \
        -e AWS_KENDRA_REGION="<region-where-kendra-index-is-deployed>" \
        -e AWS_ACCESS_KEY_ID="<aws-access-key>" \
        -e AWS_SECRET_ACCESS_KEY="<aws-secret-access-key>" \
        aws-rag-app

Output

Changelog

10/02/2023

Updated app.py to make it compatible with the latest Bedrock changes.
Updated Jupyter Notebooks to make them compatible with the latest Bedrock changes.
Removed the boto3 and botocore wheel files. A boto3 version compatible with Bedrock is finally publicly available.

Additional Information

Version 1.0.0
Type AI Source Code
Update Time 2024-12-25
size 11.93MB
From Github
