GenAI System Evaluation
Version 1.0.0
This repository contains sample notebooks to demonstrate how to evaluate an LLM-augmented system. It provides tools and methods for local evaluation.
These notebooks were tested with Python 3.12; if you're running locally, ensure you're using Python 3.12. Also ensure that you have the AWS CLI set up with the credentials you want to use configured as the default profile. These credentials need access to Amazon Bedrock models.
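Before starting, you can sanity-check these prerequisites with a small script. This is a sketch; the function name and the specific checks are illustrative, not part of the repository:

```python
import shutil
import sys

def check_prerequisites(required=(3, 12)):
    """Check local prerequisites for running these notebooks."""
    return {
        # The notebooks were tested with Python 3.12
        "python_ok": sys.version_info[:2] >= required,
        # The AWS CLI must be installed and configured (default profile
        # needs access to Amazon Bedrock models)
        "aws_cli_installed": shutil.which("aws") is not None,
    }

print(check_prerequisites())
```

This only verifies that the tools are present; it does not confirm that the default profile actually has Bedrock access.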
LLM-System-Validation/
├── data/               # RAG context and validation datasets
├── example-notebooks/  # Notebooks for evaluating various components
├── script/             # Various scripts for setting up the environment
└── .github/            # Example GitHub Actions workflows
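The `.github/` directory contains example GitHub Actions. A minimal workflow along these lines might look like the following; the file layout and step names here are illustrative, not the repository's actual workflow:

```yaml
name: evaluate
on: [push]
jobs:
  evaluate:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.12"
      - run: pip install -r requirements.txt
      # Credentials with Amazon Bedrock access would be supplied via repo secrets
      - run: jupyter nbconvert --to notebook --execute example-notebooks/*.ipynb
```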
data/
: Contains the datasets used for Retrieval-Augmented Generation (RAG) context and validation.

example-notebooks/
: Jupyter notebooks demonstrating the evaluation of:
Clone the repository:
git clone [email protected]:aws-samples/genai-system-evaluation.git
cd genai-system-evaluation
Set up a virtual environment:
python -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
Install the required dependencies:
pip install -r requirements.txt
Download the OpenSearch docs for RAG context:
cd data && mkdir opensearch-docs && cd opensearch-docs
git clone https://github.com/opensearch-project/documentation-website.git
Go to the example notebooks and start Jupyter:
cd ../../example-notebooks
jupyter notebook
Start at notebook 1 and work your way through them!
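To give a flavor of the retrieval evaluation techniques the notebooks cover, here is a minimal recall@k sketch. The helper is illustrative and not taken from the notebooks themselves:

```python
def recall_at_k(retrieved_ids, relevant_ids, k=5):
    """Fraction of relevant documents that appear in the top-k retrieved results."""
    top_k = set(retrieved_ids[:k])
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    return len(top_k & relevant) / len(relevant)

# If 1 of 2 relevant docs appears in the top 2, recall@2 is 0.5
print(recall_at_k(["a", "b", "c"], ["a", "c"], k=2))
```

Metrics like this score the retrieval component in isolation, separately from the quality of the LLM's generated answer.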
Explore the example-notebooks/ directory to understand different evaluation techniques.

See CONTRIBUTING for more information.
This library is licensed under the MIT-0 License. See the LICENSE file.