AskyourDocuments
1.0.0
Welcome to the Document QA system! This repository contains the code for a system that allows you to ask questions about your documents and get answers based on their contents. It supports a wide range of document formats, including PDF, Word, Excel, PowerPoint, text files, and even images!
git clone https://github.com/AiGptCode/AskyourDocuments.git
pip install transformers langchain fitz pillow textract pandas python-pptx opencv-python concurrent-futures
export HUGGINGFACE_API_TOKEN=your-api-key
main.py
script and enter the path to the directory containing your documents:python AskyourDocuments.py
Note: If you want to include images in your search, make sure they are in a supported format (e.g., JPEG, PNG) and are located in the same directory as your other documents.
If you would like to contribute to this project, please follow these steps:
git checkout -b my-feature-branch
git commit -am 'Add some feature'
git push origin my-feature-branch
This project is licensed under the MIT License.
langchain
library for providing utilities for creating and working with language modelsfitz
library for providing utilities for working with PDF filesPillow
library for providing utilities for working with image filestextract
library for providing utilities for extracting text from various file formatspandas
library for providing utilities for working with tabular data in Pythonpython-pptx
library for providing utilities for working with PowerPoint filesconcurrent-futures
library for providing a high-level interface for asynchronously executing callablesopencv-python
library for providing utilities for working with image and video data (for image support)