PyScreen is a tool that extracts, analyzes, and visualizes information from screen recording videos. It combines computer vision, OCR-based text processing, and OpenAI's GPT-4 model to turn your screen recordings into data-rich insights.
Screen Extraction: Extracts the individual screens, and the data they contain, from any given video file.
Screen Blur: Blurs sensitive information (e.g. passwords, credit card numbers) in your screens, ensuring privacy and security.
Text Extraction: Extracts text from screens using Tesseract OCR, enabling text-based analysis.
Word Cloud Generation: Generates dynamic word clouds reflecting the most frequently used words in your screens, offering a quick content overview.
Color Analysis: Identifies the dominant color schemes prevalent in your screens, providing a snapshot of aesthetic and design choices.
GPT Analysis: Employs OpenAI's GPT-4 model to generate human-like text descriptions based on word data extracted from screens, enhancing content understanding and discoverability.
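Two of the features above, word frequency counting (the input to a word cloud) and dominant color detection, can be illustrated with a minimal standard-library sketch. This is not PyScreen's actual implementation: the function names, the tiny stop-word set, and the quantization step of 32 are assumptions chosen for illustration. PyScreen itself lists nltk, WordCloud, and cv2 for this work.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word set (nltk ships fuller lists).
STOP_WORDS = {"the", "a", "an", "and", "or", "to", "of", "in", "is", "for"}


def word_frequencies(texts, top_n=5):
    """Count the most frequent non-stop-words across text from several screens."""
    counts = Counter()
    for text in texts:
        # Lowercase the text and keep alphabetic tokens of two or more letters.
        tokens = re.findall(r"[a-z]{2,}", text.lower())
        counts.update(t for t in tokens if t not in STOP_WORDS)
    return counts.most_common(top_n)


def dominant_colors(pixels, top_n=3, step=32):
    """Return the most common colors after coarse quantization.

    `pixels` is an iterable of (r, g, b) tuples with values in 0..255.
    """
    counts = Counter()
    for r, g, b in pixels:
        # Snap each channel to the lower edge of its bucket so that
        # near-identical shades are counted together.
        bucket = (r // step * step, g // step * step, b // step * step)
        counts[bucket] += 1
    return counts.most_common(top_n)


if __name__ == "__main__":
    screens = ["Login to the dashboard", "Dashboard settings and login history"]
    print(word_frequencies(screens, top_n=2))

    pixels = [(250, 250, 250)] * 3 + [(10, 20, 30)]
    print(dominant_colors(pixels, top_n=1))
```

Quantizing before counting keeps anti-aliased or slightly noisy pixels from splitting one visual color into many near-duplicates.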
Clone the repository:
git clone https://github.com/alexandrevl/pyscreen.git
Navigate to the project directory:
cd pyscreen
Run PyScreen on your preferred screen recording:
python main.py --input your_screen_recording.mp4
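For readers curious how a `--input` flag like the one above is typically handled, here is a minimal argparse sketch. It is an assumption about how such an entry point could look, not a copy of PyScreen's main.py; `build_parser` and `parse_args` are hypothetical names.

```python
import argparse


def build_parser():
    """Build a parser that accepts a single required --input path."""
    parser = argparse.ArgumentParser(
        prog="main.py",
        description="Analyze a screen recording video.",
    )
    parser.add_argument(
        "--input",
        required=True,
        help="Path to the screen recording video file",
    )
    return parser


def parse_args(argv=None):
    # With argv=None, argparse reads the arguments from sys.argv.
    return build_parser().parse_args(argv)


if __name__ == "__main__":
    args = parse_args()
    print(f"Processing {args.input}")
```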
PyScreen relies on the following Python libraries:
gc for optimal memory management
json and pandas for effortless data manipulation
cv2 (opencv) for comprehensive image processing
pytesseract for OCR functionality
nltk for advanced text processing
WordCloud for crafting visually compelling word clouds
openai for utilizing the GPT-4 model
python-dotenv for storing sensitive information
unidecode for handling Unicode characters
Ensure these dependencies are installed prior to running PyScreen. You can install them using pip:
pip install opencv-python pytesseract nltk wordcloud openai pandas python-dotenv unidecode
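To confirm that the installation succeeded, a short check like the following can report which packages are still missing. It is a sketch, not part of PyScreen: `REQUIRED_MODULES` and `missing_packages` are hypothetical names. Note that the import name differs from the pip name for some packages (for example, `cv2` for opencv-python and `dotenv` for python-dotenv).

```python
import importlib.util

# Maps each import name to the pip package that provides it.
REQUIRED_MODULES = {
    "cv2": "opencv-python",
    "pytesseract": "pytesseract",
    "nltk": "nltk",
    "wordcloud": "wordcloud",
    "openai": "openai",
    "pandas": "pandas",
    "dotenv": "python-dotenv",
    "unidecode": "unidecode",
}


def missing_packages(required=REQUIRED_MODULES):
    """Return the pip names of packages whose module cannot be found."""
    return [
        pip_name
        for module, pip_name in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages: pip install " + " ".join(missing))
    else:
        print("All Python dependencies are installed.")
```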
To use the GPT analysis feature, you must have an OpenAI API key, which you can obtain from OpenAI. Once you have your API key, create a .env file in the project directory and add the following line:
OPENAI_API_KEY=your_api_key
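The sketch below shows one way to verify that the key is readable before starting an analysis. It uses a small standard-library parser as a stand-in so that it runs without extra packages; in practice python-dotenv's `load_dotenv()` does this job. The helper names `read_env_file` and `get_api_key` are hypothetical and not taken from PyScreen.

```python
import os


def read_env_file(path=".env"):
    """Parse simple KEY=value lines from a .env file into a dict."""
    values = {}
    if not os.path.exists(path):
        return values
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            # Skip blank lines, comments, and lines without an assignment.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_api_key(path=".env"):
    """Return the OpenAI API key, preferring a real environment variable."""
    key = os.environ.get("OPENAI_API_KEY") or read_env_file(path).get("OPENAI_API_KEY")
    if not key:
        raise RuntimeError("OPENAI_API_KEY is not set; add it to your .env file")
    return key
```

Failing early with a clear message avoids running the slower extraction and OCR steps only to hit an authentication error at the GPT stage.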
Additionally, Tesseract OCR must be installed on your system, because pytesseract is only a wrapper around the Tesseract executable. Refer to the Tesseract installation guide for detailed instructions.
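A quick way to check whether the Tesseract executable is reachable is to look it up on the PATH. This is a minimal sketch; `tesseract_status` is a hypothetical helper, and it assumes the binary is named `tesseract`, which is the usual name but may differ on some setups.

```python
import shutil


def tesseract_status(binary="tesseract"):
    """Report whether the given executable can be found on the PATH."""
    path = shutil.which(binary)
    if path is None:
        return f"{binary} not found on PATH; install Tesseract OCR first"
    return f"{binary} found at {path}"


if __name__ == "__main__":
    print(tesseract_status())
```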
Your contributions matter! Feel free to submit a pull request to augment the functionality and usability of PyScreen.
PyScreen is open-source software, licensed under the terms of the MIT license. Feel free to share, modify, and distribute.