SmartImageExtractionEcommerce
1.0.0
This project emphasizes leveraging LLMs and ChatGPT for effective prompting to enhance image extraction precision.
This is an ongoing research project, so the code may not be very clean.
The goal of this project is to extract product images from e-commerce product pages while excluding irrelevant images such as logos or similar product images. This requires handling various languages and filtering based on text content.
Initial Setup
HTML Cleaning
Identifying Product Images
Final Steps