The general version of the Yide website data collection system is a system that collects data from selected websites by writing or downloading rules. It can collect data from most websites and save image files. It is an indispensable data collection tool for website building. Moreover, the collector is an open source code with Chinese annotations, making it easy to modify and learn. The collection system has the following characteristics:
Mainstream language - written in php+mysql, just install the corresponding server.
Completely open source - open source code, and the code has Chinese comments to facilitate management, learning and communication.
Rule customization - collection rules can be customized and most website content can be collected.
Data modification - Customize modification rules and optimize data content.
Data saving - In array form, serialized data is saved to files or databases for easy uploading and calling.
Image reading - can read images of content and save them locally.
Encoding control - Convert encoding, you can save gb2312, gbk and other encodings to utf-8.
Tag cleaning - you can customize the retained tags and clean up unnecessary tags.
Security performance - Reading is controlled by password, and remote reading is also safe.
Simple operation - one-click reading operation, you can read in groups according to rules, or read by specifying a rule ID, and read with a single ID.
Rule grouping - Read data according to rule groups and update the collected data in a timely manner.
Customized reading - reading data according to custom rule ids, which is more effective and timely.
JS reading - Use js to control the reading time and reduce the server load.
Timeout control - The page execution time can be set to reduce timeout errors.
Multiple reads - You can set multiple read controls for web pages to read data more efficiently.
Error control - If errors occur multiple times, reading can be stopped to reduce server resource usage.
Load control - Save data in multiple folders, which can effectively solve the server load under multiple files.
Data modification - Not only can you browse the data, but you can also modify the main data.
Rule Analysis - You can share your rules with others so that more people can use them.
Rule download - Download sharing rules and quickly get the content you need.
Expand