Blue Sky Collector (SkyCaiji) is a free web crawler for collecting and publishing data. It is developed with PHP and MySQL and can be deployed on a cloud server. It can collect almost all types of web pages, connects seamlessly with various CMS website-building programs, and publishes data in real time without logging in, automatically and without manual intervention. It is a fully cross-platform cloud crawler system for large-scale web data collection.
Features of Blue Sky Collector:
SkyCaiji is a web crawler system developed with PHP and MySQL. It can be deployed on cloud servers and virtual hosts, and data can be collected through a browser. The software is free for unlimited use, and rules and plug-ins can be customized.
Data collection:
It supports multi-level, multi-page, and paginated collection, with custom collection rules (regular expressions, XPath, JSON, and so on) to match the target content precisely. It can collect almost all types of web pages, and the content of most article-type pages can be identified automatically.
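To illustrate the three rule types named above, here is a minimal sketch in Python. It is not SkyCaiji's own code or rule syntax: the sample page, the JSON response, and the helper names (`extract_by_regex`, `extract_by_xpath`, `extract_by_json`) are all hypothetical, and the XPath helper relies on the limited XPath subset of Python's standard library, which requires well-formed markup.

```python
import json
import re
import xml.etree.ElementTree as ET

# Hypothetical sample page; kept well-formed so the stdlib XML parser accepts it.
PAGE = """<html><body>
<h1 class="title">Hello World</h1>
<div id="content"><p>First paragraph.</p><p>Second paragraph.</p></div>
<a class="next" href="/list?page=2">Next</a>
</body></html>"""

# Hypothetical JSON response, as a list API might return it.
API_RESPONSE = '{"data": {"items": [{"id": 1, "title": "A"}, {"id": 2, "title": "B"}]}}'


def extract_by_regex(text, pattern):
    """Return the first capture group of the first match, or None."""
    match = re.search(pattern, text, re.S)
    return match.group(1) if match else None


def extract_by_xpath(text, path):
    """Return the text of every element matched by a (limited) XPath expression."""
    root = ET.fromstring(text)
    return [element.text for element in root.findall(path)]


def extract_by_json(text, keys):
    """Walk a list of keys or indexes into a decoded JSON document."""
    node = json.loads(text)
    for key in keys:
        node = node[key]
    return node


# Regular-expression rule: the article title.
title = extract_by_regex(PAGE, r'<h1 class="title">(.*?)</h1>')
# XPath rule: every paragraph inside the content block.
paragraphs = extract_by_xpath(PAGE, ".//div[@id='content']/p")
# Paging rule: the link to the next list page.
next_page = extract_by_regex(PAGE, r'<a class="next" href="(.*?)">')
# JSON rule: the titles in a list response.
titles = [item["title"] for item in extract_by_json(API_RESPONSE, ["data", "items"])]
```

Real pages are rarely well-formed XML, so a production crawler would normally use a tolerant HTML parser for the XPath step; the standard-library parser is used here only to keep the sketch self-contained.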
Content publishing:
It connects seamlessly to various CMS website-building programs and imports data without logging in. It supports custom data publishing plug-ins. Data can also be imported directly into a database, saved as Excel files, or published through a remote API.
Cloud deployment and automation:
The software resembles a CMS program: it is fully cross-platform, can be installed on any system, and runs well on a virtual host. It supports fully automatic collection and publishing on a schedule and in set quantities, so continuous collection needs only simple setup.
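The idea of timed, quantity-limited collection can be sketched as follows. This is an illustrative Python example under stated assumptions, not how SkyCaiji implements scheduling: the function names, the queue of example URLs, and the interval and limit values are all hypothetical.

```python
import time
from collections import deque


def collect_batch(pending, limit):
    """Take at most `limit` URLs from the queue (the quantity limit)."""
    batch = []
    while pending and len(batch) < limit:
        batch.append(pending.popleft())
    return batch


def run_schedule(pending, limit, interval_seconds, max_runs, sleep=time.sleep):
    """Run up to `max_runs` batches, pausing `interval_seconds` between them."""
    runs = []
    for run_index in range(max_runs):
        batch = collect_batch(pending, limit)
        if not batch:
            break  # nothing left to collect
        runs.append(batch)
        # Wait only if there is more work and another run is allowed.
        if pending and run_index < max_runs - 1:
            sleep(interval_seconds)
    return runs


# Hypothetical queue of five pages waiting to be collected.
queue = deque(f"https://example.com/post/{i}" for i in range(1, 6))
# The sleep function is replaced by a no-op so the demonstration finishes immediately.
history = run_schedule(queue, limit=2, interval_seconds=600, max_runs=10, sleep=lambda s: None)
```

With a limit of two pages per run, the five queued pages are processed in three runs. On a virtual host, where long-running processes are often not allowed, the same effect is usually achieved by having a scheduled task or a page visit trigger one batch at a time.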