illustrate
This collector is customized for specific websites that cannot be collected by Locomotive. If you need to collect other websites, you need to modify the source code;
I write collection rules, publish modules, customize various software, and am good at cracking various anti-collection;
The source code is written for VS2008. Due to my own limitations, I do not rule out bugs in the program. The source code is for reference only.
The program does not need to set parameters, just click "Start Collection". The following is the technology used in this program
Multithreading using ThreadPool.QueueUserWorkItem();
Use regular expressions to match the strings to be collected;
Use POST to submit data and get the returned source code;
Use WebClient DownloadFile to download remote images to the local;
Use the access database class to save the collected data to the database.