The collection system has the following characteristics:
Mainstream language - written in php+mysql, just install the corresponding server.
Completely open source - open source code, and the code has Chinese comments to facilitate management, learning and communication.
Rule customization - collection rules can be customized and most website content can be collected.
Data modification - customize modification rules to optimize data content.
Data saving - in array form, serialized data is saved to files or databases for easy uploading and calling.
Image Reading - Images with content can be read and saved locally.
Encoding control - Convert encoding, you can save gb2312, gbk and other encodings to utf-8.
Tag cleaning - you can customize the retained tags and clean up unnecessary tags.
Security performance - reading is controlled by password, and remote reading is also safe.
Simple operation - one-click reading operation, you can read in groups according to rules, or read by specifying a rule id, and read with a single id.
Rule grouping - Read data according to rule groups and update the collected data in a timely manner.
Custom read - read data according to custom rule id, which is more effective and timely.
JS reading - Use js to control the reading time and reduce the server load.
Timeout control - You can set the page execution time to reduce timeout errors.
Multiple reads - You can set multiple read controls for web pages to read data more efficiently.
Error control - If errors occur multiple times, reading can be stopped to reduce server resource usage.
Load control - saving data in multiple folders can effectively solve the server load under multiple files.
Data modification - Not only can you browse the data, but you can also modify the main data.
Rule Analysis - Share your rules with others so more people can use them.
Rule download - Download sharing rules and quickly get the content you need.
it works
it works
it works