Efficient crawler method based on IP
An efficient crawler method based on IP, applied in the field of web crawlers. It addresses the problem of a low proxy IP utilization rate: it improves IP utilization and crawler efficiency, and saves the time otherwise spent on frequent IP switching.
Embodiment Construction
[0027] To describe the technical content of the present invention more clearly, it is further described below with reference to specific embodiments.
[0028] The efficient crawler method based on IP of the present invention comprises:
[0029] (1) Obtain a proxy IP, put the IP into the availability detection queue, request the locally built server, and put high-quality proxy IPs into the common IP pool;
[0030] (1.1) Obtain a proxy IP and put the IP into the availability detection queue;
[0031] (1.2) Request the locally built server and judge whether the server response is obtained within 2 seconds. If so, the IP is a high-quality proxy: add it to the target website quality detection queue, put it into the common IP pool, and continue to step (2). Otherwise, the IP is a non-high-quality proxy and is put back into the availability detection queue;
[0032] (1.3) Judging whether the number of times that the IP is put into the availability detec...
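Steps (1.1) and (1.2) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `probe_local_server`, `run_detection_pass`, and the placeholder URL `http://127.0.0.1:8000/` are hypothetical, and it is assumed that the request to the locally built server is sent through the proxy being checked. Only the 2-second threshold and the queue and pool names come from the text. The retry-count rule of step (1.3) is truncated in the source, so it is not modelled; the sketch makes a single pass and simply re-queues failed proxies.

```python
import time
import urllib.request
from collections import deque
from typing import Callable, Deque, List

TIMEOUT_SECONDS = 2.0  # response-time threshold stated in step (1.2)


def probe_local_server(proxy: str, url: str = "http://127.0.0.1:8000/") -> float:
    """Request the locally built server via `proxy` and return elapsed seconds.

    The URL is a placeholder for the local server (an assumption).
    Raises an exception if the request fails or a socket operation times out.
    """
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy})
    )
    start = time.monotonic()
    with opener.open(url, timeout=TIMEOUT_SECONDS) as response:
        response.read()
    return time.monotonic() - start


def run_detection_pass(
    availability_queue: Deque[str],
    target_quality_queue: Deque[str],
    common_pool: List[str],
    probe: Callable[[str], float] = probe_local_server,
) -> None:
    """Check each queued proxy once, as in step (1.2)."""
    for _ in range(len(availability_queue)):
        proxy = availability_queue.popleft()
        try:
            # The socket timeout is per operation, so also check total time.
            is_high_quality = probe(proxy) <= TIMEOUT_SECONDS
        except Exception:
            # Any failure is treated as "no response within 2 seconds".
            is_high_quality = False
        if is_high_quality:
            target_quality_queue.append(proxy)
            common_pool.append(proxy)
        else:
            # Non-high-quality proxy: back into the availability queue.
            availability_queue.append(proxy)
```

The `probe` parameter is injectable so the classification logic can be exercised without a live proxy or server; in real use a caller would fill `availability_queue` with proxy URLs and call `run_detection_pass` with the default probe.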
