Crawler implementation method and system for breaking through IP restrictions
A crawler implementation method and system for breaking through IP restrictions. It addresses the high cost and poor economy of existing approaches and achieves the effect of low cost.
Embodiment Construction
[0019] As shown in Figure 1, this crawler implementation method for breaking through IP restrictions includes the following steps:
[0020] (1) The crawler scheduling server sends a crawling task, which includes the task ID, the URL of the HTTP request, all request parameters, and the maximum waiting time;
[0021] (2) After the client receives the crawling task, it immediately initiates an HTTP request to fetch the corresponding page;
[0022] (3) When the page fetch is complete, the client checks whether the maximum waiting time has been exceeded; if it has not been exceeded, step (4) is executed, otherwise step (1) is executed;
[0023] (4) The client sends the fetched data to the crawler scheduling server, tagged with the task ID; the fetched data is the string returned in the HTTP response.
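Steps (1) to (4) above can be sketched on the client side roughly as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the names `CrawlTask`, `CrawlResult`, `http_get`, and `run_task` are hypothetical, the request is assumed to be a plain HTTP GET, and the maximum waiting time is assumed to be expressed in seconds. Returning `None` stands in for the "go back to step (1)" branch, where the scheduling server is expected to issue the task again.

```python
import time
import urllib.parse
import urllib.request
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class CrawlTask:
    """Step (1): what the scheduling server sends (field names are assumed)."""
    task_id: str
    url: str
    params: Dict[str, str] = field(default_factory=dict)
    max_wait_seconds: float = 10.0


@dataclass
class CrawlResult:
    """Step (4): the response string, tagged with the originating task ID."""
    task_id: str
    body: str


def http_get(url: str, params: Dict[str, str], timeout: float) -> str:
    """Default fetcher (assumption: GET request, body decoded as UTF-8)."""
    if params:
        separator = "&" if "?" in url else "?"
        url = url + separator + urllib.parse.urlencode(params)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def run_task(
    task: CrawlTask,
    fetch: Callable[[str, Dict[str, str], float], str] = http_get,
    clock: Callable[[], float] = time.monotonic,
) -> Optional[CrawlResult]:
    """Run steps (2) to (4) for one task on the client."""
    started = clock()
    # Step (2): fetch the page as soon as the task arrives.
    body = fetch(task.url, task.params, task.max_wait_seconds)
    # Step (3): discard the result if the maximum waiting time was exceeded.
    if clock() - started > task.max_wait_seconds:
        return None
    # Step (4): return the raw response string together with the task ID.
    return CrawlResult(task_id=task.task_id, body=body)
```

The `fetch` and `clock` parameters are injectable only so the flow can be exercised without network access; a real client would also need to handle request errors and transmit the `CrawlResult` back to the scheduling server, which this sketch leaves out.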
[0024] The present invention sends page-crawling tasks to clients (for example, an app installed on a user's mobile phone), and breaks through the limit by the huge number...
