A method and device for intercepting web crawlers
A crawler-interception technology for web pages, applied in the network field. It addresses two problems: normal users being mistaken for web crawlers, and low efficiency in intercepting web crawlers. The aim is to support high concurrency, reduce server pressure, and increase the interception rate.
Examples
Embodiment 1
[0036] Embodiment 1, in one embodiment,
[0037] 1) The browser sends an HTTP request to the server, requesting the first page of the current category;
[0038] The server generates an image URL path containing the cookie value and saves it to the first page;
[0039] The server presets pages 1-10 as the range of pages that may be accessed directly. The server determines that the first page falls within this direct access range, so it returns the first page, which includes the image URL path, to the browser;
[0040] The browser automatically downloads the picture according to the image URL path contained in the returned first page of the current category, parses the picture with a JS method, extracts the cookie value, and saves it. The browser carries this cookie value on subsequent page turns.
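The first exchange above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the 32-hex-character value format, the `/img/<value>.png` path layout, and the names `new_cookie_value`, `build_image_url`, `render_page`, and `extract_cookie_value` are all hypothetical. The sketch also reads the value straight from the image URL path in the returned page, whereas the text describes the browser downloading the picture and parsing it with a JS method.

```python
import re
import secrets
from typing import Optional

# Hypothetical URL layout: the cookie value forms part of the image path.
IMAGE_URL_PATTERN = re.compile(r"/img/([0-9a-f]{32})\.png")


def new_cookie_value() -> str:
    """Generate a random 32-hex-character value (assumed format)."""
    return secrets.token_hex(16)


def build_image_url(cookie_value: str) -> str:
    """Server side: build an image URL path containing the cookie value."""
    return f"/img/{cookie_value}.png"


def render_page(page_number: int, cookie_value: str) -> str:
    """Server side: save the image URL path into the page that is returned."""
    image_url = build_image_url(cookie_value)
    return (
        f"<html><body><h1>Category page {page_number}</h1>"
        f'<img src="{image_url}"></body></html>'
    )


def extract_cookie_value(page_html: str) -> Optional[str]:
    """Client side (simulated): recover the cookie value from the image URL path."""
    match = IMAGE_URL_PATTERN.search(page_html)
    return match.group(1) if match else None
```

A client that never loads or parses the picture, as many simple crawlers do not, would have no cookie value to send on later requests, which is what the later steps rely on.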
[0041] 2) The browser sends an HTTP request carrying a cookie value to the server, requesting page 10 of the current category;
[0042] The serv...
Embodiment 2
[0050] Embodiment 2, in another embodiment,
[0051] If the browser receives a link to page 10 of the category, then,
[0052] The browser sends an HTTP request to the server, requesting page 10 of the current category;
[0053] The server generates an image URL path containing the cookie value and saves it to page 10;
[0054] The server presets pages 1-10 as the range of pages that may be accessed directly, and determines that page 10 falls within this direct access range. Therefore, although the HTTP request does not contain a cookie value at this time, the server directly returns page 10, which includes the image URL path, to the browser.
[0055] The browser automatically downloads the picture according to the image URL path contained in the returned page 10 of the current category, parses the picture with a JS method, extracts the cookie value, and saves it; carries the cookie value when turning the pa...
Embodiment 3
[0056] Embodiment 3, in another embodiment,
[0057] If the browser receives a link to category page 11, then,
[0058] The browser sends an HTTP request to the server, requesting page 11 of the current category;
[0059] The server generates an image URL path containing the cookie value and saves it to page 11;
[0060] The server determines that page 11 does not fall within the direct access range, so it further checks whether the HTTP request contains a cookie value. Because the browser opened the link directly, the HTTP request does not contain a cookie value. Therefore, the server returns the first page of the current category to the browser.
[0061] Next, if the user wants to continue to visit other pages, the operations in Embodiment 1 can be repeated to achieve normal page visits.
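The server-side decision described across the three embodiments can be summarized in a short sketch. It is illustrative only and rests on assumptions: the names `DIRECT_ACCESS_PAGES`, `Response`, and `handle_request` are hypothetical, and so is the `issued_values` set used to check that a presented cookie value was actually issued by the server. The text states only that the server checks whether the request carries a cookie value; returning the requested page when a valid value is present is inferred from paragraph [0061], not spelled out in the visible text.

```python
from dataclasses import dataclass
from typing import Optional, Set

# Pages 1-10 may be accessed directly, as preset in the embodiments.
DIRECT_ACCESS_PAGES = range(1, 11)


@dataclass
class Response:
    page_number: int  # page actually returned to the browser
    redirected: bool  # True when the first page replaces the requested page


def handle_request(
    requested_page: int,
    cookie_value: Optional[str],
    issued_values: Set[str],
) -> Response:
    """Decide which page to return for a category page request."""
    # Embodiments 1 and 2: pages in the direct access range need no cookie value.
    if requested_page in DIRECT_ACCESS_PAGES:
        return Response(requested_page, redirected=False)
    # Outside the range, a cookie value is required (validation is an assumption).
    if cookie_value is not None and cookie_value in issued_values:
        return Response(requested_page, redirected=False)
    # Embodiment 3: no usable cookie value, so return the first page instead.
    return Response(1, redirected=True)
```

For example, `handle_request(11, None, set())` yields the first page, matching Embodiment 3, while `handle_request(10, None, set())` yields page 10 directly, matching Embodiment 2.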

