Crawler interception method based on user behavior portrait, electronic equipment, and storage medium
A user and behavior technology, applied in the field of network security, can solve the problem of inefficient interception of web crawlers, and the common IP can be set arbitrarily without considering the effect of avoiding interception errors, reducing the interception error rate, and improving the accuracy rate.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0029] figure 1 A flow chart showing a method for intercepting crawlers based on user behavior portraits according to Embodiment 1 of the present invention, as shown in figure 1 As shown, the crawler interception method based on user behavior portrait specifically includes the following steps:
[0030] Step S101, analyzing known crawler access requests to obtain user behavior portraits corresponding to known crawler access requests.
[0031] Based on the determined and known crawler access requests, user behavior data such as access traces left during the access process, operations on the page, and access to the server can be analyzed. For example, a large amount of user behavior data can be analyzed. User behavior portraits can be obtained through training, induction, and other methods. Among them, the user behavior profile includes data in multiple dimensions such as the frequency of user access to the server, the length of time spent on the page, the speed of page access,...
Embodiment 2
[0051] figure 2 shows a flow chart of a crawler interception method based on user behavior portraits according to Embodiment 2 of the present invention, as shown in figure 2 As shown, the crawler interception method based on user behavior portrait includes the following steps:
[0052] Step S201, analyzing known crawler access requests to obtain user behavior portraits corresponding to known crawler access requests.
[0053] For this step, refer to the description of step S101 in Embodiment 1, and details are not repeated here.
[0054] Step S202, receiving a page access request sent by the client.
[0055] Step S203, judging whether the originator of the access request is in the pre-established search engine white list.
[0056] Since some search engines also use crawler technology to access pages, the user behavior characteristics generated by them are very consistent with user behavior portraits, but these search engines are not objects that need to be intercepted, and...
Embodiment 3
[0069] Embodiment 3 of the present application provides a non-volatile computer storage medium. The computer storage medium stores at least one executable instruction. The computer executable instruction can execute the crawler interception method based on user behavior portrait in any of the above method embodiments. .
[0070] Specifically, the executable instruction can be used to make the processor perform the following operations:
[0071] Analyze known crawler access requests to obtain user behavior portraits corresponding to known crawler access requests; receive page access requests sent by clients, and obtain user behavior characteristics based on user behavior data generated by access requests; user behavior The feature is compared with the user behavior portrait of the crawler access request to determine whether the access request is a crawler access request; if so, the access request is intercepted.
[0072] In an optional implementation manner, the user behavior ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


