Page data protection method and device, computer equipment and storage medium

A page data protection technology, applied in the computer field, that addresses the problems that conventional anti-crawling relies on a single, inflexible method and is easily identified by malicious crawlers.

Active Publication Date: 2018-08-10
KINGDEE SOFTWARE(CHINA) CO LTD
4 Cites, 15 Cited by

AI Technical Summary

Problems solved by technology

The traditional approach of setting a fixed anti-crawling frequency is simple and inflexible. Malicious crawler scripts or programs easily recognize it and then circumvent it by controlling their access frequency, so it cannot fundamentally solve the problem of malicious crawlers crawling data.



Examples


Embodiment Construction

[0052] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0053] The page data protection method provided by this application can be applied in the application environment shown in Figure 1. The terminal 102 communicates with the proxy server 104 and the server 106 respectively through the network. The proxy server 104 obtains the page access request sent by the terminal 102. The page access request includes a request header and a terminal identifier. The proxy server 104 identifies the crawler type corresponding to the page access request according to the request header. When the crawler type corresponding to th...
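The identification step performed by the proxy server 104 might look roughly like the sketch below. The visible text does not say which request header fields or values are inspected, so the use of the User-Agent field, the marker list `MALICIOUS_AGENT_MARKERS`, and the treatment of a missing User-Agent as malicious are all assumptions made for illustration only.

```python
from enum import Enum


class CrawlerType(Enum):
    MALICIOUS = "first type"       # request is refused
    TO_BE_JUDGED = "second type"   # access frequency is checked next


# Hypothetical markers: the source does not list which header values
# indicate a malicious crawler.
MALICIOUS_AGENT_MARKERS = ("python-requests", "scrapy", "curl")


def identify_crawler_type(request_header: dict) -> CrawlerType:
    """Classify a page access request from its request header (sketch)."""
    # HTTP header field names are case-insensitive, so normalize them first.
    normalized = {name.lower(): value for name, value in request_header.items()}
    user_agent = normalized.get("user-agent", "").lower()

    # Assumption: a request with no User-Agent at all is treated as malicious.
    if not user_agent:
        return CrawlerType.MALICIOUS
    if any(marker in user_agent for marker in MALICIOUS_AGENT_MARKERS):
        return CrawlerType.MALICIOUS
    return CrawlerType.TO_BE_JUDGED
```

A real deployment would likely combine several header fields; the point here is only that the decision is taken from the header before any request reaches the server 106.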


Abstract

The invention relates to a page data protection method and device, computer equipment and a storage medium. The method comprises the following steps: obtaining a page access request sent from a terminal, wherein the page access request comprises a request header and a terminal identifier; identifying, according to the request header, the crawler type corresponding to the page access request; when the crawler type is a first type, refusing to forward the page access request to the corresponding server, wherein the first type is a malicious type; when the crawler type is a second type, obtaining the access frequency of the page access request within a preset time period; and if the access frequency is smaller than a first threshold, forwarding the page access request to the corresponding server, which returns the corresponding page data to the terminal according to the page access request and the terminal identifier, wherein the second type is a type to be judged. With this method, malicious web crawlers can be prevented from crawling page data.
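The decision flow in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the class name `PageProxy`, the `forward` callback, the sliding-window counting keyed by terminal identifier, and counting the current request toward the frequency are all assumptions. The abstract as shown does not say what happens when the frequency reaches the first threshold, so the sketch simply returns `None` in that case.

```python
import time
from collections import defaultdict, deque

MALICIOUS = "first type"
TO_BE_JUDGED = "second type"


class PageProxy:
    """Hypothetical proxy that applies the abstract's decision flow."""

    def __init__(self, first_threshold, period_seconds, forward,
                 clock=time.monotonic):
        self.first_threshold = first_threshold
        self.period_seconds = period_seconds
        self.forward = forward      # callable(request, terminal_id) -> page data
        self.clock = clock          # injectable so the sketch can be tested
        self._hits = defaultdict(deque)  # terminal_id -> request timestamps

    def handle(self, terminal_id, crawler_type, request):
        # First type (malicious): refuse to forward the request.
        if crawler_type == MALICIOUS:
            return None

        # Second type (to be judged): count accesses in the preset period.
        now = self.clock()
        hits = self._hits[terminal_id]
        while hits and now - hits[0] >= self.period_seconds:
            hits.popleft()          # drop timestamps outside the period
        hits.append(now)            # assumption: current request is counted

        if len(hits) < self.first_threshold:
            # Below the first threshold: forward to the server, which
            # returns page data based on the request and terminal identifier.
            return self.forward(request, terminal_id)

        # At or above the threshold: behavior not stated in the visible
        # text, so this sketch returns nothing.
        return None
```

Keeping the timestamps in a `deque` makes expiring old entries cheap, since only the oldest entries ever need to be removed.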

Description

Technical field

[0001] The present application relates to the field of computer technology, in particular to a page data protection method, device, computer equipment and storage medium.

Background

[0002] A web crawler is a program or script that automatically crawls information from a website according to certain rules. A web crawler can obtain a large amount of useful data from a website. However, for content-based websites that provide data services, data is the core of service quality and needs to be protected; malicious web crawlers cannot be allowed to crawl without limit. Crawlers must be screened and malicious ones rejected, otherwise the competitiveness of the website suffers.

[0003] In the traditional technology, malicious crawlers are prevented from crawling data mainly by setting a fixed anti-crawling frequency and prohibiting certain IPs (Internet Protocol, the protocol f...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F21/55, G06F21/56, G06F21/62, H04L29/06
CPC: G06F21/554, G06F21/566, G06F21/629, G06F2221/033, H04L63/1416
Inventor: 林泽鹏, 蔡晓胜, 陈桓, 张良杰
Owner: KINGDEE SOFTWARE(CHINA) CO LTD