Site content pickup-preventing method

An anti-crawling and content technology, applied in special data processing applications, instruments, electrical and digital data processing, etc., can solve the problems of data theft, occupation of network resources, damage to the interests of the scraped website, etc., to ensure instant updates, The effect of preventing crawling

Active Publication Date: 2013-02-06
深圳华强电子交易网络有限公司
View PDF3 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] According to the above mentioned search engine capture, business value capture, or malicious attack capture, there are two main problems: one is that large-scale data theft will bring certain business impacts to website operations, and there may be some Exposure of private data will have a negative impact on individuals or companies; second, whether it is normal crawling or malicious attack crawling, it will indirectly or directly affect the performance of the website server, thereby reducing the stability of the website, especially malicious ones. Attacks and crawling directly damage the interests of websites and enterprises
For the crawled websites, especially those with original content, these operations occupy a large amount of network resources of the crawled websites and reduce the speed and efficiency of the network; on the other hand, they also infringe Intellectual property rights of the crawled website, thus harming the interests of the crawled website

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Site content pickup-preventing method
  • Site content pickup-preventing method
  • Site content pickup-preventing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The present invention will be described in detail below with reference to the accompanying drawings and embodiments.

[0028] Such as figure 1 As shown, the schematic diagram of the network structure of the present invention is described, including the WEB server end, the anti-grab system server and the client, a method for website content anti-grab, including the following steps:

[0029] 1. First establish rules for judging crawling behavior;

[0030] 2. The WEB server obtains the client information, and passes it to the anti-grab system server after obtaining it;

[0031] 3. The anti-grabbing system server verifies according to the information transmitted by the WEB server, and returns the verification identification result to the WEB server, and the WEB server decides whether to execute the data query of the requested page or output a prompt of denial of access according to the verification result.

[0032] The rule in the step (1) is composed of the number of tim...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a site content pickup-preventing method. The site content pickup-preventing method comprises the following steps: firstly, establishing a pickup-judging rule; secondly, acquiring client-side information by a WEB server side, and transferring the acquired client-side information to a pickup-preventing system server; thirdly, verifying the client-side information transferred by the WEB server side by the pickup-preventing system server; fourthly, returning a verifying identification result to the WEB server side; and finally, determining whether to execute data query on a request page or output a prompt of access rejection by the WEB server side. By the site content pickup-preventing method, through strict establishment of a verification process, pickup of site data is effectively prevented by verifying the request of a client side; and besides the verification process, a periodical automatic updating mechanism is established, data in both a blacklist table and a client state table is ensured to be updated in real time and the operation of the whole process is more effectively and stably ensured.

Description

technical field [0001] The invention relates to a method for preventing website content from grabbing. Background technique [0002] "Crawling" mentioned in this article refers to a way for programs to obtain data from other websites in accordance with specified rules. [0003] In the early years, a search engine system appeared on the Internet, a platform formed by crawling website content to achieve massive data. This technology obtains the website address through various channels, and crawls the content of the webpage according to the URL. The obtained content is analyzed and finally the corresponding data information is obtained; at the same time, there are also other non-search engine platforms for data capture. Competitors or other related companies can bring business value to them by capturing specific information content. [0004] The other kind of crawling is malicious. Competitors exist regardless of corporate websites or personal websites. In order to paralyze co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L29/06G06F17/30
Inventor 刘翔黄有富彭平源管燕卿
Owner 深圳华强电子交易网络有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products