
A method for preventing website content from being crawled

An anti-crawling technology for website content, applied in special data processing applications, instruments, electronic digital data processing, and similar fields. It addresses problems such as data theft, occupation of network resources, and damage to the interests of websites and enterprises, with the effect of ensuring timely updates and preventing content from being crawled.

Active Publication Date: 2017-08-25
深圳华强电子交易网络有限公司

AI Technical Summary

Problems solved by technology

[0005] Whether crawling is done by search engines, for business value, or as a malicious attack, it causes two main problems. First, large-scale data theft affects the website's business operations, and any exposure of private data harms the individuals or companies concerned. Second, both normal crawling and malicious crawling directly or indirectly degrade the performance of the website server and thereby reduce the stability of the website; malicious attacks and crawling in particular directly damage the interests of websites and enterprises.
For crawled websites, especially those with original content, these operations occupy a large share of the site's network resources and reduce the speed and efficiency of the network. They also infringe the intellectual property rights of the crawled website, harming its interests.




Embodiment Construction

[0027] The present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.

[0028] Figure 1 is a schematic diagram of the network structure of the present invention, which comprises the WEB server, the anti-grab system server, and the client. The method for preventing website content from being crawled includes the following steps:

[0029] 1. First, establish rules for judging crawling behavior;

[0030] 2. The WEB server obtains the client information and passes it to the anti-grab system server;

[0031] 3. The anti-grab system server performs verification based on the information transmitted by the WEB server and returns the verification result to the WEB server. Based on that result, the WEB server decides whether to execute the data query for the requested page or to output an access-denied prompt.
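The three steps can be pictured with a minimal Python sketch. It is an illustration under stated assumptions, not the patent's implementation: the class and function names (`ClientInfo`, `AntiGrabServer`, `handle_request`), the client fields, and above all the rule itself (a simple per-IP request limit) are hypothetical, since the rule description in this text is truncated.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Set


@dataclass
class ClientInfo:
    """Client information collected by the WEB server (fields are assumed)."""
    ip: str
    user_agent: str


@dataclass
class AntiGrabServer:
    """Stand-in for the anti-grab system server."""
    # Step 1: the rule for judging crawling behavior. This per-IP request
    # limit is a hypothetical placeholder, not the rule from the patent.
    max_requests: int = 3
    blacklist: Set[str] = field(default_factory=set)
    request_counts: Dict[str, int] = field(default_factory=dict)

    def verify(self, client: ClientInfo) -> bool:
        """Step 3 (first half): return True if the request may proceed."""
        if client.ip in self.blacklist:
            return False
        count = self.request_counts.get(client.ip, 0) + 1
        self.request_counts[client.ip] = count
        if count > self.max_requests:
            # The assumed rule is violated: remember the client and refuse.
            self.blacklist.add(client.ip)
            return False
        return True


def handle_request(client: ClientInfo,
                   anti_grab: AntiGrabServer,
                   query_page: Callable[[], str]) -> str:
    """WEB server side: pass client info on (step 2), act on the result (step 3)."""
    if anti_grab.verify(client):
        return query_page()   # execute the data query for the requested page
    return "Access denied"    # output the access-denied prompt
```

In this sketch the page query runs only after verification succeeds, so a refused client never reaches the data layer, which matches the order of operations described in step 3.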

[0032] The rule in step (1) is composed of the number of t...



Abstract

The invention provides a method for preventing website content from being crawled. First, a rule for judging crawling behavior is established. The WEB server obtains the client information and passes it to the anti-grab system server. The anti-grab system server performs verification and returns the verification result to the WEB server, which decides, according to that result, whether to execute the data query for the requested page or to output an access-denied prompt. Through a strictly defined verification process applied to client requests, the proposed method effectively prevents website data from being crawled. In addition to the verification process, a regular automatic update mechanism keeps the data in the blacklist table and the customer status table up to date, so that the whole process runs more effectively and stably.
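The abstract does not say what the regular automatic update does to the two tables. As one possible reading, the Python sketch below expires old blacklist entries and drops idle rows from the status table. Everything in it is an assumption made for illustration: the function name `refresh_tables`, the representation of each table as a mapping from client IP to a timestamp, and the two time limits.

```python
import time
from typing import Dict, Optional


def refresh_tables(blacklist: Dict[str, float],
                   status_table: Dict[str, float],
                   ban_seconds: float,
                   idle_seconds: float,
                   now: Optional[float] = None) -> None:
    """Prune both tables in place (hypothetical update policy).

    blacklist maps a client IP to the time it was blacklisted;
    status_table maps a client IP to the time it was last seen.
    """
    now = time.time() if now is None else now

    # Collect keys first so the dicts are not modified while iterating.
    expired = [ip for ip, banned_at in blacklist.items()
               if now - banned_at >= ban_seconds]
    for ip in expired:
        del blacklist[ip]

    stale = [ip for ip, last_seen in status_table.items()
             if now - last_seen >= idle_seconds]
    for ip in stale:
        del status_table[ip]
```

A scheduler or background thread could call such a function at a fixed interval; the `now` parameter exists only so the behavior can be checked with fixed timestamps.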

Description

Technical field

[0001] The invention relates to a method for preventing website content from being crawled.

Background technique

[0002] "Crawling" as used here refers to a program obtaining data from other websites according to specified rules.

[0003] In the early years of the Internet, search engine systems appeared: platforms that accumulate massive amounts of data by crawling website content. This technology obtains website addresses through various channels, crawls the content of each webpage according to its URL, and analyzes the obtained content to extract the corresponding data. There are also non-search-engine platforms that capture data; competitors or other related companies can derive business value by capturing specific information.

[0004] The other kind of crawling is malicious. Both corporate and personal websites have competitors. In order to paralyze co...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): H04L29/06, G06F17/30
Inventor: 刘翔, 黄有富, 彭平源, 管燕卿
Owner: 深圳华强电子交易网络有限公司