Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Website safety protection and test diagnosis system structure method based on crawler technology

A crawler technology and security protection technology, which is applied in the field of website security protection and test diagnosis system construction based on crawler technology, can solve problems such as manual testing is difficult to complete, manual visual inspection accuracy and completeness cannot be guaranteed, and achieve enhanced preventive measures Effect

Inactive Publication Date: 2009-08-19
刘峰 +1
View PDF0 Cites 48 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

How many "empty links" does the site have? Where are they? What is the speed of each URL access request? Although these can be tested manually, when the site scale is large, it is obvious that manual testing is already difficult to complete
Moreover, the accuracy and completeness of manual visual inspection cannot be guaranteed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Website safety protection and test diagnosis system structure method based on crawler technology
  • Website safety protection and test diagnosis system structure method based on crawler technology
  • Website safety protection and test diagnosis system structure method based on crawler technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] A method for constructing a website document management, security protection and test diagnosis system based on crawler technology, including the following steps:

[0044] (1) Initial test diagnosis

[0045] figure 1 It is the flow chart of "Initialization Test Diagnosis". First, enter the domain name of the website in unit 101, and unit 102 sends an http request to the website through the domain name, and obtains the URL and html of the home page of the website, responds to the Head information through http, and obtains the URL (global resource location), server (server), and content of the home page. type (content type), Last-modified (document last modification time), Content-length (document length) information, saved in the "Website URL link table". Unit 103 analyzes the link tags in html, builds a crawler, uses any method such as "depth first" or "breadth first", and starts traversing the URLs of all links on the website, and uploads the URL of each URL and the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A website system configuration method which is based on crawler technology and used for file management, safety defense and testing diagnosis can automatically establish management files for files and URL link of website and adopts the management files as the basic foundation to judge whether the webpage is attacked by hackers and whether the link relationship is falsified so as to find the error link of the website. The method can automatically and fully test the URL response time contained in the website so as to find illegal link and 'sensitive words' in the webpage; and the method can be adopted as the tool for the website test, diagnosis, evaluation and safety defense.

Description

technical field [0001] The invention relates to the automation technology of document management, safety protection and testing of a website. Background technique [0002] Crawler Spider is a technology commonly used on the modern Internet, but it is mainly used for content retrieval. In fact, crawler technology can also be used in three aspects: website document management automation, website information security protection and website testing and diagnosis. At present, there is no system and method for applying crawler technology to these three aspects. [0003] First, crawler technology is applied to the automation of website document management. The content of the Internet site consists of a series of static and dynamic files, which form a system through hypertext links. With the continuous modification of the website system, there will be more and more files, and the link structure is also changing. How to know the final system structure, link relationship and file li...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F21/00G06F21/56
Inventor 刘峰宋洁
Owner 刘峰
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products