Supercharge Your Innovation With Domain-Expert AI Agents!

Method and device for creating and querying catalog database

A database and directory technology, applied in other database retrieval, network data indexing, network data retrieval and other directions, can solve the problems of small amount of data, unfavorable website security detection, difficult to scan website vulnerability catalogue, etc. Effect

Active Publication Date: 2020-02-07
BEIJING UNIV OF POSTS & TELECOMM
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of this, the purpose of the embodiments of the present invention is to provide a method and device for creating and querying a catalog database, so as to solve the problem that the website catalog database in the prior art stores a small amount of data and lacks classified storage, which makes it difficult to scan the website. Sexual directory, which is not conducive to the security detection of the website

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for creating and querying catalog database
  • Method and device for creating and querying catalog database
  • Method and device for creating and querying catalog database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0055] The embodiment of the present invention provides a method for generating a catalog database. The catalog database generated by the method provided by the embodiment of the present invention has a large amount of data, and the primary key and slave key of the source code data of the website in the catalog database are set to facilitate cataloging. Inquire.

[0056] Such as figure 1 As shown, the directory database generation method provided by the embodiment of the present invention includes steps S110-S140, specifically as follows.

[0057] S110. Obtain the website structure of the target website, and determine a crawling strategy according to the above website structure.

[0058] The above-mentioned target website is an open source website building platform. For example, the above-mentioned target website can be Github, Webmaster's Home, etc., and of course it can also be other websites. The embodiment of the present invention does not limit the specific type of the a...

Embodiment 2

[0079] The embodiment of the present invention provides a catalog database query method, which is applied to the catalog database generated by the catalog database generating method in Embodiment 1 of the present invention. set, which improves the query efficiency.

[0080] Such as figure 2 As shown, when using the directory database query method provided by the embodiment of the present invention to query the directory database subset matching the target website, it specifically includes steps S210-S220.

[0081] S210, setting a crawler strategy for the target website, and obtaining a site map (Sitemap) of the target website according to the crawler strategy, the crawler strategy includes setting a crawler start URL, encrypted data of the target website, and header information of a request.

[0082] When using web crawler technology to obtain the Sitemap of the target website, it is necessary to set the starting URL for crawling. Therefore, the website has an anti-crawler m...

Embodiment 3

[0099] An embodiment of the present invention provides a device for generating a directory database, which is used to execute the method for generating a directory database provided in Embodiment 1 of the present invention.

[0100] Such as image 3 As shown, the directory database generation device provided by the embodiment of the present invention includes a first determination module 310, an acquisition module 320, a second acquisition module 330 and a generation module 340;

[0101] The above-mentioned first determining module 310 is used to obtain the website structure of the target website, and determine the crawler strategy according to the website structure;

[0102] The acquisition module 320 is configured to acquire the source code data of the target website according to the crawler strategy;

[0103] The above-mentioned second determination module 330 is used to determine the primary key of the source code data in the directory database according to the path for o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides directory database generation and query methods and apparatuses. The generation method comprises the steps of obtaining a website framework of a target website and determining a crawler policy according to the website framework; obtaining source code data of the target website according to the crawler policy; determining a primary key of the source code data in a directory database according to a path of obtaining the source code data, and determining a secondary key of the source code data in the directory database according to feature parameters of the source code data; and determining a storage directory of the source code data of the target website in the directory database according to the primary key and the secondary key, and storing the source code data, thereby generating the directory database. According to the methods and the apparatuses, the generated directory database stores a large amount of data information, and a storage path of the source code data of the target website in the directory database is determined according to the primary key and the secondary key, so that a directory query can be conveniently performed, a vulnerability directory can be scanned more easily during directory scanning, and the website framework can be adjusted in time for protecting the website security.

Description

technical field [0001] The invention relates to the technical field of computer and information security, in particular to a method and device for generating and querying a directory database. Background technique [0002] The WEB website is the most widely used website on the Internet, so it is often subject to malicious attacks. Among them, the directory scanning attack is a kind of attack method with strong versatility and high harm. In a directory scanning attack, the attacker initiates a Hypertext transfer protocol (HTTP) request by iterating a large number of directories and file name lists to obtain the directory information topology of the website, thereby leaking sensitive information such as upload pages and background login pages. Once the attacker obtains this information, the security protection of the entire website system will fall short. [0003] In order to prevent the website from being attacked, it is necessary to detect the security of the website and fi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/951G06F16/958
CPCG06F16/951G06F16/958
Inventor 郭燕慧孙博文徐国爱
Owner BEIJING UNIV OF POSTS & TELECOMM
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More