Method and system for space-efficient multi-pattern matching

Pattern string matching and preprocessing technology, applied in the fields of computational biology, information retrieval, and information filtering, that reduces preprocessing time, saves storage space, and improves efficiency

Active Publication Date: 2015-09-02
INST OF INFORMATION ENG CHINESE ACAD OF SCI
4 Cites, 18 Cited by

AI Technical Summary

Problems solved by technology

Although existing methods are relatively efficient in one respect (time or space), rising information security requirements call for further research into multi-pattern string matching technology with better combined time and space efficiency.



Examples


Embodiment Construction

[0044] To make the above objects, features, and advantages of the present invention clearer, the invention is further described below through specific embodiments and the accompanying drawings.

[0045] The multi-pattern string matching technology proposed by the present invention comprises two stages: a preprocessing stage and a search stage. Figure 1 illustrates the basic flow of the technology and the structure of the keyword matching system. The basic process is as follows: the keyword file is read in, the algorithm enters the preprocessing stage, and the HashTrie is constructed using the recursive hash function and the Rank operation. The text data is then read in and the search stage begins: the recursive hash function computes the hash value of the current string, and the previously constructed HashTrie is used to search and verify the text character by character. Finally, the results of the final search m...
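The two stages described in [0045] can be illustrated with a minimal Python sketch. It is not the patented HashTrie: the hash formula, the table size `TABLE_BITS`, and the names `extend_hash`, `build`, and `search` are assumptions chosen for illustration, and the flag table uses one byte per slot for readability rather than packed bits. It only shows the general idea of a hash that is extended one character at a time with bit operations, a table marking which hash values correspond to pattern prefixes, and a final verification step that rules out hash collisions.

```python
TABLE_BITS = 16                      # assumed table size: 2**16 slots
MASK = (1 << TABLE_BITS) - 1

def extend_hash(h, byte):
    """Hash of (prefix + byte), computed from the hash of the prefix
    using only shifts, XOR, and a mask."""
    return ((h << 5) ^ (h >> 11) ^ byte) & MASK

def build(patterns):
    """Preprocessing stage: mark the hash of every pattern prefix."""
    prefix_flags = bytearray(1 << TABLE_BITS)  # 1 = some prefix hashes here
    terminal = {}                              # hash -> full patterns ending here
    for p in patterns:
        h = 0
        for b in p:
            h = extend_hash(h, b)
            prefix_flags[h] = 1
        terminal.setdefault(h, set()).add(p)
    return prefix_flags, terminal

def search(text, prefix_flags, terminal):
    """Search stage: extend the hash character by character from each
    start position and verify candidates against the real pattern."""
    matches = []
    for start in range(len(text)):
        h = 0
        for end in range(start, len(text)):
            h = extend_hash(h, text[end])
            if not prefix_flags[h]:
                break                          # no pattern prefix hashes here
            for p in terminal.get(h, ()):
                if text[start:end + 1] == p:   # verification step
                    matches.append((start, p))
    return matches

flags, term = build([b"he", b"she", b"hers"])
print(search(b"ushers", flags, term))
```

Because every prefix of a pattern sets its flag during preprocessing, a true occurrence is never cut short; collisions can only cause extra extension steps, which the verification comparison filters out.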



Abstract

The invention relates to a method and a system for space-efficient multi-pattern matching. First, a novel data structure for storing pattern strings, HashTrie, is proposed, in which the original pattern string matrix is stored as a one-dimensional table using a bit vector table, thus avoiding the state transition matrix that conventional automaton-based methods keep in memory. The bit vector table is processed with a recursive hash function to save memory space. During hash computation, the hash function is reduced to simple and efficient bit operations. The use of the Rank technique in HashTrie construction and keyword search improves the space and time efficiency of the search. The method and system markedly reduce memory overhead and preprocessing time, can meet the timeliness that a real-time intrusion detection system requires for rules to take effect, and are better suited to real-time multi-pattern matching in which the pattern set is relatively large and the patterns are relatively short.
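The abstract does not spell out how the Rank technique is applied, so the following Python sketch only shows the generic Rank operation on a bit vector and one common use of it: mapping a sparse set of occupied slots to indices in a dense array. The class name `RankBitVector`, the 64-bit word size, and the `lookup` helper are assumptions for illustration, not details taken from the patent.

```python
class RankBitVector:
    """Bit vector with rank support: rank(i) counts the set bits in [0, i)."""

    def __init__(self, size):
        self.size = size
        self.words = [0] * ((size + 63) // 64)  # 64 bits per word (assumed)
        self.block_rank = []                    # set bits before each word

    def set(self, i):
        self.words[i >> 6] |= 1 << (i & 63)

    def get(self, i):
        return (self.words[i >> 6] >> (i & 63)) & 1

    def finalize(self):
        """Precompute per-word prefix counts; call after all set() calls."""
        total = 0
        self.block_rank = []
        for w in self.words:
            self.block_rank.append(total)
            total += bin(w).count("1")
        return total

    def rank(self, i):
        """Number of set bits in positions [0, i), for 0 <= i < size."""
        w, off = i >> 6, i & 63
        partial = self.words[w] & ((1 << off) - 1)
        return self.block_rank[w] + bin(partial).count("1")


def lookup(bv, dense, slot):
    """Return the payload stored for an occupied slot, else None."""
    return dense[bv.rank(slot)] if bv.get(slot) else None


# Three occupied slots out of 256 need only three payload entries.
bv = RankBitVector(256)
for s in (3, 70, 200):
    bv.set(s)
bv.finalize()
dense = ["a", "b", "c"]          # payloads in increasing slot order
print(lookup(bv, dense, 70))     # payload for slot 70
print(lookup(bv, dense, 71))     # unoccupied slot
```

The space saving comes from storing one bit per possible slot plus a payload only for occupied slots, instead of a full-width table with mostly empty entries.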

Description

Technical field

[0001] The invention relates to the fields of information filtering, information retrieval, computational biology and the like, and in particular to a space-efficient multi-pattern string matching method and system.

Background

[0002] In recent years, with the development of broadband technology and the popularity of multimedia applications, Internet technology has spread and developed rapidly. As the number of network users grows, attack patterns are multiplying as well, and so is the demand for intrusion detection systems. Faced with increasingly serious network security problems such as Internet protocol design flaws, computer system vulnerabilities, and network intrusion attacks, existing algorithms struggle, in both storage space and computing speed, to meet the requirements of real-time feature string matching in high-speed network environments. Therefore, it is of gre...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 张萍 (Zhang Ping), 刘燕兵 (Liu Yanbing), 谭建龙 (Tan Jianlong), 郭莉 (Guo Li)
Owner: INST OF INFORMATION ENG CHINESE ACAD OF SCI