Method for accelerating character string matching by trans-border protection mechanism

A technology of character string matching and cross-border protection, applied in the field of information processing, can solve the problems of being unable to apply and minimizing the number of cross-border protection checks

Inactive Publication Date: 2008-09-24
HARBIN ENG UNIV
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The above method is known as the simple out-of-boundary protection method, which does not minimize the number of out-of-boundary protection checks
At the same time, this method can only be applied to the t...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for accelerating character string matching by trans-border protection mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The present invention is described in more detail below in conjunction with accompanying drawing example:

[0020] A method for accelerating character string matching by using an out-of-bounds protection mechanism, the steps are as follows:

[0021] a) According to the length of the text to be matched, the end position of the text is obtained, assuming that the last character of the text is located at the loc position.

[0022] b) Set a 1-character isolation word at the loc+1 position, and the isolation word can be any character that does not appear in the pattern. It is used to prevent a false match when several characters at the end of the text and several characters at the beginning of the added out-of-bounds protection are exactly one occurrence of the pattern.

[0023] c) Copy the pattern string to the loc+2 position of the text (it can be copied multiple times depending on the situation, and one of the multiple patterns to be matched is copied in a multi-pattern ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method which uses a boundary violation protection mechanism for accelerating the matching of character strings. A tail position of a text is obtained according to the length of the text to be matched, and the last end character of the text is assumed to be positioned at the position of loc; an isolation word of one character is arranged in the position of loc plus 1, and the isolation word is any character that does not appear in a mode; a copy mode is connected to the position of loc plus 2 of the text; a normal character string matching is implemented without checking whether a subscript crosses a boundary; whether a subscript crosses the boundary or not is judged in front of the matching position of an output mode, if the subscript does not cross the boundary, the matching position is output, and if the subscript crosses the boundary, the matching action is then finished. The method of the invention has no relation with the concrete realization of the matching of the character strings and is a general improved method for present matching problems of various character strings. The output action after the mode matching in the whole string matching process is the action with the lowest frequency of all the actions appearing in the string matching process. Therefore, the method of the invention can minimize the total number of the examination operations for the subscript boundary violation.

Description

(1) Technical field [0001] The invention relates to an information processing method, in particular to a character string matching method used in the field of information processing. (2) Background technology [0002] String matching can be understood as finding one or several patterns with certain properties from a given sequence of symbols. The application range of string matching is extremely wide. Almost all applications involving text processing will involve string matching operations. This problem is the core problem in important fields such as network security, information retrieval, and computational biology. At present, with the emergence of network security issues and the emergence of applications such as super-large databases, high-speed search engines, and the Human Genome Project, there is a strong demand for high-speed matching algorithms. [0003] String matching can be classified based on whether the text to be processed was known at the time of the matching...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 范洪博姚念民
Owner HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products