Unlock instant, AI-driven research and patent intelligence for your innovation.

Character string multimode fuzzy matching method based on AC automaton

A matching method and character string technology, which is applied in the fields of electronic digital data processing, digital data information retrieval, special data processing applications, etc., can solve the problems of low repetition efficiency, low real-time performance, and insufficient utilization of similarity calculations, etc., to achieve Reduce the number of comparisons and achieve high real-time performance

Active Publication Date: 2020-12-18
南京中孚信息技术有限公司
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For a large number of strings, when the similarity calculation needs to be matched, the number of calculations equal to the number of strings in the database is performed, which has low real-time performance, and the calculation of the similarity does not fully utilize the existence of equality in multi-mode strings. Partial features, duplication and inefficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Character string multimode fuzzy matching method based on AC automaton
  • Character string multimode fuzzy matching method based on AC automaton
  • Character string multimode fuzzy matching method based on AC automaton

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] In order to further illustrate the various embodiments, the present invention provides accompanying drawings, which are part of the disclosure of the present invention, and are mainly used to illustrate the embodiments, and can be used in conjunction with the relevant descriptions in the specification to explain the operating principles of the embodiments, for reference Those of ordinary skill in the art should be able to understand other possible implementations and advantages of the present invention. The components in the figures are not drawn to scale, and similar component symbols are generally used to represent similar components.

[0049] According to an embodiment of the present invention, an AC automaton-based method for multi-mode fuzzy matching of character strings is provided.

[0050] Now in conjunction with accompanying drawing and specific embodiment the present invention is further described, as figure 1 As shown, according to the character string multim...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a character string multimode fuzzy matching method based on an AC automaton, and the method comprises the following steps: defining a plurality of groups of mode string sets with labels through employing a rule, and adding defined mode strings with labels into a database; judging whether the text content is inquired for the first time; if the text content is inquired for the first time, reading all mode strings with labels in the database, and constructing a Trie tree through a pre-configured method; completing the construction of a fail pointer on the Trie tree by adopting a preset rule; and adopting a preset method to achieve query matching between the text content and the plurality of groups of pattern string sets with the labels. The method has the beneficial effects that a fuzzy matching function is added on the basis of the AC automaton, the common prefix in the multimode character string can be effectively utilized, the comparison frequency is reduced, fuzzy matching can be supported, and the method has certain robustness and is simple and efficient.

Description

technical field [0001] The invention relates to the field of multi-mode fuzzy matching of character strings, in particular to an AC automaton-based multi-mode fuzzy matching method for character strings. Background technique [0002] The Internet is flooded with a large amount of text data, and quickly extracting key information from the text helps to quickly locate and find the text and make it easier for users to make decisions. Text labels are often a high-level summary of text information and are used to present key information in the text. Text tagging is a process of transforming unstructured text into structured tags, which is crucial for text processing systems. A text may have multiple tags, so in the process of text tagging, it is necessary to perform multi-mode matching on the text and a large number of custom rule strings, and in order to achieve strong robustness, fuzzy matching needs to be implemented when string matching function, so there is an urgent need ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F16/9532
CPCG06F16/332G06F16/9532
Inventor 陈姝张玉林熊英超曲志峰苗功勋
Owner 南京中孚信息技术有限公司