
Method and system for large-scale keyword matching

A keyword matching technology and matching method applied in the field of text processing. It addresses the problem that traditional matching slows down and no longer meets application requirements, and achieves faster matching.

Active Publication Date: 2005-08-03
INST OF COMPUTING TECH CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

When the keyword set becomes very large, the speed of traditional matching techniques drops sharply, and they can no longer meet application requirements, especially real-time data processing requirements.



Examples


Embodiment Construction

[0021] As shown in Figure 1, the system of the present invention includes:

[0022] Device (1): a device for standardizing keywords, which counts the large set of given keywords by length and then sorts them by length;
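The standardization step of Device (1) can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function name `standardize` is hypothetical, and removing duplicate keywords is an assumption that the text does not state.

```python
from collections import Counter


def standardize(keywords):
    """Sort keywords by length and count how many keywords have each length.

    Assumption (not stated in the source): duplicates are removed first.
    Ties in length are broken alphabetically so the result is deterministic.
    """
    unique = sorted(set(keywords), key=lambda k: (len(k), k))
    length_counts = Counter(len(k) for k in unique)
    return unique, dict(sorted(length_counts.items()))


ordered, counts = standardize(["virus", "ab", "worm", "ab", "trojan", "xy"])
print(ordered)  # keywords in ascending length order
print(counts)   # mapping: keyword length -> number of keywords
```

Sorting by length means that any later grouping only has to choose cut points in one ordered list, which keeps the grouping step simple.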

[0023] Device (2): a device for solving the optimal grouping and the best matching method. Two mechanisms can be used: one uses dynamic programming to obtain the optimal grouping and then obtains the best matching method for each group through training; the other uses a shortest-path mechanism to obtain the grouping and the best matching method of each group directly. The device finally stores the grouping and the best matching method of each group in a configuration file;
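The dynamic programming mechanism might look like the sketch below, which splits a length-sorted keyword list into contiguous groups at minimum total cost. The cost model `toy_cost` is purely hypothetical (a fixed overhead per scan pass plus a term that penalizes short minimum lengths); the source does not give the actual cost function, and the names `optimal_grouping`, `best`, and `cut` are illustrative.

```python
def toy_cost(lengths, i, j):
    """Hypothetical cost of scanning with the group lengths[i:j].

    Assumption: one fixed overhead per scan pass, plus a term that grows with
    the group size and shrinks with the shortest keyword in the group.
    """
    pass_overhead = 1.0
    return pass_overhead + (j - i) / lengths[i]


def optimal_grouping(lengths, cost=toy_cost):
    """Partition ascending `lengths` into contiguous groups of minimal total cost."""
    n = len(lengths)
    best = [0.0] + [float("inf")] * n  # best[j]: min cost for the first j keywords
    cut = [0] * (n + 1)                # cut[j]: start index of the last group
    for j in range(1, n + 1):
        for i in range(j):
            candidate = best[i] + cost(lengths, i, j)
            if candidate < best[j]:
                best[j], cut[j] = candidate, i
    groups, j = [], n
    while j > 0:                       # walk the cut points back to the start
        groups.append((cut[j], j))
        j = cut[j]
    return best[n], groups[::-1]


total, groups = optimal_grouping([1, 1, 8, 8, 8, 8])
print(total, groups)
```

With this toy cost, the two one-character keywords end up in their own group, separate from the long keywords. In the described system, the best matching method for each resulting group would then be chosen by training.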

[0024] Device (3): a device for building the scanning automata. Its function is to read the configuration file, adopt the ...



Abstract

The present invention is a large-scale keyword matching method and system. The given keyword set is first standardized, and an optimal grouping together with the optimal matching method within each group is then solved on the standardized keyword set. Two mechanisms can be used for this. The first is dynamic programming: an optimal grouping is calculated first, dividing the keyword set into several groups, and each group is then trained to obtain its optimal matching method. The second uses training to build a directed graph with weighted edges and solves the shortest path in that graph, which yields the optimal grouping and the optimal matching method within each group at the same time. A scanning automaton is then built for each group in turn, based on the training results, to form a sequence of scanning automata; the input text to be scanned is passed through this sequence to obtain the final scan result.
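The shortest-path mechanism can be sketched as follows, under stated assumptions. Nodes are the cut positions 0..n in the length-sorted keyword list, and an edge i -> j means "keywords i..j-1 form one group", weighted by the cheapest candidate matching method for that group. The two cost functions below are hypothetical stand-ins for weights that the source says are obtained by training; Dijkstra's algorithm is used here as one standard shortest-path solver, not necessarily the one the inventors chose.

```python
import heapq


def build_graph(n, methods):
    """Build edges i -> j, each labelled with its cheapest method and weight."""
    graph = {i: [] for i in range(n + 1)}
    for i in range(n):
        for j in range(i + 1, n + 1):
            name, weight = min(
                ((m, f(i, j)) for m, f in methods.items()), key=lambda t: t[1]
            )
            graph[i].append((j, weight, name))
    return graph


def shortest_path(graph, source, target):
    """Dijkstra's algorithm; returns (cost, [(start, end, method), ...])."""
    dist, prev, heap = {source: 0.0}, {}, [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == target:
            break
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        for v, w, name in graph[u]:
            if d + w < dist.get(v, float("inf")):
                dist[v], prev[v] = d + w, (u, name)
                heapq.heappush(heap, (d + w, v))
    plan, v = [], target
    while v != source:
        u, name = prev[v]
        plan.append((u, v, name))
        v = u
    return dist[target], plan[::-1]


lengths = [1, 1, 8, 8, 8, 8]  # keyword lengths, sorted ascending
methods = {                   # hypothetical per-group cost models
    "prefix": lambda i, j: 1.5 + 0.5 * (j - i),
    "suffix": lambda i, j: 1.0 + (j - i) / lengths[i],
}
cost, plan = shortest_path(build_graph(len(lengths), methods), 0, len(lengths))
print(cost, plan)
```

Because each edge already carries its cheapest method, a single shortest path yields both the grouping and the per-group method, which matches the "directly obtain" wording of the second mechanism.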

Description

Technical field

[0001] The invention relates to the technical field of text processing, in particular to a large-scale keyword matching method and system.

Background

[0002] Multi-keyword matching technology is relatively mature and is widely used in text processing and content filtering. Traditional multi-keyword matching algorithms treat the text to be scanned as a one-dimensional string, make full use of the characteristics of the known keyword strings, and skip forward as far as possible during scanning to improve matching performance. Multi-keyword matching algorithms can be divided into three forms according to how the keywords are preprocessed: prefix mode (KMP, AC, Shift-AND, Shift-Or and other algorithms), suffix mode (Boyer-Moore, Wu-Manber and other algorithms), and substring mode (BDM, BOM, SBDM, SBOM and other algorithms). The performance of the multi-keyword matching algorithm is mainly af...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 刘萍, 谭建龙, 程学旗
Owner: INST OF COMPUTING TECH CHINESE ACAD OF SCI