Method for carrying out term matching on data in database

A database and data technology, applied in the field of computer communication, can solve the problems of slow term indexing, slow word search efficiency, slow query, etc., to achieve a good user experience, improve performance, and speed up matching.

Active Publication Date: 2017-03-08
IOL WUHAN INFORMATION TECH CO LTD
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is that the current term matching engine based on the database is relatively slow in word search efficiency, and the way to improve this prob

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for carrying out term matching on data in database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The technical solutions of the present invention will be further specifically described below in conjunction with the accompanying drawings and specific embodiments.

[0026] In order to solve the above technical problems, the present invention provides a method for term matching of data in the database, such as figure 1 as shown,

[0027] Including the steps of building an index, and using the index to perform term query matching;

[0028] Wherein, the steps of building an index are as follows:

[0029] Traversing the database to obtain character string data, the character string data including character string or target character string;

[0030] Generate a double-array trie based on the source string and the target string;

[0031] Insert the string data into the double-array dictionary tree, return the position value of the string data in the double-array dictionary tree, and calculate the string data ID through the position value, and put the string data ID into...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for carrying out term matching on data in a database. The method is characterized in comprising a step of establishing an index and a step of carrying out term search matching by employing the index. The step of establishing the index is as follows: traversing the database and generating a double-array dictionary tree according to a source character string and a target character string; inserting character string data into the double-array dictionary tree, thereby realizing a first-level index of the character string data; establishing a corresponding relationship between an ID of the character string data and a keyword filed of a data record in which the character string data is located, wherein the data record is in the database, and the corresponding relationship is used for realizing a second-level index of the character string data; and storing the second-level index in a memory cache system. The step of carrying out term search matching by employing the index is as follows: searching an object ID and term bank ID of the data record corresponding to the character string in the database according to the ID of the second-level index, searching according to the object ID and the term bank ID and obtaining a searching result. Various searching demands of the term matching can be satisfied, and the integrated matching performance is improved.

Description

technical field [0001] The invention belongs to the field of computer communication, in particular to a method for matching terms to data in a database. Background technique [0002] At present, computer-assisted translation is an important means to improve the consistency and efficiency of translation. It requires the software to continuously memorize the latest terms and corpus, and to call out terms or corpus that meet the corresponding conditions in a timely manner for selection in the subsequent translation process. . With the continuous expansion of terminology and corpus, the efficiency of retrieving complete translation information directly from the original text or translation based on traditional relational databases or newer non-relational databases will decrease significantly. For relatively large manuscripts to be translated, the speed is naturally unacceptable. Taking mongo database as an example, each document record contains document ID, original text, tr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/28G06F17/27G06F17/30
CPCG06F16/334G06F40/289G06F40/58
Inventor 冯泽康
Owner IOL WUHAN INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products