
Unit name matching and searching method and device based on fuzzy matching algorithm

A fuzzy matching technology and matching method, applied in computing, text database queries, digital data information retrieval, and related fields. It addresses the problems that ES cannot recognize abbreviations, area names, or aliases in unit names, has no concept of weight, and cannot cope with the uncertainty of user input. It provides a fine-grained matching and search method that improves accuracy and yields reliable results.

Pending Publication Date: 2021-08-17
BANK OF COMMUNICATIONS

AI Technical Summary

Problems solved by technology

[0005] 1. Unit names contain abbreviations, and ES cannot directly recognize them;
[0006] 2. A unit name contains an area name, but ES cannot recognize it as an area, so it may return unit names that are not in that area;
[0007] 3. A unit name contains an alias field, but ES cannot directly retrieve the unit name from its alias;
[0008] 4. Because user input is uncertain, a unit name may contain many invalid characters, such as brackets and dots, which ES cannot recognize;
[0009] 5. There is no concept of weight. Every word segment index matched by ES carries the same weight, and there is no way to dynamically adjust the weights of different word segments.



Examples


Embodiment 1

[0052] Embodiment 1 provides a unit name matching method based on a fuzzy matching algorithm. The method implements one-to-one fuzzy matching and comprises the following steps:

[0053] S1. Obtain the two unit names to be matched, denoted A and B, and determine whether one contains the other or whether they match exactly. If so, directly output a fuzzy matching score of 100; otherwise, execute step S2.
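
The short-circuit check in step S1 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name shortcut_score is hypothetical, and treating empty names as "no shortcut" is an assumption the source does not state.

```python
from typing import Optional


def shortcut_score(name_a: str, name_b: str) -> Optional[int]:
    """Return 100 if the names are identical or one contains the other.

    Returns None when no shortcut applies, meaning the caller should
    continue with step S2.
    """
    # Assumption: an empty name never triggers the shortcut. Without this
    # guard, "" would count as contained in every string.
    if not name_a or not name_b:
        return None
    if name_a == name_b or name_a in name_b or name_b in name_a:
        return 100
    return None
```

Returning None rather than a number keeps "no decision yet" distinct from a real score, so the later steps are only run when they are needed.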

[0054] S2. Preprocess the two unit names separately, including standardization and filtering. Standardization includes replacing the abbreviations in the unit names with standard words based on the abbreviation thesaurus. Filtering includes deleting meaningless characters such as "!", "%", "$", "#", "&", "*", "'", "《", "》", ":", "+", "," and ".", and deleting invalid words from the unit names based on the invalid-word thesaurus.
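
A rough sketch of the preprocessing in step S2 is shown below. The thesaurus contents (ABBREVIATIONS, INVALID_WORDS) are hypothetical samples, since the source does not list them. Including both ASCII and full-width punctuation in the character set and collapsing leftover whitespace are also assumptions.

```python
import re

# Hypothetical sample thesauri; the real ones are not given in the source.
ABBREVIATIONS = {"BoCom": "Bank of Communications"}
INVALID_WORDS = ["TEST"]

# Meaningless characters named in the text, in ASCII and full-width forms
# (the full-width variants are an assumption).
MEANINGLESS_CHARS = re.compile(r"[!%$#&*'《》::+,,.]")


def preprocess(name: str) -> str:
    """Standardize and filter one unit name."""
    # Standardization: replace abbreviations with their standard words.
    for abbreviation, standard in ABBREVIATIONS.items():
        name = name.replace(abbreviation, standard)
    # Filtering, part 1: delete meaningless characters.
    name = MEANINGLESS_CHARS.sub("", name)
    # Filtering, part 2: delete invalid words.
    for word in INVALID_WORDS:
        name = name.replace(word, "")
    # Assumption: collapse whitespace left behind by the deletions.
    return " ".join(name.split())
```

Abbreviations are expanded before punctuation is stripped so that an abbreviation written with a dot or comma next to it is still replaced as a whole word.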

[0055] S3. Perform custom word segmentation processing on the two unit names to obtain the corresponding custom wor...

Embodiment 2

[0086] As shown in figure 2, this embodiment provides a unit name search method based on a fuzzy matching algorithm. The method comprises:

[0087] Obtaining the unit name to be searched, and determining, through ES fuzzy matching, the name to be matched that has the highest matching degree in the unit name lexicon;

[0088] Matching the unit name to be searched against the name to be matched, using the unit name matching method based on the fuzzy matching algorithm described in any one of claims 1 to 7, to determine the fuzzy matching score between the name to be matched in the unit name lexicon and the unit name to be searched.

[0089] In this embodiment, the unit name matching method based on the fuzzy matching algorithm is completely consistent with Embodiment 1, and will not be repeated in this embodiment.
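
The two-stage flow of this embodiment (retrieve the best candidate, then score it with the pairwise method) can be sketched as follows. This is only an illustration under stated assumptions: the ES lookup is replaced by Python's difflib.get_close_matches over an in-memory list, and pair_score is a simplified placeholder for the Embodiment 1 method, not the patented scoring. The names search and pair_score are hypothetical.

```python
from difflib import SequenceMatcher, get_close_matches
from typing import List, Optional, Tuple


def pair_score(name_a: str, name_b: str) -> float:
    """Placeholder for the Embodiment 1 matcher (assumption, simplified)."""
    # Step S1 shortcut: identical names or containment score 100.
    if name_a and name_b and (name_a in name_b or name_b in name_a):
        return 100.0
    # Stand-in for steps S2 onward: plain character similarity.
    return 100.0 * SequenceMatcher(None, name_a, name_b).ratio()


def search(query: str, lexicon: List[str]) -> Optional[Tuple[str, float]]:
    """Return the best candidate from the lexicon and its fuzzy score."""
    # Stage 1: candidate retrieval. difflib stands in for ES fuzzy matching.
    candidates = get_close_matches(query, lexicon, n=1, cutoff=0.0)
    if not candidates:
        return None
    best = candidates[0]
    # Stage 2: rescore the candidate with the pairwise matcher.
    return best, pair_score(query, best)
```

Separating retrieval from scoring means the cheap first stage only has to surface a plausible candidate, while the finer-grained second stage decides how good the match actually is.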

[0090] Specific examples are given below:

[0091] The name of the unit to be searched: Jiangxi Radio, Film and Television Administration;

[0092] The ...

Embodiment 3

[0097] This embodiment provides a unit name matching device based on a fuzzy matching algorithm, building on Embodiment 1. The device includes a memory and a processor. The memory stores a computer program, and the processor, when executing the computer program, implements the unit name matching method based on the fuzzy matching algorithm described in Embodiment 1.


Abstract

The invention relates to a unit name matching and searching method and device based on a fuzzy matching algorithm. The method comprises: performing custom word segmentation on the two unit names to be matched to obtain the corresponding custom segmented words, which fall into three types, namely attribute words, region words and name words; calculating the fuzzy matching score of each type of custom segmented word in the two unit names based on the segmentation result; and weighting the fuzzy matching scores of the three types to obtain the fuzzy matching score of the two unit names. Compared with the prior art, the method matches and searches unit names more accurately.
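
The weighting described in the abstract can be sketched as below. The weight values, the dictionary layout of the segmentation result, and the use of difflib.SequenceMatcher as the per-type similarity measure are all assumptions made for illustration; the source does not specify any of them here.

```python
from difflib import SequenceMatcher
from typing import Dict

# Hypothetical weights for the three word types; the values are not
# taken from the source.
WEIGHTS = {"region": 0.2, "name": 0.6, "attribute": 0.2}


def segment_score(text_a: str, text_b: str) -> float:
    """Similarity of one word type on a 0 to 100 scale (difflib stand-in)."""
    return 100.0 * SequenceMatcher(None, text_a, text_b).ratio()


def weighted_score(seg_a: Dict[str, str], seg_b: Dict[str, str]) -> float:
    """Combine the per-type scores into one fuzzy matching score."""
    return sum(
        weight * segment_score(seg_a.get(kind, ""), seg_b.get(kind, ""))
        for kind, weight in WEIGHTS.items()
    )
```

With weights like these, a difference in the name words lowers the total far more than the same difference in the attribute words, which is the kind of adjustment that equal-weight matching cannot express.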

Description

Technical field

[0001] The invention relates to a unit name matching and searching method and device, in particular to a unit name matching and searching method and device based on a fuzzy matching algorithm.

Background technique

[0002] In the context of today's information technology era, various industries, especially the financial industry, have increasingly urgent requirements for customer information mining, requiring more accurate customer information and faster mining speed. In order to meet this requirement, the ES database is introduced. As a distributed database, ES provides scalable search, has near real-time search capabilities, and supports precise query and fuzzy query processing of large amounts of data.

[0003] ES fuzzy query is a fast fuzzy matching query method based on Elasticsearch's built-in word segmentation method. Elasticsearch is a distributed database that provides a distributed multi-user capable full-text search engine that uses JSON for data ...


Application Information

IPC(8): G06F40/295, G06F40/284, G06F16/335, G06F16/33
CPC: G06F40/295, G06F40/284, G06F16/335, G06F16/3344
Inventor: 李君许志坚
Owner: BANK OF COMMUNICATIONS