Enterprise name duplicate checking method and device

A technology of enterprise name and similarity, which is applied in the computer field, can solve problems such as abbreviation, verification of duplicate items, the accuracy of duplication check cannot meet the demand, and low efficiency, so as to achieve the effect of improving accuracy and efficiency of duplication check

Pending Publication Date: 2021-02-12
BANK OF CHINA
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Since the name of the enterprise is entered manually, there are often abbreviations and omissions, etc., the accuracy of duplicate checking of simple verification of duplicate items can no longer meet the demand, and when the amount of data is large, the efficiency of fuzzy query is very low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Enterprise name duplicate checking method and device
  • Enterprise name duplicate checking method and device
  • Enterprise name duplicate checking method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, the embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings. Here, the exemplary embodiments and descriptions of the present invention are used to explain the present invention, but not to limit the present invention.

[0029] The embodiment of the present invention provides a method for checking the duplicate name of an enterprise, such as figure 1 As shown, the method includes step 101 to step:

[0030] Step 101, using the ES to search for a second business name that matches the first business name to be checked.

[0031] The full name of ES is ElasticSearch, which is a distributed full-text search engine developed based on Lucene (full-text search engine). Lucene is recognized as the best search engine library so far, but the API provided by Lucene requires users to spend a lot ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an enterprise name duplicate checking method and device. The method comprises the following steps of: searching a second enterprise name matched with a first enterprise name tobe subjected to duplicate checking by utilizing ES; performing word segmentation on the first enterprise name and the second enterprise name according to structural elements, wherein the structural elements comprise administrative regions, company description and organization forms, and the company description comprises company word sizes and industry description; comparing each structural element in the first enterprise name with each structural element in the second enterprise name, and determining a first similarity corresponding to the administrative region, a second similarity corresponding to the company description and a third similarity corresponding to the organization form; determining the total similarity between each second enterprise name and the first enterprise name based on the first similarity, the second similarity and the third similarity; and determining the second enterprise name corresponding to the total similarity meeting the preset condition as the enterprisename which is the same as the first enterprise name. According to the invention, the duplicate checking precision and the duplicate checking efficiency can be improved.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method and device for checking duplicates of enterprise names. Background technique [0002] This section is intended to provide a background or context to embodiments of the invention that are recited in the claims. The descriptions herein are not admitted to be prior art by inclusion in this section. [0003] The basic requirements for the structure of an enterprise name are as follows: it generally consists of four parts, namely "administrative division + company name + industry description + organizational form", for example, in the name "Xi'an Tianrui Financial Consulting Co., Ltd.", where "Xi'an" is the administrative division, "Tianrui" is the name of the company, "Financial Consulting" is the description of the industry, and "Limited Limited" is the form of organization. [0004] For the enterprise user platform, in order to prevent the same enterprise from being re...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/279G06F40/194
CPCG06F40/279G06F40/194Y02P90/30
Inventor 田晓丹孙业宝曲婕
Owner BANK OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products