Disambiguation processing method, system and device for cross-enterprise personnel name duplication in industrial and commercial registration information, processor and storage medium thereof

A disambiguation processing method for industrial and commercial registration data, applied in the fields of electrical digital data processing, natural language data processing, instruments, etc. It addresses problems such as inability to implement, and achieves fast calculation and a high recall rate.
CN113269244A · Pending · Publication Date: 2021-08-17 · 上海睿翎法律咨询服务有限公司

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications (China)
Current Assignee / Owner
上海睿翎法律咨询服务有限公司
Publication Date
2021-08-17


Abstract

The invention relates to a method for disambiguating duplicate personnel names across enterprises in industrial and commercial registration information. The method comprises the following steps: collecting and filtering data from the industrial and commercial registration information to obtain a list of registered personnel; sampling this list to obtain a subset of personnel records together with the corresponding enterprise registration information; grouping the sampled data by constructing an undirected graph model, and calculating the similarity between every pair of nodes in each sub-graph generated by the model; and constructing similarity vectors from the training vectors and prediction vectors, training a logistic regression model on them, and applying similarity weighting to obtain the name disambiguation result. The invention also relates to a corresponding system, device, processor and storage medium. With the method, system, device, processor and storage medium, duplicate personnel names across enterprises can be disambiguated automatically, which provides support for analyzing association relationships between enterprises.
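The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how edges are drawn, which similarity features are used, or how the model is trained. The sketch assumes that two records are linked when the person name matches, that the similarity vector holds a same-city flag and the Jaccard overlap of co-officers, and that logistic regression is fitted by plain gradient descent. All records, field names, labels and function names below are hypothetical.

```python
import math
from collections import defaultdict
from itertools import combinations

# Hypothetical records: (record_id, person_name, company, city, co_officers)
RECORDS = [
    ("r1", "Wang Wei", "Acme Trading", "Shanghai", {"Li Na", "Zhang Min"}),
    ("r2", "Wang Wei", "Acme Logistics", "Shanghai", {"Li Na", "Chen Jie"}),
    ("r3", "Wang Wei", "Northern Steel", "Harbin", {"Zhao Lei"}),
    ("r4", "Liu Yang", "Blue Sea Tech", "Shenzhen", {"Sun Li"}),
]

def connected_components(nodes, edges):
    """Group the nodes of an undirected graph using union-find."""
    parent = {n: n for n in nodes}
    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x
    for a, b in edges:
        parent[find(a)] = find(b)
    groups = defaultdict(list)
    for n in nodes:
        groups[find(n)].append(n)
    return sorted(sorted(g) for g in groups.values())

def similarity_vector(a, b):
    """Assumed features: same city (0/1) and Jaccard overlap of co-officers."""
    same_city = 1.0 if a[3] == b[3] else 0.0
    union = a[4] | b[4]
    jaccard = len(a[4] & b[4]) / len(union) if union else 0.0
    return [same_city, jaccard]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_logistic(X, y, lr=0.5, epochs=2000):
    """Batch gradient descent on the log loss; returns (weights, bias)."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            gw = [g + err * xj for g, xj in zip(gw, xi)]
            gb += err
        w = [wj - lr * g / len(X) for wj, g in zip(w, gw)]
        b -= lr * gb / len(X)
    return w, b

def predict(w, b, x):
    """Weighted combination of the similarity features, squashed to (0, 1)."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)

by_id = {r[0]: r for r in RECORDS}
# Assumed edge rule: link two records when the person name matches.
edges = [(a[0], b[0]) for a, b in combinations(RECORDS, 2) if a[1] == b[1]]
subgraphs = connected_components(list(by_id), edges)

# Hypothetical hand-labelled pairs: 1 = same person, 0 = different people.
train_X = [[1.0, 0.5], [1.0, 0.0], [0.0, 0.0], [0.0, 0.3], [1.0, 0.8], [0.0, 0.1]]
train_y = [1, 0, 0, 0, 1, 0]
w, b = train_logistic(train_X, train_y)

# Score every pair of nodes inside each sub-graph.
scores = {}
for group in subgraphs:
    for i, j in combinations(group, 2):
        scores[(i, j)] = predict(w, b, similarity_vector(by_id[i], by_id[j]))
```

Scoring only the pairs inside each sub-graph avoids comparing every record with every other record, which is one plausible reason for the grouping step. In this toy data, the two Shanghai records that share a co-officer score higher than either does against the Harbin record.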

Description

technical field

[0001] The invention relates to the technical field of data analysis, in particular to the field of enterprise name disambiguation analysis, and specifically to a method, a system, a device, a processor and a computer-readable storage medium for disambiguating cross-enterprise personnel name duplication in industrial and commercial registration information.

Background technique

[0002] As the threshold for enterprise registration keeps falling, enterprises are paying increasing attention to managing and controlling the risk posed by the enterprises they cooperate with. This covers both a partner's own risk and its associated risk, and the risk carried by corporate executives is an important part of associated risk.

[0003] In practice, executives at different companies often share the same name. Although this information is legally graded by relevant departments, a large amount of corporate data comes ...
