Unlock instant, AI-driven research and patent intelligence for your innovation.

Similar entity identification method and system based on centrally connected subgraphs

A technology for connected subgraphs and entity recognition, applied in the field of big data, can solve the problems of complex logical sorting process and high cost, and achieve the effect of wide application and improved accuracy.

Inactive Publication Date: 2017-09-05
SOUTH CHINA NORMAL UNIVERSITY
View PDF4 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the logical combing process is complicated, or the cost is relatively high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Similar entity identification method and system based on centrally connected subgraphs
  • Similar entity identification method and system based on centrally connected subgraphs
  • Similar entity identification method and system based on centrally connected subgraphs

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] refer to figure 1 , a similar entity recognition method based on a central connected subgraph of the present invention, comprising the following steps:

[0044] Transform the entities that need to be compared into descriptions through centrally connected subgraphs;

[0045] Perform similarity calculation on the central connected subgraph to obtain the total similarity;

[0046] It is judged whether the total similarity is greater than the preset similarity threshold, and if so, it is judged as similar; otherwise, it is judged as dissimilar.

[0047] refer to figure 2 , further as a preferred embodiment, the described similarity calculation is performed on the central connected subgraph to obtain the total similarity, this step specifically includes:

[0048] Get the two centrally connected subgraphs of the input;

[0049] Structural similarity calculation and semantic similarity calculation are performed on two center-connected subgraphs to obtain structural similari...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a similar entity identification method and system based on centrally connected subgraphs. The method includes the steps that entities which need to be compared are converted into the centrally connected subgraphs and are described; similarity calculation is conducted on the centrally connected subgraphs, and total similarity is obtained; whether or not the total similarity is greater than a preset similarity threshold value is determined, and if the total similarity is greater than the preset similarity threshold value, it is determined that the entities are similar; otherwise, it is determined that the entities are dissimilar. The system includes a conversion unit, a similarity calculation unit and a similarity determination unit. According to the similar entity identification method and system based on the centrally connected subgraphs, by converting the entities into the centrally connected subgraphs, the overall similarity calculation can be carried out, compared with the prior art which can simply aim at databases, researched entities are more abstract, the application is wider, similarity comparison can be conducted in combination with structures and semantic information, and the accuracy of the similarity calculation is effectively improved. The similar entity identification method and system based on the centrally connected subgraphs can be applied to the field of the databases.

Description

technical field [0001] The invention relates to the field of big data technology, in particular to a method and system for identifying similar entities based on a central connected subgraph. Background technique [0002] Data fusion can become a research hotspot in the computer field, which is closely related to the actual needs and the huge potential of data fusion technology. Data fusion was originally proposed due to the needs of military operations. It is a data horizontal comprehensive information processing technology formed to coordinate, integrate and integrate the data information of multiple sensors on various combat equipment. Therefore, early domestic researchers who studied data fusion understood data fusion as a technical idea from a technical point of view, and regarded it as a general term for multi-source information coordinated processing technology. With the rapid development of computer science and technology, the concept of data fusion is no longer limi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/288G06F40/295
Inventor 赵淦森廖智锐庄序填吴杰超任雪琦余达明汤庸马朝辉王欣明聂瑞华
Owner SOUTH CHINA NORMAL UNIVERSITY