Data fusion method based on voting mode

A technology of data fusion and data pairing, which is applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems that cannot be widely used, achieve the effect of improving efficiency and accuracy, and expanding the scope of application

Active Publication Date: 2017-03-15
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF3 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Purpose of the invention: In order to solve the technical problem that the existing algorithm for eliminating data redundancy can only work in some specific situations, but cannot be used generally, this invention proposes a data fusion method based on voting

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data fusion method based on voting mode
  • Data fusion method based on voting mode
  • Data fusion method based on voting mode

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] Aiming at the fusion of data pairs, the present invention proposes a data fusion method based on a voting method. The present invention will be further described below in conjunction with the accompanying drawings.

[0035] The principle of the present invention is as figure 1 shown, including the following steps:

[0036] 1) Analyze the principle and applicability of the existing algorithms, and divide the algorithms into several groups;

[0037] 2) For a given data pair, each algorithm independently gives a judgment or approximation, that is, each algorithm independently votes;

[0038] 3) Determine whether the data pair represents the same entity. If so, end; otherwise, go to the next step.

[0039] 4) Execute the approximation calculation based on the fusion of multiple algorithms to calculate the approximation of the data pair.

[0040] 5) According to the calculation result of step 4), it is judged whether the data represent the same entity.

[0041] It can b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a data fusion method based on a voting mode. The method includes the following steps that (1) existing algorithms are subjected to principle and applicability analysis and divided into a plurality of sets; (2) for given data pairs, judgment or the approximation degree is independently given through every algorithm, namely, the voting process; (3) whether the data pairs represent same entities or not is judged, if the data pairs represent the same entities, data fusion is completed, and if the data pairs do not represent the same entities, the next step is carried out; (4) the method based on multiple-algorithm fusion is carried out, the approximation degree of the data pairs is calculated; (5) whether data represents the same entities or not is judged. According to the data fusion method, due to the existing data connection algorithms and the field advantages of the existing data connection algorithms, the algorithm interdisciplinary defects are overcome, and the accuracy and the recalling rate of data redundancy removing can be increased.

Description

technical field [0001] The invention relates to the fields of data management and data analysis, in particular to a data fusion method based on voting. Background technique [0002] For most databases and data applications, users hope that the data in the database (or data set) is unique, including a unique expression, that is, there is no redundant data. However, in reality, data redundancy will inevitably occur. There are many reasons for data redundancy, such as inconsistent spelling of multi-source data, abbreviations and abbreviations, word order reversal, etc. One of the main purposes of data fusion is to eliminate data redundancy and combine multiple sources of data into a whole. [0003] The process of eliminating data redundancy can be understood as judging whether a data pair represents the same entity, and if it is the same entity, the fusion operation can be performed. There are already several (classes) of algorithms to solve this problem, such as algorithms ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/215
Inventor 李鑫秦小麟
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products