Data fusion method and device

A data fusion technology, applied in the field of data processing, that can solve the problems of insufficient data fusion and a low data fusion rate.

Active Publication Date: 2018-09-25
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

However, strict matching of features based on strings will result in a low fusion rate of da



Embodiment Construction

[0096] The technical solutions in the embodiments of this application are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some of the embodiments of this application, not all of them. All other embodiments obtained by persons of ordinary skill in the art from the embodiments in this application, without creative effort, fall within the scope of protection of this application.

[0097] The present invention provides a data fusion method. Refer to Figure 1, which is a flow chart of a data fusion method provided by an embodiment of the present invention. The method may specifically include:

[0098] S101: Extract the attributes in the first data and the second data, where the first data and the second data each include correspondences between attributes and attribute values.

[0099] Both the first data and the second data in the embodiment of the prese...


Abstract

The invention discloses a data fusion method and device. The method comprises: extracting the attributes in first data and second data, where the first data and the second data each comprise correspondences between attributes and attribute values; calculating semantic similarity values between the attributes, identifying the semantic similarity values that are larger than a first preset threshold, and taking the attributes corresponding to each such value as a pair of common attributes of the first data and the second data; and determining the similarity value between the first data and the second data by comparing the attribute values corresponding to each pair of common attributes, then fusing the first data and the second data if that similarity value is larger than a second preset threshold. The data fusion method and device increase the rate of data fusion while preserving its accuracy.
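The steps in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the toy vectors in `TOY_VECTORS`, the cosine measure used as the "semantic similarity", the rule that record similarity is the fraction of common attribute pairs with equal values, and both threshold defaults are hypothetical choices made only for this example.

```python
import math

# Hypothetical toy vectors for attribute names. A real system would obtain
# semantic representations from a trained model; these numbers are made up.
TOY_VECTORS = {
    "singer": (0.9, 0.1, 0.0),
    "artist": (0.85, 0.2, 0.0),
    "duration": (0.0, 0.9, 0.1),
    "length": (0.1, 0.85, 0.1),
    "release_date": (0.0, 0.1, 0.9),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def attribute_similarity(a, b):
    """Assumed semantic similarity between two attribute names."""
    if a == b:
        return 1.0
    if a in TOY_VECTORS and b in TOY_VECTORS:
        return cosine(TOY_VECTORS[a], TOY_VECTORS[b])
    return 0.0


def common_attribute_pairs(first, second, first_threshold):
    """Attribute pairs whose similarity exceeds the first preset threshold."""
    return [
        (a, b)
        for a in first
        for b in second
        if attribute_similarity(a, b) > first_threshold
    ]


def record_similarity(first, second, pairs):
    """Assumed measure: share of common attribute pairs with equal values."""
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if first[a] == second[b])
    return matches / len(pairs)


def fuse(first, second, first_threshold=0.9, second_threshold=0.5):
    """Return the fused record, or None if the records are not similar enough."""
    pairs = common_attribute_pairs(first, second, first_threshold)
    if record_similarity(first, second, pairs) <= second_threshold:
        return None
    merged = dict(first)
    paired_in_second = {b for _, b in pairs}
    for key, value in second.items():
        if key not in paired_in_second:  # keep attributes only the second record has
            merged[key] = value
    return merged


first = {"title": "Wangqingshui", "singer": "Andy Lau", "duration": "4 minutes"}
second = {"title": "Wangqingshui", "artist": "Andy Lau", "release_date": "1994"}
fused = fuse(first, second)
```

Here `singer` and `artist` are paired even though the strings differ, which is the case that strict string matching would miss. The fused record keeps the first record's attribute names for paired attributes and adds `release_date` from the second record.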

Description

Technical field

[0001] The invention relates to the field of data processing, in particular to a data fusion method and device.

Background

[0002] Data fusion merges and deduplicates data that point to the same entity, while retaining data that point to different entities. For example, the song "Wangqingshui" from QQ Music is stored in the song library with several attributes, such as the singer Andy Lau and a length of 4 minutes. The song "Wangqingshui" from Xiami Music is also stored in the song library, with attributes including the singer Andy Lau and the release date 1994. Since the two entries are essentially the same song, the system needs to fuse them to avoid song query errors, that is, merge them into one song "Wangqingshui" and store it in the song library. The fused song contains all the attributes of both songs.

[0003] In the process of data fusion, it is necessary to first dete...
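The merge-and-deduplicate behaviour described in the song example can be sketched as below. This is only an illustration: identifying "the same entity" by an exact match on a hypothetical `(title, singer)` key is a simplification chosen for the example, and the second song in the sample library is invented placeholder data.

```python
def merge_library(records):
    """Merge records that share a (title, singer) key; keep all other records."""
    merged = {}
    for record in records:
        key = (record["title"], record["singer"])  # hypothetical identity key
        entry = merged.setdefault(key, {})
        for attr, value in record.items():
            # Union of attributes; the earlier value wins if both define one.
            entry.setdefault(attr, value)
    return list(merged.values())


library = [
    {"title": "Wangqingshui", "singer": "Andy Lau", "duration": "4 minutes"},
    {"title": "Wangqingshui", "singer": "Andy Lau", "release_date": "1994"},
    {"title": "Placeholder Song", "singer": "Placeholder Singer"},  # invented entry
]
fused_library = merge_library(library)
```

After the call, the two "Wangqingshui" entries are stored as one record carrying the singer, the duration, and the release date, while the unrelated entry is retained unchanged.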


Application Information

IPC(8): G06F17/27, G06F17/30, G06F40/237
CPC: G06F40/30, G06F40/237, G06F16/00
Inventor: 甘骏, 苏可, 饶孟良
Owner: TENCENT TECH (SHENZHEN) CO LTD