Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Corrosion source joint de-noising method for multi-source heterogeneous big data

A multi-source heterogeneous and big data technology, applied in the information field, can solve problems such as inability to handle heterogeneous data

Active Publication Date: 2016-07-13
INST OF INFORMATION ENG CAS
View PDF2 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the drawback of the KTLQD method is that it cannot handle heterogeneous data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Corrosion source joint de-noising method for multi-source heterogeneous big data
  • Corrosion source joint de-noising method for multi-source heterogeneous big data
  • Corrosion source joint de-noising method for multi-source heterogeneous big data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] The present invention will be further described below through specific examples.

[0064] The corrosion source joint denoising method for multi-source heterogeneous big data provided by the present invention is composed of heterogeneous linear metric learning HLML and multi-source semi-supervised joint denoising MSCD algorithm, and the gradual optimization of the model is realized through a cyclic iterative process.

[0065] The HLML model in formula (7) can be simplified as:

[0066]

[0067] in, is a smooth objective function, Z=[A Z B Z ] represent optimization variables, is a closed convex set with respect to a single variable:

[0068]

[0069] Since D( ) is a continuously differentiable function with respect to the Lipschitz continuous gradient L (reference: Y.Nesterov.Introductorylecturesonconvexoptimization, volume87.SpringerScience & BusinessMedia, 2004.):

[0070]

[0071] . Therefore, it is suitable to use the Accelerated Projected Gradient (...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to a corrosion source joint de-noising method for multi-source heterogeneous big data. The corrosion source joint de-noising method comprises two models, one model is an HLML (Heterogeneous Linear Metric Learning) model, and the other one model is an MSCD (Multi-source Semi-supervised Collaborative Denoising) model; wherein the HLML model linearly projects multi-source heterogeneous data to a high-dimensional feature isomorphic space by learning a plurality of heterogeneous linear measurements, and complementary information among heterogeneous sources is fully embedded into the high-dimensional feature isomorphic space, thus semantic complementarity and distribution similarity among different sources can be effectively captured. In order to eliminate intra-source and inter-source noise, the MSCD model utilizes an elementary transformation constraint and a gradient energy competition strategy to restore the complementary relation between heterogeneous noise description in the feature isomorphic space learnt by the HLML model, so as to purify corrosion sources of the multi-source heterogeneous data, and be help to obtain an accurate and robust multi-source data evaluation and analysis result.

Description

technical field [0001] The invention belongs to the field of information technology, and aims at the problem of intra-source noise and inter-source noise in a massive multi-source heterogeneous corrosion data environment, and proposes a joint denoising method for corrosion sources of multi-source heterogeneous big data. Background technique [0002] In recent years, with the emergence of a large number of high-tech digital products, the multi-source heterogeneous data (Multi-source Heterogeneous Data) generated by these heterogeneous electronic devices has spread to every corner of people's real life. The so-called multi-source heterogeneous data refers to data that comes from different sources or channels, but expresses similar content, and appears in various styles such as different forms, different modalities, different perspectives, and different backgrounds. For example, Sina Weibo, Tencent WeChat, and Sohu.com report different forms of the same news; the brains of Alzh...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/2155G06F18/23
Inventor 张磊王树鹏云晓春
Owner INST OF INFORMATION ENG CAS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products