Protein functional module excavating method for multi-view data fusion

A protein function and data fusion technology, applied in the field of data mining, can solve problems such as high noise

Status: Inactive; Publication Date: 2014-02-05
BEIJING UNIV OF TECH

AI Technical Summary

Problems solved by technology

[0005] Aiming at the high noise problem of protein interaction data, a pro


Examples

Embodiment Construction

[0044] Detailed implementation

[0045] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0046] The basic idea of the multi-data/multi-view integrated non-negative matrix factorization method used in the present invention is that proteins with the same function generally tend to be closely connected in the interaction network, to have similar expression patterns in the gene expression profile, and to carry similar semantic information in functional annotation systems. To detect the information that is consistent across views, the present invention represents each view A(i) as a linear combination of basis vectors and uses the product of three factors to compute an approximate decomposition of the multi-view data; at the same time, a constraining penalty factor is added to guide the convergence of the objective f...
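The three-factor decomposition outlined in [0046] can be illustrated with a minimal sketch. The objective function is truncated in this text, so the following is not the patented algorithm: it assumes a generic symmetric tri-factorization A(i) ≈ H S(i) Hᵀ with one membership matrix H shared by all views, uses a common multiplicative-update heuristic, and omits the penalty factor entirely. The names `joint_tri_nmf`, `H`, and `S` are hypothetical.

```python
import numpy as np

def joint_tri_nmf(views, k, n_iter=300, seed=0, eps=1e-9):
    """Approximate each symmetric non-negative view A_i by H @ S_i @ H.T.

    Assumption: H (proteins x modules) is shared across views, and each
    S_i (modules x modules) is view-specific. No penalty term is included.
    """
    rng = np.random.default_rng(seed)
    n = views[0].shape[0]
    H = rng.random((n, k)) + eps
    S = []
    for _ in views:
        s = np.eye(k) + 0.1 * rng.random((k, k))
        S.append((s + s.T) / 2)  # keep each S_i symmetric
    errors = []
    for _ in range(n_iter):
        # Update each view-specific core matrix S_i with H fixed.
        for i, A in enumerate(views):
            HtH = H.T @ H
            S[i] *= (H.T @ A @ H) / (HtH @ S[i] @ HtH + eps)
        # Damped multiplicative update of the shared H (heuristic).
        num = sum(A @ H @ Si for A, Si in zip(views, S))
        den = sum(H @ Si @ (H.T @ H) @ Si for Si in S) + eps
        H *= (num / den) ** 0.25
        errors.append(sum(np.linalg.norm(A - H @ Si @ H.T) ** 2
                          for A, Si in zip(views, S)))
    return H, S, errors

# Toy data: three noisy views of the same two-module structure (6 proteins).
block = np.kron(np.eye(2), np.ones((3, 3)))
noise_rng = np.random.default_rng(1)
views = []
for _ in range(3):
    noise = 0.05 * noise_rng.random((6, 6))
    views.append(block + (noise + noise.T) / 2)

H, S, errors = joint_tri_nmf(views, k=2)
modules = H.argmax(axis=1)  # assign each protein to its strongest module
```

Reading module membership from the largest entry in each row of `H` is one simple convention; a thresholding rule that allows overlapping modules would be an equally plausible choice.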


Abstract

The invention belongs to the field of data mining and discloses a protein functional module mining method based on multi-view data fusion. The method comprises the following steps: first, the strength of the interactions between proteins is described quantitatively from multiple data sources to form multi-view data; next, a unified matrix decomposition is performed on the multi-view data using the integrated (aggregated) non-negative matrix factorization algorithm provided by the invention; finally, the functional modules of the proteins are determined by obtaining the optimal approximation of the multi-view information. The method aims to analyze multiple kinds of biological data simultaneously, including gene co-expression, GO annotation, and the PPIN, and can extract the protein functional modules whose aggregation characteristics are most consistent across the views. The method is especially suitable for protein interaction networks and biological data, and can also be applied to community detection problems in complex social networks and communication networks.
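The first step of the abstract, turning several data sources into multi-view data, can be sketched as follows. The specific quantification used by the invention is not given in this text, so the measures below are assumptions chosen for illustration: clipped Pearson correlation for co-expression, Jaccard overlap of annotation sets for GO, and a binary adjacency matrix for the PPIN. All function names, annotation labels, and data values are hypothetical.

```python
import numpy as np

def coexpression_view(expr):
    """expr: proteins x conditions. Pearson correlation, negatives clipped to 0.

    Note: a row with zero variance would produce NaN in np.corrcoef.
    """
    view = np.clip(np.corrcoef(expr), 0.0, None)
    np.fill_diagonal(view, 0.0)
    return view

def go_view(annotations):
    """annotations: list of sets of GO terms. Jaccard similarity per pair."""
    n = len(annotations)
    view = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            union = annotations[i] | annotations[j]
            if union:
                shared = annotations[i] & annotations[j]
                view[i, j] = view[j, i] = len(shared) / len(union)
    return view

def ppi_view(n, edges):
    """edges: iterable of (i, j) index pairs. Symmetric 0/1 adjacency."""
    view = np.zeros((n, n))
    for i, j in edges:
        view[i, j] = view[j, i] = 1.0
    return view

# Toy inputs for four proteins (made-up values and labels).
expr = np.array([[1.0, 2.0, 3.0],
                 [2.0, 4.0, 6.1],
                 [3.0, 2.0, 1.0],
                 [1.0, 3.0, 2.0]])
annotations = [{"term_a"}, {"term_a", "term_b"}, {"term_c"}, set()]
edges = [(0, 1), (2, 3)]

coexp = coexpression_view(expr)
go = go_view(annotations)
ppi = ppi_view(4, edges)
multi_view = [coexp, go, ppi]  # one non-negative symmetric matrix per view
```

Each view is a non-negative symmetric matrix over the same protein index, which is the input shape a joint non-negative factorization expects.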

Description

Technical field

[0001] The invention belongs to the field of data mining and relates to a method for detecting protein functional modules that integrates multiple biological data sources with a protein-protein interaction network (PPIN).

Background art

[0002] Analyzing the specific functions of proteins on the basis of protein interaction networks is an active topic in bioinformatics research. A protein-protein interaction (PPI) describes either a direct physical connection between two proteins or an indirect connection between two proteins with consistent functions. A PPIN takes each protein as a node and the relationship between two proteins as an edge between the corresponding nodes, forming an undirected graph. In organisms, most proteins interact to form functionally closely related sets, that is, functional modules, which jointly carry out one or more life activities. Therefore, analyzing the functional significance of PPI is to un...
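The undirected-graph view of a PPIN described in [0002] can be illustrated with a small adjacency-set sketch. The helper `build_ppin` and the protein identifiers are hypothetical and serve only to show the data structure.

```python
from collections import defaultdict

def build_ppin(interactions):
    """Build an undirected graph: protein -> set of interacting proteins."""
    graph = defaultdict(set)
    for a, b in interactions:
        if a == b:
            continue  # ignore self-interactions in this sketch
        graph[a].add(b)
        graph[b].add(a)  # undirected: store the edge in both directions
    return dict(graph)

# Hypothetical interaction pairs.
interactions = [("P1", "P2"), ("P1", "P3"), ("P2", "P3"), ("P4", "P5")]
ppin = build_ppin(interactions)
degree = {protein: len(neighbors) for protein, neighbors in ppin.items()}
```

In this toy network, P1, P2, and P3 are fully connected to one another, which is the kind of dense subgraph that a functional module would correspond to.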


Application Information

IPC(8): G06F19/24
Inventors: JIA Kebin (贾克斌), ZHANG Yuan (张媛)
Owner: BEIJING UNIV OF TECH