Protein complex recognizing method based on multi-source data fusion and multi-target optimization

A protein complex identification technology based on multi-objective optimization, applied in the field of bioinformatics. It addresses the high false-positive and false-negative rates of protein interaction data, which reduce the recognition accuracy of protein complexes, and aims to widen the search range, improve computational efficiency, and improve recognition speed and accuracy.

Inactive Publication Date: 2018-05-08
CHINA UNIV OF GEOSCIENCES (WUHAN)

AI Technical Summary

Problems solved by technology

However, the protein interaction data obtained by high-throughput sequencing technology often have high false positives and




Embodiment Construction

[0048] In order to make the purpose, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be further described below in conjunction with the accompanying drawings.

[0049] Referring to Figure 1 and Figure 2, an embodiment of the present invention provides a protein complex identification method based on multi-source data fusion and multi-objective optimization, comprising the following steps:

[0050] S1. The protein interaction network is regarded as a fully connected graph and preprocessed, and its adjacency matrix is obtained;

[0051] A protein interaction database is obtained from a public website, and the protein interaction network is abstracted as a network graph G=(V, E) formed by multiple protein nodes and the interactions between them, where V is the set of protein nodes and E is the set of edges representing interactions between protein nodes. Since there are some redundant data of self-interactions and re-...
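The preprocessing in step S1 can be illustrated with a minimal sketch. It assumes the interaction data arrive as a list of protein-name pairs, and that the redundant data mentioned above are self-interactions and repeated pairs (the second part is a reading of the truncated sentence, not something the text confirms). The function name `build_adjacency` and the sample protein names are hypothetical.

```python
def build_adjacency(interactions):
    """Build an undirected 0/1 adjacency matrix from an edge list.

    Assumptions (not stated in the source): interactions are
    (protein, protein) pairs, and redundancy means self-interactions
    and repeated pairs in either order.
    """
    edges = set()
    for a, b in interactions:
        if a == b:
            continue  # drop self-interactions
        edges.add(frozenset((a, b)))  # (a, b) and (b, a) collapse to one edge

    # V: proteins that still take part in at least one interaction
    proteins = sorted({p for edge in edges for p in edge})
    index = {p: i for i, p in enumerate(proteins)}

    n = len(proteins)
    matrix = [[0] * n for _ in range(n)]
    for edge in edges:
        a, b = tuple(edge)
        matrix[index[a]][index[b]] = 1
        matrix[index[b]][index[a]] = 1  # keep the matrix symmetric
    return proteins, matrix


if __name__ == "__main__":
    raw = [("P1", "P2"), ("P2", "P1"), ("P3", "P3"), ("P2", "P4")]
    proteins, matrix = build_adjacency(raw)
    print(proteins)
    for row in matrix:
        print(row)
```

In this sketch a protein whose only record is a self-interaction is left out of V. Whether the patented method keeps such nodes as isolated vertices is not stated in the visible text.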



Abstract

The invention discloses a protein complex identification method based on multi-source data fusion and multi-objective optimization, comprising: preprocessing protein interaction network data to obtain an adjacency matrix; performing a preliminary clustering of protein complexes to obtain initial protein complex modules; further optimizing the initial modules by fusing the topological features of the protein interaction network data with the functional similarity features of GO (Gene Ontology) annotation data and running an adaptive multi-objective black hole optimization algorithm to obtain more accurate protein complex modules; and postprocessing to obtain the final optimal protein complexes. The method increases the speed and accuracy of protein complex identification, is applicable to protein interaction networks, can be extended to the analysis of other complex community networks, and is of practical value in complex network analysis.

Description

Technical field

[0001] The invention relates to the field of bioinformatics, in particular to a method for identifying protein complexes based on multi-source data fusion and multi-objective optimization.

Background technique

[0002] Protein is the product of gene expression, the executor of the physiological functions of organisms, and the direct embodiment of life phenomena. Proteomics is the systematic study of the properties contained in proteins, which can provide a detailed description of the structure, function and regulation of biological systems in health and disease states. Almost all biological processes are completed through a series of protein interactions. From the perspective of systems biology, the use of protein interaction networks to study and analyze biological functions has important prospects and practical value.

[0003] A protein complex is a collection of proteins composed of a multi-molecular mechanism through interactions at the same time and sp...


Application Information

IPC(8): G06F19/24, G06F19/18, G06F19/12
Inventor 朱媛彭晓宇吴崇
Owner CHINA UNIV OF GEOSCIENCES (WUHAN)