Protein complex identification method based on core-accessory structure

A protein complex and identification method technology, which is applied in the field of identifying protein complexes by integrating network topology information and protein biological properties, can solve the problems of false positives in protein interaction data, ignoring protein biological features, and noise in PPI networks. Achieve the effect of expanding application range and practicability, overcoming high data noise, and improving efficiency

Pending Publication Date: 2021-11-30
YANGZHOU UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Before the present invention was proposed, the field of identification of protein complexes was initially identified by topological features of the network, for example, molecular complex detection algorithm MCODE, Markov clustering algorithm MCL, edge density map clustering algorithm DPClus, polar CMC is a clustering algorithm for identifying protein complexes, but the disadvantages of these methods for identifying key proteins are: (1) only considering the topological characteristics of the network itself, while ignoring the inherent biological characteristics of proteins
(2) There is noise in the PPI network obtained through biological experiments, which makes the protein interaction data have false positives

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Protein complex identification method based on core-accessory structure
  • Protein complex identification method based on core-accessory structure
  • Protein complex identification method based on core-accessory structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0111] We tested our proposed algorithm BOCAS on three datasets of Gavin, Krogan, and DIP respectively. Table 1 gives the detailed information of Gavin, Krogan, and DIP three data sets, including the number of proteins contained in each network and the number of interactions between proteins. Table 2 presents the information of the protein biological attribute dataset.

[0112] Table 1 Protein Interaction Network Dataset

[0113] PPI dataset protein quantity number of interactions Gavin 1430 6531 Krogan 2674 7075 DIP 5093 24743

[0114] Table 2 Protein biological attribute data set

[0115] biological data set Remark Gene Expression Profiles Version: GSE3431, includes expression values ​​for 36 time points per gene Go annotation set Go annotations include Go annotation information for 7014 proteins known complex set CYC2008

[0116] To evaluate the performance of the BO-CAS method in protein complex...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a protein complex identification method based on a core-accessory structure. The protein complex identification method based on the core-attachment structure employs the idea of core-attachment, and comprises the steps: constructing a weighted dynamic PPI network, and then finding seed nodes in the PPI network by calculating related attributes of nodes; forming a core of a protein complex through seed nodes and neighbor nodes, then searching a proper attachment for each core, and finally deleting some protein complexes with relatively large overlapping degree to obtain a finally predicted protein complex. The method not only considers the topological characteristics of the protein interaction network, but also considers the biological attributes of the protein, thereby overcoming the negative influence caused by high data noise. By fusing biological attributes and topological characteristics, the accuracy of identifying the protein complex is improved, meanwhile, the prediction result is more accurate, and the prediction efficiency is improved. And the application range and practicability of the technology in the field of biological information are expanded.

Description

technical field [0001] The invention belongs to the technical field of biological information, and provides a protein complex identification method based on the core-appendix structure, mainly through the protein complex identification technology of the core-appendix structure in the protein interaction network, especially in the protein interaction network A method for identifying protein complexes by fusing network topology information and protein biological properties. Background technique [0002] Protein is an indispensable substance in life activities and almost participates in all cycles of life activities, and protein complexes are complexes formed by two or more functionally related polypeptide chains through disulfide bonds or other protein interactions. There are many types of protein complexes, but the properties and functions of many types are still unknown so far. Therefore, identifying protein complexes has become an important research object of proteomics res...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B25/00G16B45/00
Inventor 刘维唐玉亮
Owner YANGZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products