Method for identifying protein functions based on protein-protein interaction network and network topological structure features

A technology of network topology and protein function, applied in the field of proteomics, can solve the problems of changing the biological function of protein, not taking into account the different biological functions of homologous proteins, etc.

Inactive Publication Date: 2015-12-09
SYSU CMU SHUNDE INT JOINT RES INST +2
View PDF3 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these methods have some defects, for example: homology-based methods do not take into account the fact that homologous proteins can acquire different biological functions in the long evolutionary process
Structural sim

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for identifying protein functions based on protein-protein interaction network and network topological structure features
  • Method for identifying protein functions based on protein-protein interaction network and network topological structure features
  • Method for identifying protein functions based on protein-protein interaction network and network topological structure features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0058] 1. Collect data sets and construct protein interaction networks weighted by nodes and edges

[0059] Human protein-protein interaction data were collected from the HIPPIE database, removing self-interactions, repeated interactions, and interactions with an interaction score of 0. According to the protein acquisition number, the protein primary structure data was obtained from the UniprotKB / Swiss-Prot database, and the amino acid composition, dipeptide composition, autocorrelation descriptors and protein primary structure descriptors such as composition, transformation and distribution were calculated. A protein-protein interaction network weighted by nodes and edges is constructed. Nodes represent proteins, edges represent interactions, node weights are the primary structural features of proteins, and edge weights are interaction trust scores.

[0060] 2. Collect protein function annotation data and build a data set

[0061] Proteins with molecular function and biologi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for identifying protein functions based on a protein-protein interaction network and network topological structure features. Firstly, a node and side-weighted protein-protection interaction network is established, wherein the node represents protein while the edge represents the interaction; then the nodes and the sides in the network are weighted by protein first-grade structural description and protein-protein interaction trust scoring; protection functional annotation data is collected to establish a data set, and a new protein with overall and local information network topological structure features is provided based on a graph theory; and finally, the protein functions are predicated by choosing features through adopting a minimum-redundancy maximum-correlation method and by modeling through a support vector machine. The protein function predication method is greatly better than the prior art, and has robustness on sequence similarity and sampling; and meanwhile, information of three-dimensional structure and the like of protein is not required, so that the method is simple, rapid, accurate and efficient, and the method is expected to be applied in the research fields of proteomics and the like.

Description

technical field [0001] The invention belongs to the technical field of proteomics technology. More specifically, it relates to a method for identifying protein functions based on protein interaction networks and network topology features. Background technique [0002] A protein is a biomacromolecule composed of one or more peptide chains, and is the basic functional unit of cells. In organisms, proteins perform a variety of biological functions, such as DNA replication and transcription, catalyzing biological reactions, transporting biologically active molecules, etc. Therefore, the functional annotation of proteins is an important step in elucidating the nature of life at the molecular level, and plays an important role in the study of pathological mechanisms and the identification of potential drug targets. With the rapid development of high-throughput sequencing technology, various databases have stored massive amounts of protein sequence information. However, only a s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/18
Inventor 李占潮邹小勇戴宗
Owner SYSU CMU SHUNDE INT JOINT RES INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products