Key protein recognition method based on largest neighbor sub-network

A key protein identification method, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses incomplete interaction data, contamination by false positive and false negative data, and the resulting inability to predict key proteins efficiently, and achieves the effect of improving recognition accuracy.

Active Publication Date: 2018-11-13
CENT SOUTH UNIV

AI Technical Summary

Problems solved by technology

[0005] Due to the limitations of existing experimental techniques and other factors, the currently available interaction data are incomplete and contain many false positives (reported interactions that do not actually exist) and false negatives (interactions that actually exist but have not been detected), so existing centrality methods based on protein networks cannot efficiently predict key proteins.



Examples


Embodiment Construction

[0022] Biological data sets used in the present invention: Yeast is the most widely used species in current research, and existing public databases contain a large number of experimentally determined key proteins for yeast. The yeast protein interaction data set used in the present invention is downloaded from the DIP database, and self-interactions and repeated interactions in the data set are removed to obtain the original protein interaction network. Due to deficiencies in experimental techniques and other aspects, some false positive and false negative data inevitably exist in protein interaction data. To reduce the impact of these erroneous data on the identification of key proteins, the present invention uses the subcellular localization information of yeast proteins in the COMPARTMENTS database to partition the original network. The database integrates relevant subcellular localization information from databases such as UniProtKB, MGI, SGD, FlyBase, and Wor...
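The preprocessing step described in [0022] (removing self-interactions and repeated interactions) can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name clean_interactions, the protein identifiers P1 to P3, and the assumption that interactions are undirected pairs are all hypothetical, and parsing of the DIP download is not shown.

```python
def clean_interactions(pairs):
    """Drop self-interactions and repeated interactions.

    Assumption: interactions are undirected, so (A, B) and (B, A)
    count as the same interaction. The first occurrence is kept.
    """
    seen = set()
    cleaned = []
    for a, b in pairs:
        if a == b:
            # Self-interaction: a protein paired with itself.
            continue
        key = frozenset((a, b))
        if key in seen:
            # Repeated interaction, possibly listed in reverse order.
            continue
        seen.add(key)
        cleaned.append((a, b))
    return cleaned


# Hypothetical toy input, not taken from the DIP database.
raw_pairs = [("P1", "P2"), ("P2", "P1"), ("P1", "P1"), ("P1", "P3")]
edges = clean_interactions(raw_pairs)
print(edges)
```

Keying each pair by a frozenset makes the duplicate check independent of the order in which the two proteins are listed.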


Abstract

The invention discloses a key protein recognition method based on a largest neighbor sub-network. The method first divides the obtained protein interaction network into sub-networks according to subcellular localization information, placing proteins located in the same subcellular compartment in the same sub-network, thereby obtaining a plurality of protein sub-networks whose interactions occur within the same subcellular compartment. Then, for each protein, topological feature analysis is performed on the largest sub-network in which that protein is located, and a comprehensive key-protein score is calculated based on common neighbors. A higher score indicates that the protein is more likely to be a key protein, so potential key proteins are predicted by sorting the scores. While remaining simple and practical, the method improves the accuracy of key protein recognition and provides a useful reference for researchers carrying out experimental analysis and deeper study of these proteins.
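The pipeline in the abstract (partition by compartment, pick each protein's largest sub-network, score by common neighbors, rank) can be sketched as below. Several parts are assumptions made only for illustration, because this page does not give the details: sub-network size is measured by protein count, a protein may belong to several compartments, and the score is a stand-in (the sum, over a protein's neighbors in its largest sub-network, of the number of neighbors they share). The function names, compartment labels, and toy data are hypothetical.

```python
from collections import defaultdict


def build_subnetworks(edges, localization):
    """Return {compartment: {protein: set(neighbors)}}.

    An edge is kept in a compartment only if both proteins are
    annotated to that compartment.
    """
    subnets = defaultdict(lambda: defaultdict(set))
    for a, b in edges:
        shared = localization.get(a, set()) & localization.get(b, set())
        for comp in shared:
            subnets[comp][a].add(b)
            subnets[comp][b].add(a)
    return {comp: dict(adj) for comp, adj in subnets.items()}


def score_proteins(edges, localization):
    """Score each annotated protein inside its largest sub-network.

    Stand-in score (an assumption, not the patent's formula):
    sum over neighbors v of |N(u) & N(v)| within that sub-network.
    """
    subnets = build_subnetworks(edges, localization)
    scores = {}
    for protein, comps in localization.items():
        candidates = [subnets[c] for c in comps
                      if c in subnets and protein in subnets[c]]
        if not candidates:
            scores[protein] = 0
            continue
        # Assumption: "largest" means the most proteins.
        largest = max(candidates, key=len)
        neighbors = largest[protein]
        scores[protein] = sum(len(neighbors & largest[v]) for v in neighbors)
    return scores


# Hypothetical toy network and localization annotations.
toy_edges = [("P1", "P2"), ("P1", "P3"), ("P2", "P3"), ("P3", "P4")]
toy_localization = {
    "P1": {"nucleus"},
    "P2": {"nucleus"},
    "P3": {"nucleus", "cytosol"},
    "P4": {"cytosol"},
}
toy_scores = score_proteins(toy_edges, toy_localization)
# Higher score first; ties broken by name for a stable order.
ranking = sorted(toy_scores, key=lambda p: (-toy_scores[p], p))
print(ranking)
```

In the toy data, P3 belongs to two compartments and is scored in the nucleus sub-network because it contains more proteins than the cytosol one. Proteins without localization annotations are not scored in this sketch.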

Description

Technical field

[0001] The invention belongs to the technical field of systems biology, and mainly relates to a method for dividing a protein interaction network into sub-networks according to subcellular localization information and identifying key proteins based on network topology characteristics.

Background

[0002] Protein is an indispensable component of cell and tissue structure, and is a high-molecular organic substance that sustains important life activities and carries out physiological functions. There are many types of proteins in organisms, which participate in various life activities such as nutrient transport, physiological function regulation, and biochemical reactions. Different kinds of proteins differ markedly in their importance to organisms. Among them, the loss of certain proteins will cause the loss of biological functions of the organism, resulting in pathological changes or cessation of growth and deve...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F19/18
Inventor: 李敏, 李文凯, 王建新
Owner: CENT SOUTH UNIV