Protein interaction site identification method

A technology of interaction sites and identification methods, applied in proteomics, instrumentation, genomics, etc., can solve the problems of difficult analysis of results, reduce space and time overhead, and improve recognition accuracy, reduce space and time overhead, Avoid uneven quality effects

Pending Publication Date: 2019-09-20
ANHUI UNIVERSITY OF TECHNOLOGY
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to overcome the disadvantages in the prior art that when predicting protein interaction sites, there are different degrees of "false positive" and "false negative" characteristics, which make the result analysis more difficult, and prov...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Protein interaction site identification method
  • Protein interaction site identification method
  • Protein interaction site identification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0042] In the method for identifying protein interaction sites of the present invention, the protein chain data is collected first and the protein chain data is preprocessed. By preprocessing the protein chain data, the problem of uneven quality of the protein chain data can be avoided, namely It is more convenient for the follow-up work. Then, the preprocessed protein chain data is divided into interface residues and non-interface residues, and then the features of protein chains are extracted from the database, and the extracted features are fused to obtain a data set; it is worth noting that, by extracting features and Fusions are performed to better represent protein interactions. Afterwards, the imbalance of the data set is processed, and then the processed data set is classified and predicted to obtain protein interaction sites.

[0043] combine figure 1 As shown, a protein interaction site identification method of the present invention, the specific steps are as follo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a protein interaction site identification method which belongs to the field bioinformatics analysis. The method comprises the steps of acquiring protein chain data, performing preprocessing on the protein chain data, and dividing the preprocessed protein chain data to an interface residue and a non-interface residue; extracting a characteristic from a database, fusing the extracted characteristics for obtaining a data set, processing the unbalance of the data set, then dividing the processed data set into a training set and a testing set, training the XGBoost model by means of the training set, and finally obtaining the protein interaction site by means of the XGBoost model. The protein interaction site identification method aims to overcome a defect of relatively high result analysis difficulty caused by different degrees of false positive and false negative characteristics in predicting the protein interaction site. The protein interaction site identification method can overcome defects above and furthermore can improve identification precision of the protein interaction site.

Description

technical field [0001] The invention relates to the technical field of bioinformatics analysis, and more specifically relates to a method for identifying protein interaction sites. Background technique [0002] In all cells, proteins are the most important building blocks, and the interaction between proteins is the most fundamental activity in most cellular functions. The protein-protein interaction constitutes an important part of the cell biochemical reaction network. The protein-protein interaction network is the main way to realize the regulation of biological information and the key factor in determining the cell fate. The study of protein-protein interactions is the basis for understanding life activities and is one of the most important research fields in the post-gene era. With the implementation of the Human Genome Project, the data in the protein sequence database has increased significantly, and the structure and interaction between proteins cannot be determined...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B20/30G16B40/00
CPCG16B20/30G16B40/00
Inventor 王兵张欢汪文艳周郁明王彦程竹明
Owner ANHUI UNIVERSITY OF TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products