Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for predicting epitope through cost-sensitive integrating and clustering on basis of sequence

A cost-sensitive, prediction table technology, applied in the field of computational bioinformatics, to achieve the effect of true and reliable prediction results

Inactive Publication Date: 2016-08-17
NORTHEAST NORMAL UNIVERSITY
View PDF4 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the current research on conformational epitope prediction is still immature, more and more researchers have realized the importance of this research and began to focus on it

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for predicting epitope through cost-sensitive integrating and clustering on basis of sequence
  • Method for predicting epitope through cost-sensitive integrating and clustering on basis of sequence
  • Method for predicting epitope through cost-sensitive integrating and clustering on basis of sequence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] In order to understand the technical content of the present invention more carefully and clearly, in combination with figure 1 , figure 2 The present invention is described in detail. In particular, the examples are only used to illustrate the present invention, not to limit the present invention.

[0018] The method for predicting epitopes based on sequence using cost-sensitive integration and clustering of the present invention comprises the following steps:

[0019] (1) Feature construction: Based on the analysis of the characteristics of the antigen surface residues, the descriptive features of the antigenic and non-antigenic residues are calculated.

[0020] (2) Feature selection: For the constructed full feature matrix, select features with higher discrimination and more accurate descriptiveness, and build an optimal feature subset on this basis.

[0021] (3) Ensemble learning: In order to solve the problem of data sample imbalance and improve prediction perfo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to a computational biology information technique, and particularly relates to a method for predicting epitope through cost-sensitive integrating and clustering on the basis of a sequence. The method comprises the main steps that 1, descriptive features of antigen protein residues are constructed, wherein the features comprise the evolutionary conservation feature, the secondary structure feature, the disordered region feature, the dipeptide composition feature and physical and chemical attributes; 2, an optimal feature subset is selected through Fisher-Markov and an incremental iterative feature selection method; 3, unbalanced data sets are processed through cost-sensitive integrating learning; 4, potential epitope residues are predicted from antigenic determination residues through a spatial clustering algorithm. The method is suitable for antigen protein epitope prediction of known and unknown structure information and is also suitable for large-scale application and popularization.

Description

technical field [0001] The invention belongs to computational biological information technology, in particular to a method for predicting epitopes based on sequence-based cost-sensitive integration and clustering. Background technique [0002] With the development of the economy and the improvement of living standards, the demand for food, clothing, housing and transportation is not as unsatisfactory as in the era of shortage economy. People are turning their attention to health, and the corresponding industries are ushering in rapid development. As China gradually enters an aging society, the investment of the state and individuals in medicine is increasing year by year. The world of biopharmaceutical and vaccine production faces enormous opportunities. According to statistics, a person's medical expenses after the age of 60 account for more than 50% of his lifetime medical expenses on average. In 2010, the global pharmaceutical and vaccine market was close to US$25 bi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F19/18G06K9/62
CPCG16B20/00G06F18/2321
Inventor 马志强张健柴海挺高博
Owner NORTHEAST NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products