Anticancer peptide prediction method based on bidirectional long-short-term memory network and feature fusion

A long-term and short-term memory, feature fusion technology, applied in neural learning methods, biological neural network models, for analyzing two-dimensional or three-dimensional molecular structures, etc., can solve problems such as expensive, complex, time-consuming, etc. rate effect

Pending Publication Date: 2022-08-05
CHANGZHOU UNIV
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, Vidal et al. identified peptide cocktails targeting intracellular tumor proteins through the yeast two-hybrid system, and Peelle et al. discovered novel targeting peptides that are not cell-type specific through mammalian cell screening; however, these identification methods are time-consuming and expensive , is very complex and difficult to achieve in a high-throughput manner, so rapid and effective identification of anticancer peptides is particularly important
[0003] Wu et al. proposed a PTPD model, using feature vectors extracted by k-mer and Word2vec (word vectors), and input them into a convolutional neural network (CNN) to predict peptides; Rao et al. applied graph convolutional networks (GCN) to anti-cancer In the prediction of peptides, the ACP-GCN model is proposed; however, these deep learning methods only consider the original sequence information and physical and chemical properties of amino acids, ignoring the long-term correlation information of anti-cancer peptides at the time level, and cannot be low-cost, fast and efficient. anticancer peptide

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Anticancer peptide prediction method based on bidirectional long-short-term memory network and feature fusion
  • Anticancer peptide prediction method based on bidirectional long-short-term memory network and feature fusion
  • Anticancer peptide prediction method based on bidirectional long-short-term memory network and feature fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The present invention will be further described below with reference to the accompanying drawings and embodiments. This figure is a simplified schematic diagram, and only illustrates the basic structure of the present invention in a schematic manner, so it only shows the structure related to the present invention.

[0056] like figure 1 As shown, the anticancer peptide prediction method based on bidirectional long short-term memory network and feature fusion includes the following steps:

[0057]Step 1. Read the four benchmark peptide sequence data sets, and analyze the amino acid composition of the data sets. The data sets are in Table 1. The data set analysis is as follows image 3 shown;

[0058] Table 1 Four benchmark peptide sequence datasets

[0059]

[0060]

[0061] Amino acid alphabet coding The primary letter sequence of the peptide is numerically encoded, that is, the 20 basic amino acids are assigned numbers 1-20, and the peptide sequences with insuf...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of anticancer peptide prediction, in particular to an anticancer peptide prediction method based on a bidirectional long-short-term memory network and feature fusion, which comprises the following steps: reading four reference peptide sequence data sets, and carrying out amino acid composition analysis on the data sets; feature extraction is carried out on the data set through Bi-LSTM, and a Bi-LSTM feature vector is generated; performing feature extraction on the five amino acid feature vectors through a full-connection neural network; and carrying out feature fusion on the feature vectors through a Concatenate algorithm, obtaining a probability score through a full connection layer with a unit 1 and a Sigmoid activation function, and distinguishing the probability score into an anti-cancer peptide and a non-anti-cancer peptide through the score. According to the method, the anticancer peptide prediction with high accuracy, high Marius correlation coefficient, high sensitivity, high specificity and high ROC curve area is realized.

Description

technical field [0001] The invention relates to the technical field of anti-cancer peptide prediction, in particular to an anti-cancer peptide prediction method based on bidirectional long short-term memory network and feature fusion. Background technique [0002] The discovery of anticancer peptide (ACP) has broadened people's horizons of anticancer road, its specificity and tumor cannot develop resistance to it, solved some side effects caused by traditional anticancer therapy, and hopefully become a kind of cancer Anticancer peptides are usually composed of 5-40 amino acids; in order to further understand the mechanism of action of anticancer peptides, there have been many biological experimental methods for the identification of anticancer peptides. For example, Vidal et al. identified peptide cocktails targeting intracellular tumor proteins by the yeast two-hybrid system, and Peelle et al. identified novel targeting peptides that were not cell-type specific by screening...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B15/30G16B40/00G06N3/04G06N3/08
CPCG16B15/30G16B40/00G06N3/08G06N3/048G06N3/044
Inventor 杨森叶晨阳朱轮封红旗
Owner CHANGZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products