Multi-scale CNN-BiLSTM non-coding RNA interaction relationship prediction method with introducing attention

A prediction method and attention technology, applied in the fields of bioinformatics and deep learning, can solve problems such as large feature matrix, increased memory and calculation burden, and loss of effective information

Active Publication Date: 2020-06-26
DALIAN UNIV OF TECH
View PDF2 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

One-hot encoding will cause dimension disaster and longer sequences will make the feature matrix too large, increasing the burden of memory and calculation; label encoding is only digitally encoded according to the number of nucleotides in the sequence, and only considers Single nucleotide; complementary encoding also only considers

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-scale CNN-BiLSTM non-coding RNA interaction relationship prediction method with introducing attention
  • Multi-scale CNN-BiLSTM non-coding RNA interaction relationship prediction method with introducing attention
  • Multi-scale CNN-BiLSTM non-coding RNA interaction relationship prediction method with introducing attention

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] The specific implementation manners of the present invention will be further described below in conjunction with the accompanying drawings and technical solutions.

[0068] Such as figure 2 As shown, the overall design idea of ​​the present invention is: because the traditional coding method does not consider the dependent relationship between adjacent nucleotides, in fact, there is a certain dependent relationship between adjacent nucleotides, therefore, the gene The sequence is encoded by k-mers, which preserves the dependent relationship between adjacent nucleotides and avoids the loss of potential effective information; then uses the embedding layer to map the encoded sequence into a matrix vector for convolution operations, and uses multi-scale convolution and Multi-pool operation replaces single-scale convolution and single-pool operation, so that features of different topic lengths can be extracted, and feature diversity can be enriched to achieve the purpose of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-scale CNN-BiLSTM non-coding RNA interaction relationship prediction method with introducing attention, and belongs to the field of bioinformatics and deep learning. Themethod comprises the following steps: (1) proposing a coding mode k-mers suitable for a gene sequence; (2) using a multi-scale convolution kernel for replacing a single-scale convolution kernel, so that topic features with different lengths between sequences are captured, the feature diversity is enriched, and the model prediction performance is improved; carrying out down-sampling on each convolved feature map by using a plurality of pooling windows with different scales, so as to avoid ignoring potential effective information; (3) fusing a BiLSTM model on the basis of the CNN, so that long-distance information dependence between sequences can be better processed, and feature information is fully learned; and (4) introducing an attention mechanism, and distributing different weights to different words in the text vector by using the attention mechanism to distinguish the importance of the information, so that the attention mechanism pays more attention to the key information, and thepurpose of enhancing learning is achieved.

Description

technical field [0001] The invention belongs to the fields of bioinformatics and deep learning, and relates to a multi-scale CNN-BiLSTM non-coding RNA interaction relationship prediction method that introduces attention, including the design of k-mers coding and the construction of a deep learning prediction model. Background technique [0002] Since deep learning was proposed, its automatic learning features and good learning ability have made it widely used in many fields; for example, convolutional neural network CNN, recurrent neural network RNN ​​and bidirectional long short-term memory neural network BiLSTM have been used In genome research, it solves the problems of motif identification, gene expression inference and interaction relationship prediction. MicroRNA (miRNA) and long non-coding RNA (lncRNA) play an important role in regulating biological life activities, and they play an important role in regulating cell growth, differentiation and proliferation. Therefor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B40/20G06N3/04G06N3/08
CPCG16B40/20G06N3/084G06N3/045G06N3/044
Inventor 孟军石文浩
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products