RNA secondary structure prediction method based on recurrent neural network

A recursive neural network and secondary structure technology, applied in the field of biological research, can solve the problems of increased prediction time and cost, difficult to obtain crystals, and long time-consuming secondary structure, so as to improve the prediction accuracy.

Pending Publication Date: 2019-07-26
ZHEJIANG UNIVERSITY OF SCIENCE AND TECHNOLOGY
View PDF1 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the RNA molecule itself has the characteristics of difficult to obtain crystals and fast molecular degradation, so it is relatively time-consuming and costly to predict its secondary structure by experimental physics and chemical experiments.
The use of computers based on comparative sequence analysis and minimum free energy methods to predict these spatial structures has improved the efficiency ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • RNA secondary structure prediction method based on recurrent neural network
  • RNA secondary structure prediction method based on recurrent neural network
  • RNA secondary structure prediction method based on recurrent neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0035] Embodiment 1: RNA secondary structure prediction method based on recursive neural network, carry out according to the following steps: download and obtain PDB data set from PDB database official website, such as figure 1Shown is a partial example of one of the RNA information '2JTP.pdb' in the downloaded PDB data. The PDB database contains three parts: RNA sequence information, RNA primary sequence and three-dimensional space coordinates. It can be seen from the figure The primary sequence of RNA is recorded in 'SEQRES'. Firstly, data preprocessing is performed on the PDB data set, and the primary sequence is extracted by means of regular expressions. Some of the data are except A, C, G, U For other characters, these characters need to be cleaned to obtain the correct RNA primary sequence. The modeling method of the machine learning model selects the SVM package in the scikit-learn package in the python extension library for direct modeling, and selects the Gaussian ker...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an RNA secondary structure prediction method based on a recurrent neural network. The RNA secondary structure prediction method includes the steps: carrying out data preprocessing on an RNA primary sequence data set in a PDB data set; dividing the RNA primary sequence into a long sequence, a medium sequence and a long sequence according to the length; vectorizing the sequence information to obtain feature information expressed in a matrix form, and filling the feature information of the sequence samples unsatisfying the standard by taking the longest sequence information of the long sequence, the medium sequence and the short sequence as the standard to obtain a feature matrix with a fixed dimension; and inputting the feature matrix into an LSTM model established based on a recurrent neural network, and performing RNA secondary structure prediction by using the LSTM model. The RNA secondary structure prediction method based on a recurrent neural network can predict the RNA secondary structure, and the prediction result is relatively accurate, and the implicit features of the RNA sequence can be further mined, and the more accurate RNA secondary structure canbe predicted.

Description

technical field [0001] The invention relates to the field of biological research, in particular to a method for predicting RNA secondary structures based on a recurrent neural network. Background technique [0002] Ribonucleotide molecule RNA, as a macromolecule in organisms, is an important substance existing in organisms. It not only cooperates with deoxyribonucleotide molecules DNA and proteins to maintain the activities of organisms, but also in Plays an important role in DNA and protein synthesis. Studies have found that the study of RNA structure can help us understand the function of RNA molecules more comprehensively, which is beneficial for biological researchers to explore the relationship between RNA, DNA and proteins, so as to understand the function of organisms and understand and treat diseases. [0003] The molecular structure of RNA consists of three parts: primary sequence, secondary structure, and tertiary space structure. The tertiary space structure of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B15/00
CPCG16B15/00
Inventor 孙婷婷苏静杰
Owner ZHEJIANG UNIVERSITY OF SCIENCE AND TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products