Method for predicting N6-methyladenosine modification site in RNA based on stacking integration

A technology of methyladenosine and prediction methods, applied in the field of systems biology, can solve problems such as single prediction model, single species, and limitations of traditional classifiers, and achieve good prediction performance

Active Publication Date: 2020-05-15
QINGDAO UNIV OF SCI & TECH
View PDF4 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005]Although researchers have made great contributions to the modification of RNA methylation and proposed many models to predict the m6A site, but Predictive models are mostly based on a single traditional classifier or restricted to a single species

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for predicting N6-methyladenosine modification site in RNA based on stacking integration
  • Method for predicting N6-methyladenosine modification site in RNA based on stacking integration
  • Method for predicting N6-methyladenosine modification site in RNA based on stacking integration

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to make the object, technical solution and advantages of the present invention clearer, the invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0034] A stacking-based integration of m in different species of RNA 6 A site prediction methods, such as figure 1 shown, including the following steps:

[0035] 1) Collect N in RNA 6 -Methyladenosine modification (m 6 A) Site information: Get the N of RNA from 3 different species 6 -Methyladenosine modification site data set, including the RNA sample sequence of the positive and negative data set and the corresponding category label.

[0036] 1-1) The real data of three species of S.cerevisiae, H.sapiens and A.thaliana are selected; the data set S.cerevisiae is used as a training set, and the data se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for predicting an N6-methyladenosine modification site in RNA based on stacking integration, belonging to the field of systems biology. The method comprises the following steps: extracting RNA sequence features of three species, namely saccharomyces cerevisiae, homo sapiens and arabidopsis thaliana through six feature extraction methods, and conducting feature fusion to obtain an initial feature space of an original data set; performing dimensionality reduction on the initial feature space by using an elastic network, eliminating redundant and noise features, and reserving important features related to model classification so as to obtain an optimal feature set; inputting optimal feature subsets and corresponding category labels into stacking integration for model training, and evaluating the prediction performance of a model in combination with evaluation indexes to obtain a prediction model; and inputting a to-be-predicted RNA sequence in a test set into the prediction model, predicting the m6A site and outputting the m6A site. The prediction accuracy of the model on the test set reaches 92.30% and 87.06% respectively, and the model has good development potential in the aspect of cross-species prediction and is expected to become a useful tool for identifying the m6A site.

Description

technical field [0001] The invention belongs to the field of systems biology, and relates to N in RNA based on stacking integration 6 - Methyladenosine modification site prediction method. Background technique [0002] The completion of the Human Genome Project has greatly promoted people's understanding of genetic organization information, transmission and expression rules, and at the same time made us aware of the extraordinary complexity of the expression mechanism of genetic information in cells. As the key link of the central dogma, RNA closely connects the genetic material DNA and the executor protein of life activities. Studies have shown that there are more than 100 chemical modifications in RNA, and these chemical modifications are mainly methylated, including: 5-methylcytosine (m 5 C), N 1 -Methyladenine (m 1 A), 5-hydroxymethylcytosine (hm 5 C), pseudouracil (ψ), N 6 -Methyladenine (m 6 A) etc. N discovered in the 1970s 6 -Methyladenosine (m 6 A) is a co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B15/30G16B20/30G16B30/10G16B40/00
CPCG16B20/30G16B30/10G16B40/00G16B15/30
Inventor 于彬禹昭敏王磊陈瑞欣
Owner QINGDAO UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products