RNA sequence coding potential prediction method and system based on data enhancement

A data-enhancement-based method for predicting the coding potential of RNA sequences, applied in the field of bioinformatics. It addresses the insufficient accuracy of existing coding potential prediction and improves prediction performance.

Pending Publication Date: 2021-04-16
SOUTH CENTRAL UNIVERSITY FOR NATIONALITIES

AI Technical Summary

Problems solved by technology

[0005] In the course of realizing the present invention, the inventors found at least the following problem in the prior art: the accuracy of current methods for predicting the coding potential of RNA sequences containing sORFs (small open reading frames) needs further improvement.

Detailed description of embodiments

[0036] Reference will now be made in detail to specific embodiments of the invention, examples of which are illustrated in the accompanying drawings. While the invention will be described in conjunction with specific embodiments, it will be understood that it is not intended to limit the invention to the described embodiments. On the contrary, it is intended to cover alterations, modifications and equivalents as included within the spirit and scope of the invention as defined by the appended claims. It should be noted that the method steps described here can all be realized by any functional block or functional arrangement, and any functional block or functional arrangement can be realized as a physical entity or a logical entity, or a combination of both.

[0037] In order to enable those skilled in the art to better understand the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embo...

Abstract

The invention discloses a method and system for predicting RNA sequence coding potential based on data enhancement, and relates to the field of bioinformatics. The method comprises the following steps: calculating the sequence features of the training samples; performing double-end data enhancement in the feature space of the training samples to obtain enhanced sample features for training a machine learning model; and applying the trained machine learning model to the prediction of RNA sequence coding potential. The method can significantly improve the accuracy of coding potential prediction for human RNA sequences containing sORFs.
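The three steps in the abstract (feature calculation, enhancement in feature space, model training and prediction) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the text available here does not define the sequence features, the "double-end" enhancement, or the model. The sketch uses 3-mer frequencies as features, reads "double-end" as augmenting both classes by interpolating between same-class samples, and uses a nearest-centroid classifier as a stand-in model. The function names `kmer_features`, `augment`, `train`, and `predict` are hypothetical.

```python
import random
from itertools import product


def kmer_features(seq, k=3):
    """Return normalized k-mer frequencies of an RNA sequence (T is read as U)."""
    seq = seq.upper().replace("T", "U")
    kmers = ["".join(p) for p in product("ACGU", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows containing ambiguous bases such as N
            counts[window] += 1
    total = sum(counts.values()) or 1  # avoid division by zero for short input
    return [counts[km] / total for km in kmers]


def augment(features, n_new, rng):
    """Assumed augmentation: interpolate random pairs of same-class vectors."""
    synthetic = []
    for _ in range(n_new):
        a, b = rng.sample(features, 2)  # needs at least two samples per class
        t = rng.random()
        synthetic.append([x + t * (y - x) for x, y in zip(a, b)])
    return synthetic


def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def train(coding, noncoding, n_new=10, seed=0):
    """Augment BOTH classes in feature space, then store one centroid per class."""
    rng = random.Random(seed)
    model = {}
    for label, seqs in (("coding", coding), ("noncoding", noncoding)):
        feats = [kmer_features(s) for s in seqs]
        feats += augment(feats, n_new, rng)
        model[label] = centroid(feats)
    return model


def predict(model, seq):
    """Return the label whose centroid is nearest in squared Euclidean distance."""
    x = kmer_features(seq)

    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))

    return min(model, key=lambda label: sq_dist(model[label]))


if __name__ == "__main__":
    # Toy sequences, invented for the example only.
    model = train(
        coding=["AUGAUGAUGAUG", "AUGAUGAUGAUGAUG"],
        noncoding=["CCCCCCCCCCCC", "CCCCCCCCCCCCCCC"],
    )
    print(predict(model, "AUGAUGAUG"))
```

Augmenting in feature space rather than on the raw sequences means each synthetic sample is a feature vector with no corresponding RNA sequence. In a real setting the classifier and features would be replaced by whatever the full specification prescribes.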

Description

Technical field

[0001] The present invention relates to the field of bioinformatics, and in particular to a method and system for predicting RNA (ribonucleic acid) sequence coding potential based on data enhancement.

Background

[0002] High-throughput sequencing technology has produced a large number of transcripts. These are the products of DNA (deoxyribonucleic acid) transcription and include both coding RNA and non-coding RNA (ncRNA). ncRNA is RNA that does not code for protein and was once thought to be irrelevant to gene expression. It was later recognized that ncRNAs play key roles in the regulation of gene expression and in disease pathogenesis. Estimating the coding potential of transcripts, i.e. distinguishing coding RNAs from ncRNAs, is crucial for downstream biological functional analysis.

[0003] Researchers at home and abroad have proposed many computational methods for the prediction of RNA sequence...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G16B25/00, G16B40/00, G06K9/62
Inventor: 谌先敢, 阳小飞, 章文, 李臣鸿, 陈素
Owner: SOUTH CENTRAL UNIVERSITY FOR NATIONALITIES