CircRNA function prediction method based on scoring mechanism and LightGBM

A prediction method based on a scoring mechanism, applied in genomics, sequence analysis, instruments, etc., that addresses the difficult, time-consuming, and laborious functional identification of circRNA.

Pending Publication Date: 2021-03-19
SUN YAT SEN UNIV

AI Technical Summary

Problems solved by technology

CircRNA has become an important potential biomarker in recent years, but identifying its function is a cumbersome task. Traditional approaches test the functions of each new circRNA experimentally, one by one, guided by the known functions of circRNAs. This is time-consuming and laborious, and it makes functional identification of large numbers of circRNAs very difficult.
At present, there is no method that predicts the function of a circRNA in advance, so that one of its candidate functions can be tested in a targeted manner and its specific role in clinical medicine then analyzed.


Examples

Embodiment Construction

[0021] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described below in conjunction with the embodiments and accompanying drawings.

[0022] Referring to Figure 1, the flowchart of this embodiment's circRNA function prediction method based on the scoring mechanism and LightGBM, the main steps of the technical solution adopted by the present invention are:

[0023] S1. Import the circRNAs of a large data sample as a (.bed) file, which contains the chromosome number, the sequence start site, and the positive/negative strand marker.

[0024] S2. Map the circRNA (.bed) file to the whole human genome (hg19 version) according to the start site and related information, and obtain the specific circRNA sequence information as a (.fasta) file.

[0025] S3. A feature fusion algorithm is proposed, which uses the specific function expressed by the circRNA as a feature label, and extrac...
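Steps S1 and S2 above can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes a tab-separated BED6 layout (strand in the sixth column), treats each circRNA as one contiguous genomic interval with no splicing, and uses a small in-memory dictionary in place of the hg19 genome. The names `BedRecord`, `parse_bed`, `extract_sequence`, and `to_fasta` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass(frozen=True)
class BedRecord:
    chrom: str   # chromosome name, e.g. "chr1"
    start: int   # 0-based start position (BED convention)
    end: int     # end position, exclusive (BED convention)
    name: str    # record identifier (BED column 4)
    strand: str  # "+" or "-" (BED column 6)


def parse_bed(lines: Iterable[str]) -> List[BedRecord]:
    """Parse BED6 lines; assumes tab-separated columns (an assumption)."""
    records = []
    for line in lines:
        line = line.strip()
        # Skip blank lines and common BED header/comment lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split("\t")
        if len(fields) < 6:
            raise ValueError(f"expected at least 6 columns, got {len(fields)}")
        strand = fields[5]
        if strand not in ("+", "-"):
            raise ValueError(f"unexpected strand marker: {strand!r}")
        records.append(
            BedRecord(fields[0], int(fields[1]), int(fields[2]), fields[3], strand)
        )
    return records


_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_sequence(genome: Dict[str, str], rec: BedRecord) -> str:
    """Slice the interval from the genome; reverse-complement on the minus strand."""
    seq = genome[rec.chrom][rec.start:rec.end]
    return reverse_complement(seq) if rec.strand == "-" else seq


def to_fasta(name: str, seq: str, width: int = 60) -> str:
    """Format one FASTA entry, wrapping the sequence at `width` characters."""
    body = "\n".join(seq[i:i + width] for i in range(0, len(seq), width))
    return f">{name}\n{body}\n"


if __name__ == "__main__":
    toy_genome = {"chr1": "AACCGTTT"}  # stand-in for hg19, illustration only
    for rec in parse_bed(["chr1\t2\t6\tcircA\t0\t-"]):
        print(to_fasta(rec.name, extract_sequence(toy_genome, rec)), end="")
```

In practice the genome would be read from an indexed hg19 FASTA file rather than held in a dictionary; the interval and strand handling stays the same.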

Abstract

In order to overcome the defects in the prior art, the invention aims to predict the function of circRNA by combining a scoring mechanism with the LightGBM method. The technical scheme adopted by the invention mainly comprises the following steps: (1) inputting the circRNAs of a big data sample in the form of a (.bed) file; (2) mapping the circRNA (.bed) file to the whole human genome (hg19 version) to obtain a circRNA sequence information (.fasta) file; (3) proposing a feature fusion algorithm and fusing the circRNA features; (4) inputting the features into a class A decision system to distinguish the protein-coding circRNAs; (5) passing the remaining circRNAs through three models respectively, which judge the various functions of the circRNAs according to sequence and yield prediction probability values; and (6) according to the scoring mechanism, passing the three prediction probability values through a class B decision system to obtain the final circRNA function classification prediction result.
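The abstract does not spell out the scoring rule of the final step, so the following is only a sketch of one plausible shape for it, under stated assumptions: each of the three models contributes one probability in [0, 1], and a function is reported when its probability reaches a fixed threshold. The function name `score_functions`, the label strings, and the 0.5 threshold are all hypothetical and are not taken from the patent. In a real pipeline the probabilities might come from three trained LightGBM binary classifiers.

```python
from typing import Dict, List


def score_functions(probs: Dict[str, float], threshold: float = 0.5) -> List[str]:
    """Return the function labels whose probability reaches the threshold,
    ordered from most to least probable.

    `probs` maps a hypothetical function label to the probability produced
    by the corresponding model. The thresholding rule is an assumption.
    """
    for label, p in probs.items():
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"probability for {label!r} out of range: {p}")
    passed = [label for label, p in probs.items() if p >= threshold]
    # Sort by descending probability so the strongest prediction comes first.
    return sorted(passed, key=lambda label: probs[label], reverse=True)


if __name__ == "__main__":
    # Hypothetical labels and made-up probabilities, for illustration only.
    example = {"model_1": 0.9, "model_2": 0.2, "model_3": 0.6}
    print(score_functions(example))
```

An empty result means no model was confident enough under this assumed rule; how the patent's class B decision system treats that case is not stated in the abstract.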

Description

Technical field

[0001] The invention relates to the technical field of bioinformatics, in particular to the field of circRNA function prediction.

Background technique

[0002] CircRNAs have multiple functions in biology: they are rich in miRNA binding sites and act as miRNA sponges in cells; they regulate protein activity by binding to proteins; and some circRNAs can even be translated into proteins. CircRNA has therefore become an important potential biomarker in recent years, but identifying its function is a cumbersome task. Traditional approaches test the functions of each new circRNA experimentally, one by one, guided by the known functions of circRNAs. This is time-consuming and laborious, and it makes functional identification of a large number of circRNAs very difficult. At present, there is no method to predict the function of circRNA in advance, so as to test one of its functions in a targeted manner, and then analyze its specific role in clin...

Application Information

IPC(8): G16B20/30, G16B30/00, G16B40/00
CPC: G16B20/30, G16B30/00, G16B40/00
Inventor: 邓怡云, 王高平, 戴宪华
Owner: SUN YAT SEN UNIV