
A method for constructing a circular RNA-RNA binding protein relationship prediction model

A method that builds a prediction model over circular RNA and RNA-binding protein sequences, applied in the field of biological information data mining. It addresses the limited prediction accuracy of existing approaches and achieves improved accuracy.

Active Publication Date: 2022-06-28
SHANGHAI JIAOTONG UNIV
5 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

However, the three methods above share a limitation: they use only circular RNA sequence information, and decide whether a circular RNA binds the target RNA-binding protein by learning specific patterns in the circular RNA sequence. Because binding is realized through the interaction of the two sequences (circular RNA and protein), using circular RNA sequence information alone limits prediction accuracy, and there is still room for improvement.



Examples


Embodiment Construction

[0034] The present invention will be further described in detail below with reference to the accompanying drawings.

[0035] The original circular RNA sequence data set comprises several RNA sub-data sets, each corresponding to one class of RNA-binding protein. Each sub-data set contains several samples, and each sample consists of an RNA sequence and a label.

[0036] According to one or more embodiments, a method for constructing a prediction model for the relationship between circular RNAs and RNA-binding proteins is disclosed. As shown in Figure 1, it includes the following steps:

[0037] S1. Construct circular RNA-RNA-binding protein sequence pairs by combining each circular RNA sequence in the initial circular RNA sequence data set with its corresponding protein sequence, keeping the label of the original sample, so that the reconstructed pairs form the target data set;

[0038] Specifically, the original RNA data set includes N RNA sub-dat...
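Step S1 can be illustrated with a minimal Python sketch. It assumes each sub-data set is keyed by the name of its RNA-binding protein and that a separate lookup table supplies the protein sequences; the function name `build_pair_dataset`, the type aliases, and the toy sequences are all hypothetical and do not come from the patent text.

```python
from typing import Dict, List, Tuple

# Hypothetical types: a sample is (circRNA sequence, label); a pair sample is
# (circRNA sequence, protein sequence, label).
Sample = Tuple[str, int]
PairSample = Tuple[str, str, int]


def build_pair_dataset(
    sub_datasets: Dict[str, List[Sample]],
    protein_sequences: Dict[str, str],
) -> List[PairSample]:
    """Pair every circRNA sample with the sequence of its sub-data set's protein."""
    pairs: List[PairSample] = []
    for protein_name, samples in sub_datasets.items():
        # Raises KeyError if a sub-data set has no matching protein sequence.
        protein_seq = protein_sequences[protein_name]
        for rna_seq, label in samples:
            # The label is carried over unchanged from the original sample.
            pairs.append((rna_seq, protein_seq, label))
    return pairs


# Toy data, invented purely for illustration.
toy_sub_datasets = {
    "RBP_A": [("AUGGCUA", 1), ("CCGUAAG", 0)],
    "RBP_B": [("GGAUCCA", 1)],
}
toy_protein_sequences = {"RBP_A": "MKTAYIAK", "RBP_B": "MSLLTEVE"}

target_dataset = build_pair_dataset(toy_sub_datasets, toy_protein_sequences)
```

Merging all sub-data sets into one list of pairs means a single model can see the protein sequence as an input, rather than one model being tied to one protein.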


Abstract

The invention discloses a method for constructing a circular RNA-RNA binding protein relationship prediction model, which addresses the limited prediction accuracy of prior-art methods that use only circular RNA sequence information. The main points of the technical solution are as follows: the samples of the initial circular RNA sequence data set are reconstructed as circular RNA-RNA binding protein sequence pairs; a word vector dictionary is trained by self-supervised learning; using the trained dictionary, each sample sequence pair is mapped to the corresponding word vector matrices as its representation; these representations are input into a pseudo-Siamese network to obtain encoded feature vectors, which are input into a metric function to compute the predicted binding probability; the difference between the prediction and the label is computed and the model parameters are optimized; and the resulting model is saved once the training iterations are complete. The method can perform data mining on both RNA sequences and protein sequences, and can effectively improve the accuracy of binding prediction for circular RNAs and RNA binding proteins.
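The forward pass described in the abstract can be sketched in plain Python. Everything specific here is an assumption made for illustration, not the patented implementation: the "words" are taken to be overlapping k-mers, each branch is a mean-pool followed by a tanh linear layer, the metric function is cosine similarity rescaled to [0, 1], and the loss is binary cross-entropy. The word vectors and weights are random stand-ins for trained values, and the optimization step is omitted.

```python
import math
import random
from typing import Dict, List

Vector = List[float]


def kmers(seq: str, k: int) -> List[str]:
    """Split a sequence into overlapping k-mers (the assumed 'words')."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def to_matrix(tokens: List[str], dictionary: Dict[str, Vector], dim: int) -> List[Vector]:
    """Map tokens to word vectors; unknown tokens get a zero vector."""
    return [dictionary.get(tok, [0.0] * dim) for tok in tokens]


def encode(matrix: List[Vector], weights: List[Vector]) -> Vector:
    """One branch: mean-pool the word vectors, then apply a tanh linear layer.

    Assumes a non-empty matrix (sequence at least k characters long).
    """
    dim = len(matrix[0])
    pooled = [sum(row[j] for row in matrix) / len(matrix) for j in range(dim)]
    return [math.tanh(sum(w * x for w, x in zip(w_row, pooled))) for w_row in weights]


def binding_probability(a: Vector, b: Vector) -> float:
    """Assumed metric function: cosine similarity rescaled from [-1, 1] to [0, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    cosine = dot / norm if norm > 0 else 0.0
    return (cosine + 1.0) / 2.0


def bce_loss(prob: float, label: int, eps: float = 1e-7) -> float:
    """Binary cross-entropy between the predicted probability and a 0/1 label."""
    p = min(max(prob, eps), 1.0 - eps)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))


rng = random.Random(0)
DIM, HIDDEN = 4, 3


def random_vec(n: int) -> Vector:
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]


# One toy sequence pair with its label (invented for illustration).
rna_seq, protein_seq, label = "AUGGCUA", "MKTAYIAK", 1
rna_tokens, protein_tokens = kmers(rna_seq, 3), kmers(protein_seq, 2)

# Random stand-ins for the trained word vector dictionaries.
rna_dict = {tok: random_vec(DIM) for tok in rna_tokens}
protein_dict = {tok: random_vec(DIM) for tok in protein_tokens}

# Pseudo-Siamese: both branches share a structure but keep separate weights.
rna_weights = [random_vec(DIM) for _ in range(HIDDEN)]
protein_weights = [random_vec(DIM) for _ in range(HIDDEN)]

rna_vec = encode(to_matrix(rna_tokens, rna_dict, DIM), rna_weights)
protein_vec = encode(to_matrix(protein_tokens, protein_dict, DIM), protein_weights)
prob = binding_probability(rna_vec, protein_vec)
loss = bce_loss(prob, label)
```

Separate weights per branch are what distinguish a pseudo-Siamese network from a Siamese one, which fits inputs drawn from different alphabets such as nucleotides and amino acids. In training, `loss` would be minimized over many pairs by an optimizer, which this sketch leaves out.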

Description

Technical field

[0001] The invention relates to biological information data mining technology, and in particular to a method for constructing a prediction model for the relationship between circular RNAs and RNA binding proteins.

Background

[0002] Circular RNA is a special kind of non-coding RNA molecule. Unlike traditional linear RNA, a circular RNA molecule has a closed ring structure, is not affected by RNA exonucleases, and is more stably expressed.

[0003] Recent studies have shown that circular RNAs play an important regulatory role in diseases, and they have become the latest research hotspot in the RNA field. Research on circular RNAs and RNA binding proteins (RBPs) is a mainstream direction: studying the regulatory relationship between circular RNAs and RBPs helps to better understand the function of circular RNAs.

[0004] At present, the emergence of a large number of open source high-throughput sequencing experimental data has e...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G16B30/00, G16B50/30, G16B20/00, G06Q10/04, G06N3/08, G06F40/289, G06F40/242
CPC: G16B30/00, G16B20/00, G16B50/30, G06F40/289, G06F40/242, G06N3/08, G06Q10/04
Inventor: 袁亮亮, 杨旸
Owner: SHANGHAI JIAOTONG UNIV