Method and system for predicting association between long non-coding RNA and protein

A long-chain non-coding, protein technology, used in proteomics, special data processing applications, instruments, etc., can solve the problems of high cost and long time for the interaction between lncRNA and protein, saving experimental costs and ensuring The effect of accuracy

Inactive Publication Date: 2017-04-26
CENT SOUTH UNIV
View PDF1 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] At present, it is costly and time-consuming to predict the interaction be

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for predicting association between long non-coding RNA and protein
  • Method and system for predicting association between long non-coding RNA and protein
  • Method and system for predicting association between long non-coding RNA and protein

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] This embodiment discloses a method for predicting the connection between long non-coding RNA and protein, such as figure 1 shown, including:

[0032] Step S1, constructing subnetwork 1 of lncRNA-protein interaction, subnetwork 2 of lncRNA-lncRNA interaction and subnetwork 3 of protein-protein interaction.

[0033] Step S2. Combining the first subnet, the second subnet and the third subnet into a global heterogeneous network.

[0034] Step S3, determining at least two walking paths for any lncRNA to establish contact with any protein in the global heterogeneous network; any walking path connects subnetwork one with at least one of subnetwork two and subnetwork three.

[0035] In this step, usually, if the studied walking path is too complicated, the potential connection between the source end and the sink end corresponding to the corresponding walking path is smaller, which is less meaningful to the research, and it also increases The complexity of data processing, the...

Embodiment 2

[0052] Corresponding to the above method embodiments, this embodiment provides a system for predicting the association between long non-coding RNA and protein, including the following first to fifth processing modules. The functional scores for each module are as follows:

[0053] The first processing module is used to construct subnetwork one of lncRNA-protein interconnection, subnetwork two of lncRNA-lncRNA interconnection and subnetwork three of protein-protein interconnection;

[0054] The second processing module is used to combine the first subnet, the second subnet and the third subnet into a global heterogeneous network;

[0055] The third processing module is used to determine at least two walking paths for any lncRNA to establish contact with any protein in the global heterogeneous network; any walking path connects subnet one with at least one of subnet two and subnet three in series;

[0056] The fourth processing module is used to calculate the HeteSim score of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of biological information, and discloses a method and system for predicting the association between long non-coding RNA and protein, used for ensuring the prediction accuracy and reducing the experimental cost. The method comprises the following steps of: constructing a first subnet that lncRNA and protein are mutually associated, a second subnet that lncRNA and lncRNA are mutually associated, and a third subnet that protein and protein are mutually associated; combining the various subnets to construct a global heterogeneous network; determining at least two migration paths that any lncRNA and any protein are associated in the global heterogeneous network; connecting the first subnet with at least one of the second subnet and the third subnet in series by any migration path; calculating the HeteSim fractions of the various migration paths through a HeteSim algorithm; and performing construction and evaluation of a classification model used for predicting the association between non-coding RNA and protein according to the corresponding HeteSim fractions of the various migration paths between an lncRNA source end and a protein host end in the global heterogeneous network.

Description

technical field [0001] The invention relates to the technical field of biological information, in particular to a method and system for predicting the connection between long-chain non-coding RNA and protein. Background technique [0002] In recent years, the research on non-coding RNA has continued to heat up. During the 17 years from 1999 to 2015, research results related to non-coding RNA have been selected as the top ten scientific and technological breakthroughs of the Science magazine for many times. When it came to the Top 10 Scientific Breakthroughs of 2010, the non-coding field was placed first. In 2004, Science titled "Hidden DNA Treasures" and pointed out that there may be a large number of DNA regulatory elements, transcripts and non-coding RNA genes hidden in the so-called "useless DNA" sequences that account for more than 90% of the human genome. In the subsequent ENCODE research project, it was found that among the 3 billion base pairs that make up the human...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/18
CPCG16B20/00
Inventor 邓磊肖云
Owner CENT SOUTH UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products