An Integrative Method for Multi-Class Biological Sequence Annotation

A biological sequence annotation technology, applied to the integration of multi-type biological sequence annotations, that addresses the lack of annotation knowledge, improves data utilization, increases the credibility of the annotation results, and has wide application value.

Active Publication Date: 2021-04-06
TSINGHUA UNIV

AI Technical Summary

Problems solved by technology

3. A large number of sequenced sequences fall in regions without prior functional annotation, so sufficient annotation knowledge for them is lacking.


Examples


Embodiment Construction

[0052] The following will describe the embodiment of the method for integrating multi-type biological sequence annotations according to the present invention with reference to the accompanying drawings. Those skilled in the art would recognize that the described embodiments can be modified in various ways or combinations thereof without departing from the spirit and scope of the invention. Accordingly, the drawings and description are illustrative in nature and not intended to limit the scope of the claims. Also, in this specification, the drawings are not drawn to scale, and like reference numerals denote like parts.

[0053] As shown in Figure 1, the method for integrating multi-type biological sequence annotations in this embodiment includes the following steps:

[0054] 1) Organize data

[0055] To analyze biological sequencing data obtained from the same biological individual by different methods, first analyze the sequencing method of the data and the sequenci...


Abstract

The invention discloses a method for integrating annotations of multiple types of biological sequences, comprising: selecting one set of biological sequencing data as the main biological sequence set and treating the rest as auxiliary biological sequence sets; establishing a sequence-gene association mapping set; obtaining the basic association region and the extended association region of each gene from its transcription start site; for each sequence in the main biological sequence set, traversing the extended association regions of the genes and, if the region where the sequence is located overlaps the extended association region of a gene, establishing a sequence-gene association mapping between that gene and that sequence; using the hypergeometric test and the binomial test to calculate the significance of reference annotation data applied to the biological sequences in the sequence-gene association mapping set; and sorting the annotations obtained by the two tests separately, adding up the rank numbers of identical annotations, and sorting again to obtain the annotation result for the multiple types of biological sequence data. The invention realizes the annotation of various features and has application value in the medical field.
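The steps named in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented implementation: the function names, the flank sizes (5000 bp upstream, 1000 bp downstream, 50000 bp extension), the single-chromosome coordinates, and the way the counts are passed to the two tests are all assumptions, since the visible text does not give these details.

```python
from math import comb

def association_regions(tss, strand, up=5000, down=1000, extension=50000):
    """Basic and extended association regions around a transcription start site.
    The flank sizes are hypothetical defaults, not values from the patent."""
    if strand == "+":
        basic = (max(0, tss - up), tss + down)
    else:
        basic = (max(0, tss - down), tss + up)
    extended = (max(0, basic[0] - extension), basic[1] + extension)
    return basic, extended

def map_sequences_to_genes(sequences, extended_regions):
    """Pair a sequence with a gene when their half-open intervals overlap."""
    mapping = set()
    for seq_id, (s_start, s_end) in sequences.items():
        for gene, (r_start, r_end) in extended_regions.items():
            if s_start < r_end and r_start < s_end:
                mapping.add((seq_id, gene))
    return mapping

def hypergeom_sf(k, total, annotated, drawn):
    """P(X >= k) when `drawn` genes are sampled from `total` genes,
    of which `annotated` carry the annotation."""
    upper = min(annotated, drawn)
    hits = sum(comb(annotated, i) * comb(total - annotated, drawn - i)
               for i in range(k, upper + 1))
    return hits / comb(total, drawn)

def binom_sf(k, n, p):
    """P(X >= k) for n sequences, each landing in the annotation's
    regions with probability p."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

def combine_ranks(hyper_p, binom_p):
    """Rank annotations under each test, add the two rank numbers,
    and sort again (ties broken by annotation name)."""
    def ranks(pvals):
        ordered = sorted(pvals, key=lambda a: (pvals[a], a))
        return {a: i + 1 for i, a in enumerate(ordered)}
    rank_h, rank_b = ranks(hyper_p), ranks(binom_p)
    return sorted(hyper_p, key=lambda a: (rank_h[a] + rank_b[a], a))

if __name__ == "__main__":
    # Toy data: one gene and two sequences on a single chromosome.
    _, ext = association_regions(10000, "+")
    pairs = map_sequences_to_genes(
        {"s1": (100, 200), "s2": (70000, 70100)}, {"G1": ext})
    print(pairs)
    print(combine_ranks({"a": 0.01, "b": 0.05}, {"a": 0.03, "b": 0.01}))
```

Summing rank numbers rather than combining p-values directly means that an annotation must rank well under both the gene-based (hypergeometric) and the sequence-based (binomial) view to end up near the top of the final list.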

Description

Technical field

[0001] The present invention relates to the field of biotechnology, and in particular to a method for integrating multi-type biological sequence annotations.

Background

[0002] Gene sequencing is a genetic testing technology that can analyze and determine the full sequence of genes from blood or saliva. With the development of biomedical technology, techniques in the field of precision medicine that analyze patients through sequencing and other methods to achieve precise treatment have become increasingly mature. However, the many sequencing methods on the market lack a unified standard, and their target sites and data distribution characteristics differ greatly, which restricts the further development of precise diagnosis. In response to this problem, it is an effective solution to try to integrate multiple types of data and annotate the functions and characteristics of various sequenced...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G16B30/10
CPC: G16B30/10
Inventor: 江瑞, 宋绍铭
Owner: TSINGHUA UNIV