Method and system for quickly comparing whole genome annotation intervals

A whole-genome, annotation technology, applied in bioinformatics, instruments, etc., can solve the problems of processing speed limit of reading and writing, complicated operation, long time-consuming, etc., and achieve the effect of small calculation, simple logic and accurate judgment
CN113362901AActive Publication Date: 2021-09-07HAINAN UNIVERSITY

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
HAINAN UNIVERSITY
Publication Date
2021-09-07

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention provides a method and a system for quickly comparing whole genome annotation intervals. The method comprises the following steps: S1, respectively obtaining a gene annotation interval A fragment coordinate set and a gene annotation interval B fragment coordinate set; s2, combining the coordinates of the fragment set A and the coordinates of the fragment set B to obtain a combined fragment set; s3, performing redundancy elimination operation and sorting operation on the combined fragment set; s4, calling a preset interval intersection function to calculate the intersection of all the annotation intervals in the merged fragment set in a single cycle; and S5, outputting a calculation result of the preset interval intersection function. According to the method, the intersection of different types of gene annotation intervals can be rapidly calculated, and compared with a traditional comparison method, the method is simple in implementation logic, small in calculation amount, accurate in judgment and beneficial to improvement of comparison efficiency.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of genome annotation, in particular to a method for rapidly comparing annotation intervals of the whole genome. Background technique

[0002] At present, in the field of pests and diseases or rapid detection, it is often necessary to calculate the relationship between a large number of different annotation fragments in the whole genome. Since the number of genomes of organisms is usually very large, the number of annotation fragments involved is huge. Traditional calculation methods mainly include two , one is to establish a database index and compare it through a specific data structure. For example, in the paper "Building a Populus trichocarpa Gene Database and Realization of a Visualization Platform", it is disclosed that an index is established in a database visualization platform to search for genome fragments. The operation of the first method is relatively complicated, and for very large-scale data, i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More