Next-generation sequencing short sequence rapid alignment analysis method and device

A second-generation sequencing analysis method, applied in the field of bioinformatics, that addresses the low alignment efficiency and high memory usage of sequencing data, thereby shortening alignment time, increasing alignment speed and improving resource efficiency.
CN106295250B · Active · Publication Date: 2019-03-29 · 北京普康瑞仁医学检验所有限公司

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
北京普康瑞仁医学检验所有限公司
Publication Date
2019-03-29


Abstract

The invention discloses a method and a device for rapid alignment and analysis of short sequences from second-generation sequencing, which address the low alignment efficiency and high memory usage of sequencing data. The method comprises the following steps: obtaining the DNA (deoxyribonucleic acid) short sequences produced by sequencing, and mapping and encoding each DNA short sequence with a first hash algorithm and a second hash algorithm to obtain a first index and a second index, respectively; aligning the DNA short sequence against a reference genome according to a preset index query library, the first index and the second index, wherein the index query library consists of an array of unit structures, each unit structure comprises a value and an index 2, the array offset of each unit structure serves as its corresponding index 1, namely the index value into the structure array, and K is the length of the segment sequence (K-mer); and, when the alignment result is correct, obtaining the value of the K-mer segment matched to the DNA short sequence and determining from it the chromosome number of the DNA short sequence and its site on that chromosome.
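The lookup structure described in the abstract can be illustrated with a minimal sketch. The abstract does not state the two hash algorithms, the value of K, the table size, or how collisions are resolved, so everything below is an assumption made for illustration: `hash1` and `hash2` are simple 2-bit packings, `K` and `TABLE_SIZE` are arbitrary, collisions are resolved by linear probing, only the first occurrence of a repeated K-mer is kept, and all function names are hypothetical. It is not the patented implementation.

```python
from typing import Dict, List, NamedTuple, Optional, Tuple

K = 8                  # assumed K-mer length (illustrative value only)
TABLE_SIZE = 1 << 12   # assumed number of unit structures in the array
_CODE = {"A": 0, "C": 1, "G": 2, "T": 3}


class Unit(NamedTuple):
    """One unit structure: index 1 is implicit (its array offset)."""
    index2: int              # second index, used to verify a candidate hit
    value: Tuple[str, int]   # (chromosome, position) of the K-mer


def hash1(kmer: str) -> int:
    """Assumed first hash: 2-bit packing reduced to an array offset (index 1)."""
    h = 0
    for base in kmer:
        h = (h << 2) | _CODE[base]
    return h % TABLE_SIZE


def hash2(kmer: str) -> int:
    """Assumed second hash: 2-bit packing of the reversed K-mer (index 2)."""
    h = 0
    for base in reversed(kmer):
        h = (h << 2) | _CODE[base]
    return h


def _valid(kmer: str) -> bool:
    return len(kmer) == K and all(b in _CODE for b in kmer)


def build_index(reference: Dict[str, str]) -> List[Optional[Unit]]:
    """Build the index query library from {chromosome: sequence}."""
    table: List[Optional[Unit]] = [None] * TABLE_SIZE
    for chrom, seq in reference.items():
        for pos in range(len(seq) - K + 1):
            kmer = seq[pos:pos + K]
            if not _valid(kmer):
                continue
            slot, fp = hash1(kmer), hash2(kmer)
            for _ in range(TABLE_SIZE):          # linear probing (assumed)
                unit = table[slot]
                if unit is None:
                    table[slot] = Unit(fp, (chrom, pos))
                    break
                if unit.index2 == fp:            # repeated K-mer: keep first
                    break
                slot = (slot + 1) % TABLE_SIZE
    return table


def lookup(table: List[Optional[Unit]], kmer: str) -> Optional[Tuple[str, int]]:
    """Return the stored value when both indexes agree, else None."""
    if not _valid(kmer):
        return None
    slot, fp = hash1(kmer), hash2(kmer)
    for _ in range(TABLE_SIZE):
        unit = table[slot]
        if unit is None:
            return None
        if unit.index2 == fp:
            return unit.value
        slot = (slot + 1) % TABLE_SIZE
    return None


def align_read(table: List[Optional[Unit]], read: str) -> Optional[Tuple[str, int]]:
    """Return (chromosome, read start) from the first K-mer of the read that hits."""
    for offset in range(len(read) - K + 1):
        hit = lookup(table, read[offset:offset + K])
        if hit is not None:
            chrom, pos = hit
            return chrom, pos - offset
    return None
```

Under these assumptions, `build_index` is called once on the reference and `align_read` is then called for each short read. Storing only index 2 rather than the K-mer itself keeps each unit small, which is one plausible reading of the memory saving the abstract claims.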

Description

Technical Field

[0001] The invention belongs to the field of biological information engineering and relates to bioinformatics and computer application technology, in particular to a rapid short-sequence alignment and analysis method for second-generation sequencing of DNA sequences.

Background Art

[0002] DNA sequencing plays the most fundamental and far-reaching role in deciphering the genetic code of living species. DNA sequencing technology was reported as early as the discovery of the DNA double helix, but the procedure was too complicated. Later, in 1977, Sanger invented the chain-termination sequencing method, which was a milestone. With the development of bioinformatics, however, the Sanger sequencing method can no longer meet the needs of research, so second-generation sequencing technology, with lower cost, higher throughput and faster speed, has emerged. Its core idea is to synthes...
