Gene sequencing data compression and transmission method

A gene sequencing data technology, applied in electrical digital data processing, special data processing applications, instruments, etc. It addresses problems such as difficulty meeting the needs of big data analysis, reduced data transmission efficiency, and large data storage requirements, with the effects of ensuring basic accuracy, high data transmission efficiency, and small storage space.
CN106971090A · Inactive · Publication Date: 2017-07-21 · 首度生物科技(苏州)有限公司 +1

Patent Information

Authority / Receiving Office: CN · China
Patent Type: Application (China)
Current Assignee / Owner: 首度生物科技(苏州)有限公司
Publication Date: 2017-07-21
Estimated Expiration: Not applicable · inactive patent


Abstract

The invention discloses a gene sequencing data compression and transmission method comprising the following steps: A, establishing a standard DNA sequence database; B, deploying the standard DNA sequence database to a data processing device; C, preprocessing the DNA sequencing data by comparing it with the standard DNA database one by one, generating a correspondence, replacing the original text of the DNA sequencing data with the numbers of the standard DNA database, and separately storing the parts of the DNA sequencing data that differ from the standard DNA database; D, performing compression; and E, performing storage or transmission. Because the standard DNA sequence database is stored in the data processing device, much of the information in the DNA sequencing data can be represented by standard DNA database numbers, so the data volume is greatly reduced after preprocessing. Further compression reduces it again, so the DNA sequencing data needs less storage space and is transmitted more efficiently. The method is compatible with the output data of second-generation and even third-generation sequencing technologies.
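Steps C to E can be illustrated with a minimal Python sketch. This is an illustration under stated assumptions, not the patented implementation: the three-entry `STANDARD_DB`, the position-by-position comparison, the `max_diffs` threshold, the JSON record layout, and the use of `zlib` for step D are all hypothetical choices made for the example, since the abstract does not specify them.

```python
import json
import zlib

# Hypothetical standard DNA sequence database (steps A and B):
# database number -> reference sequence. Real databases would be far larger.
STANDARD_DB = {
    0: "ACGTACGTAC",
    1: "TTGACCGTAA",
    2: "GGGCATCGAT",
}


def find_diffs(read, ref):
    """Return (position, base) pairs where the read differs from the reference."""
    return [(i, b) for i, (b, r) in enumerate(zip(read, ref)) if b != r]


def preprocess(reads, db, max_diffs=2):
    """Step C: replace each read with a database number plus its differences.

    Reads that match no entry closely enough are kept as raw text
    (an assumption; the abstract does not describe this case).
    """
    records = []
    for read in reads:
        best = None
        for number, ref in db.items():  # compare one by one
            if len(ref) != len(read):
                continue
            diffs = find_diffs(read, ref)
            if best is None or len(diffs) < len(best[1]):
                best = (number, diffs)
        if best is not None and len(best[1]) <= max_diffs:
            records.append({"n": best[0], "d": best[1]})
        else:
            records.append({"raw": read})
    return records


def compress(records):
    """Step D: serialize the records and compress them (zlib is an assumption)."""
    text = json.dumps(records, separators=(",", ":"))
    return zlib.compress(text.encode("utf-8"))


def restore(blob, db):
    """Receiver side: rebuild the reads using the same deployed database."""
    records = json.loads(zlib.decompress(blob).decode("utf-8"))
    reads = []
    for rec in records:
        if "raw" in rec:
            reads.append(rec["raw"])
            continue
        bases = list(db[rec["n"]])
        for pos, base in rec["d"]:
            bases[pos] = base
        reads.append("".join(bases))
    return reads


reads = ["ACGTACGTAC", "TTGACCGTAT", "GGGCATCGAT", "AAAAAAAAAA"]
records = preprocess(reads, STANDARD_DB)
blob = compress(records)  # step E would store or transmit this blob
assert restore(blob, STANDARD_DB) == reads
```

The sketch only works if sender and receiver hold the same database, which is why step B deploys it to the data processing device before any data is exchanged.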

Description

Technical field

[0001] The invention relates to the technical field of gene detection, in particular to a gene sequencing data compression and transmission method.

Background technique

[0002] With the development of gene sequencing technology and the reduction of sequencing costs, especially the application and popularization of next-generation sequencing (NGS), the output of sequencing data is increasing exponentially, and how to efficiently store and transmit sequencing data has become a major challenge for the industry. Mature DNA sequencing technology began in the 1970s with the chemical degradation and dideoxy chain termination methods, followed by fluorescence, hybridization and other sequencing methods. These are collectively referred to as first-generation DNA sequencing technology, and their output data volume is usually on the order of bp or kb. Around 2005, technologies such as 454 sequencing, Solexa sequencing and SOLiD sequencing appeared successively, a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine