Method for de novo assembly of genome by comprehensively applying third-generation ultra-long reads and second-generation linked reads

A genome assembly and linking technology, applied in the field of hybrid assembly of third-generation and second-generation sequencing data. It addresses the problems of rising sequencing and computing costs, growing volumes of sequencing data, and obstacles to assembly, and achieves low sequencing and computational cost, avoidance of mismatches, and reduced complexity.
CN110858503A · Pending · Publication Date: 2020-03-03 · KUNMING INST OF ZOOLOGY CHINESE ACAD OF SCI

Patent Information

Authority / Receiving Office: CN · China
Patent Type: Applications (China)
Current Assignee / Owner: KUNMING INST OF ZOOLOGY CHINESE ACAD OF SCI
Publication Date: 2020-03-03

Abstract

The invention discloses a method for efficient, high-quality de novo assembly of a genome by jointly applying third-generation ultra-long reads and second-generation linked reads. The third-generation ultra-long reads are those generated by Nanopore and PacBio, currently the most widely used third-generation sequencing technologies. The second-generation linked reads are reads generated by the 10x Genomics sequencing platform. A high-quality genome sequence is assembled with efficient hybrid assembly software. The method combines the respective advantages of third-generation ultra-long reads and second-generation linked reads and uses the efficient assembly software DBG2OLC and SPARC together, so that the application cost of third-generation sequencing technology is greatly reduced. The invention provides an efficient, reliable and economical method for large-scale, high-quality de novo genome assembly using third-generation sequencing technology.
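The flow the abstract describes can be sketched as an ordered plan of stages. This is a minimal illustration, not the patented procedure: the first stage (turning linked reads into contigs) is an assumption based on how DBG2OLC is generally used, the assembler for that stage is not named in this excerpt, and all file names are hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    name: str
    tool: str        # tool named in the text, or a labeled placeholder
    inputs: tuple    # files the stage reads
    outputs: tuple   # files the stage writes


def build_plan(long_reads, linked_reads):
    """Return the hybrid-assembly stages in execution order (illustrative only)."""
    return [
        # Assumption: linked reads are first assembled into contigs.
        Stage("assemble linked reads into contigs",
              "<linked-read assembler, not named in this excerpt>",
              (linked_reads,), ("contigs.fasta",)),
        # DBG2OLC combines short-read contigs with long reads.
        Stage("overlay long reads on contigs to build a backbone",
              "DBG2OLC",
              ("contigs.fasta", long_reads), ("backbone.fasta",)),
        # SPARC produces a consensus sequence from the backbone.
        Stage("polish the backbone into a consensus sequence",
              "SPARC",
              ("backbone.fasta", long_reads), ("assembly.fasta",)),
    ]


def check_plan(plan, available):
    """Confirm each stage's inputs exist before it runs; return all files produced."""
    have = set(available)
    for stage in plan:
        missing = [f for f in stage.inputs if f not in have]
        if missing:
            raise ValueError(f"{stage.name}: missing {missing}")
        have.update(stage.outputs)
    return have


if __name__ == "__main__":
    plan = build_plan("ultralong_reads.fasta", "linked_reads.fastq")
    check_plan(plan, ["ultralong_reads.fasta", "linked_reads.fastq"])
    for number, stage in enumerate(plan, start=1):
        print(f"{number}. {stage.tool}: {stage.name}")
```

Modeling each stage by its inputs and outputs makes the dependency explicit: the long reads are consumed twice, once for the backbone layout and once for the consensus step, while the linked reads enter only through the contigs.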

Description

Technical field

[0001] The invention relates to a method for de novo assembly of genome sequencing data, in particular to a hybrid assembly method for third-generation and second-generation sequencing data. The third-generation sequencing data are mainly ultra-long reads generated by PacBio, Nanopore or other sequencing technologies, and the second-generation sequencing data are mainly linked reads generated by 10x Genomics sequencing. Combined with the efficient assembly software DBG2OLC, the method greatly reduces the cost of sequencing and computation, especially the application cost of third-generation sequencing technology. It provides an efficient and reliable method for large-scale, high-quality de novo genome assembly using third-generation sequencing technology.

Background art

[0002] With the development of sequencing technology, genome sequence information generated by genome de novo assembly is becoming more and more detailed and accurate. The 10X Geno...

Claims
