Metagenome data analysis method based on next-generation sequencing technology

A second-generation sequencing technology and data analysis technology, applied in the field of metagenomic data analysis based on second-generation sequencing technology, can solve the problems of short length of single sequence, existence of pollution, short original sequence of second-generation sequencing, etc. , the analysis results are accurate, the analysis process is reasonable
CN112071366APending Publication Date: 2020-12-11NANKAI UNIV

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
NANKAI UNIV
Publication Date
2020-12-11

Smart Images

  • Figure 1
    Figure 1
Patent Text Reader

Abstract

The invention discloses a metagenome data analysis method based on a next-generation sequencing technology, which comprises the following steps of: (1) carrying out quality control on original sequencing data to obtain clean reads; (2) performing species annotation on the clean reads subjected to the quality control; (3) performing statistical analysis on the sample diversity based on a species abundance matrix; (4) performing statistical analysis on species with significant differences among sample groups based on the species abundance matrix; (5) splicing and assembling the clean reads to obtain a contigs sequence; (6) packaging the contigs obtained by splicing and assembling into boxes to obtain bins; (7) carrying out gene annotation on the bins subjected to boxing; (8) performing statistical analysis on the genes with significant differences among the sample groups based on the gene abundance matrix; and (9) based on the gene annotation result, performing function and species annotation on the sequence. A whole process from metagenome next-generation sequencing data processing to species composition analysis, gene composition analysis and function annotation is provided, an accurate analysis result is provided for researchers, and the metagenomics problem is comprehensively analyzed.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The present invention generally relates to the technical field of next-generation sequencing, and specifically relates to a method for analyzing metagenomic data based on next-generation sequencing technology. Background technique

[0002] In 1977, Frederick Sanger and Walter Gilbert invented the first sequencer and used it to determine the first genome sequence, phage X174, with a total length of 5375 bases. From then on, human beings gained the ability to explore the genetic nature of life , the research of life science has entered the era of genomics. The sequencing method invented by Sanger is called the first-generation sequencing technology. This technology is still widely used until now, but it can only obtain a sequence with a length of 700-1000bp at a time. The throughput is too low, resulting in cost in many cases. Too high, unable to meet the urgent needs of modern scientific development for the acquisition of biological gene sequences. Hi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More