Metagenome data analysis method based on next-generation sequencing technology

A second-generation sequencing technology and data analysis technology, applied in the field of metagenomic data analysis based on second-generation sequencing technology, can solve the problems of short length of single sequence, existence of pollution, short original sequence of second-generation sequencing, etc. , the analysis results are accurate, the analysis process is reasonable

Pending Publication Date: 2020-12-11
NANKAI UNIV
View PDF0 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the sequencing throughput of the second-generation sequencing technology has been greatly increased, the length of a single sequence obtained by it is too short, only 200-500bp. To obtain accurate gene sequence information, it depends on high sequencing coverage and accurate splicing technology
[0004] Due to the limited principle of sequencing technology, the ori

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Metagenome data analysis method based on next-generation sequencing technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0054] see figure 1 , the present invention provides a technical solution: the present invention provides the following technical solution: a metagenomic data analysis method based on next-generation sequencing technology, comprising the following steps:

[0055] 1) Perform quality control on the original sequencing data to obtain clean reads;

[0056] 2) Perform species annotation on the sequence clean reads that have undergone quality control;

[0057] 3) ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a metagenome data analysis method based on a next-generation sequencing technology, which comprises the following steps of: (1) carrying out quality control on original sequencing data to obtain clean reads; (2) performing species annotation on the clean reads subjected to the quality control; (3) performing statistical analysis on the sample diversity based on a species abundance matrix; (4) performing statistical analysis on species with significant differences among sample groups based on the species abundance matrix; (5) splicing and assembling the clean reads to obtain a contigs sequence; (6) packaging the contigs obtained by splicing and assembling into boxes to obtain bins; (7) carrying out gene annotation on the bins subjected to boxing; (8) performing statistical analysis on the genes with significant differences among the sample groups based on the gene abundance matrix; and (9) based on the gene annotation result, performing function and species annotation on the sequence. A whole process from metagenome next-generation sequencing data processing to species composition analysis, gene composition analysis and function annotation is provided, an accurate analysis result is provided for researchers, and the metagenomics problem is comprehensively analyzed.

Description

technical field [0001] The present invention generally relates to the technical field of next-generation sequencing, and specifically relates to a method for analyzing metagenomic data based on next-generation sequencing technology. Background technique [0002] In 1977, Frederick Sanger and Walter Gilbert invented the first sequencer and used it to determine the first genome sequence, phage X174, with a total length of 5375 bases. From then on, human beings gained the ability to explore the genetic nature of life , the research of life science has entered the era of genomics. The sequencing method invented by Sanger is called the first-generation sequencing technology. This technology is still widely used until now, but it can only obtain a sequence with a length of 700-1000bp at a time. The throughput is too low, resulting in cost in many cases. Too high, unable to meet the urgent needs of modern scientific development for the acquisition of biological gene sequences. Hi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B25/10G06K9/62
CPCG16B25/10G06F18/2135
Inventor 刘健田妹陈娇
Owner NANKAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products