Automatic method for analyzing constitution and diversity of bacterial community of 16SrRNA gene

A diversity analysis and community technology, applied in the field of molecular biology, can solve problems such as unsatisfactory analysis needs of researchers, data leveling processing, and uneven sequencing depth, so as to reduce workload, eliminate analysis errors, and comprehensive analysis results Effect

Active Publication Date: 2017-06-09
SHANGHAI PASSION BIOTECHNOLOGY CO LTD
View PDF3 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In some cases, the analytical needs of researchers cannot be met
In addition, the original analysis process does not flatten the data when performing subsequent comparative analysis such as PCA and PCoA, which will introduce analysis errors caused by different sequencing depths

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic method for analyzing constitution and diversity of bacterial community of 16SrRNA gene

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In a specific embodiment, the method is as figure 1 The following steps are shown:

[0055] Step 1: For the original paired-end sequencing data of the Illumina Miseq platform, execute the MiSeqQuality16S.pl script with the original off-machine data as input data, the window size is 10bp, and the step size is 1bp, starting from the first base position at the 5' end, The average quality of the bases in the window is required to be ≥ Q20 (that is, the average sequencing accuracy of the bases is ≥ 99%), and the sequence is truncated from the first window whose average quality value is lower than Q20, and the length of the truncated sequence is required to be ≥ 150bp, and no Ambiguity base N is allowed. Then, use the FLASH software to pair and join the double-ended sequences that have passed the quality screening according to the overlapping bases: the overlapping base length of the two sequences of Read 1 and Read 2 is required to be ≥ 10 bp, and base mismatching is not al...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an automatic method for analyzing constitution and diversity of a bacterial community of a 16SrRNA gene. The automatic method comprises the following steps: in 16SrRNA sequencing data analysis process, by taking original sequencing sequence data as input, calling a standard analysis tool (such as Mothur and QIIME) of the industry, and performing visualization on data finally, thereby obtaining analysis results which are easy to analyze. The automatic method includes popular main analysis items at present, meanwhile analysis contents can be modularized, relatively rich and deep data mining analysis methods can be available, different analysis module contents can be combined according to different demands, and process procedures can be relatively reasonably arranged; and in addition, analysis errors caused by different sequencing depths can be eliminated, and the analysis results can be relatively comprehensive, accurate and reliable.

Description

[0001] Technical field: [0002] The present invention generally relates to the technical field of molecular biology, in particular to the technical field of high-throughput sequencing data analysis, and more specifically, relates to an automated method for bacterial community composition and diversity analysis of 16S rRNA genes. [0003] Background technique: [0004] The new generation of high-throughput sequencing technology has greatly reduced the time and cost of sequencing, making large-scale sequencing gradually become a routine research and detection method, and the amount of data generated by sequencing has increased dramatically. How to efficiently analyze these data has become an urgent problem to be solved. [0005] At present, there are many high-throughput sequencing data analysis tools, and the bioinformatics tools for analyzing sequence information are complex. For large-scale sequencing data analyzing the microecology of flora, a variety of mature analysis tool...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/22
CPCG16B30/00
Inventor 薛正晟寇文伯王慧娟姜丽荣孙子奎
Owner SHANGHAI PASSION BIOTECHNOLOGY CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products