Method for analyzing microbial population function by using metagenome data

A metagenomic and microbial technology, applied in the field of microbial population function analysis using metagenomic data, can solve problems such as unreliability, increase the difficulty of integrated databases, and inability to splicing, and achieve improved utilization, excellent universality and information. comprehensive effect

Active Publication Date: 2018-11-13
BEIJING INST OF GENOMICS CHINESE ACAD OF SCI CHINA NAT CENT FOR BIOINFORMATION
View PDF5 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The sequencing and analysis of such a large sample size requires a lot of human and economic costs, which increases the difficulty of establishing such an integrated database.
[0009] Third, in terms of data utilization, the utilization rate of reads by conventional methods is too low to fully reflect the real state of the research object
Specifically, in the process of assembly and splicing of sequencing data, if there is no overlapping sequence between the reads, the splicing cannot be performed; if the splicing can be completed, if the resulting s

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for analyzing microbial population function by using metagenome data
  • Method for analyzing microbial population function by using metagenome data
  • Method for analyzing microbial population function by using metagenome data

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0063] Example 1 Construction of Microbial Metagenomic Reference Database

[0064] This embodiment takes NCBI and KEGG as the target biological information database. Those skilled in the art can choose other biological information databases in the field to use the method of the present invention to construct a microbial metagenomic reference database. These choices are all conventional choices based on the inventive concept of this application, and therefore do not exceed this application. The scope of protection.

[0065] 1. Download the whole genome sequence of all microorganisms from different sources and different types in NCBI. The data obtained is a .fna file, which includes the NC number, base sequence, gi number and corresponding species information of the whole genome sequence.

[0066] 2. Download the .gbk annotation files of all microorganisms in NCBI, and extract the classification information of related species from the .gbk annotation files based on the NC number and ...

Example Embodiment

[0073] Example 2 Method of using metagenomic data to analyze microbial population function

[0074] 1. Sequencing the metagenomics of the microbial population to be tested and quality control of the sequencing data: remove the bases with a sequencing quality value of less than 20, further remove sequences with reads less than 25bp in length, and remove reads from the host DNA to reduce Errors that may occur during sample extraction and sequencing, resulting in high-quality whole-genome sequencing data.

[0075] 2. Species abundance calculation: compare the microbial species data set in the microbial metagenomic reference database constructed in Example 1 of the high-quality reads obtained in step 1, and calculate the species abundance to obtain the data Abundance values ​​of all species. Based on the species abundance value, the difference analysis of various microbial compositions between different samples is carried out.

[0076] 3. Gene abundance calculation: compare the high-qu...

Example Embodiment

[0078] Example 3 Microbial population functional metagenomic analysis of poultry intestinal contents in a small sample amount

[0079] In this embodiment, the metagenomic data of the intestinal contents of individuals in the health and disease groups of a small sample of poultry were sequenced and analyzed. The experimental subjects used were 18 individual poultry individuals from the same industrial farm, divided into two groups with 9 samples in each group; among them, the disease test group was an individual who had been diagnosed with avian colibacillosis by a veterinarian. The group is healthy individuals.

[0080] Through microbial whole-genome shotgun sequencing, metagenomic sequencing was performed on the DNA extracted from the above 18 intestinal contents samples. Then, the method for analyzing the function of the microbial population using metagenomic data established in Example 2 of the present invention and the existing conventional method (the method introduced in the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for analyzing a microbial population function by using metagenome data. The method comprises the following steps: collecting all known microbial species, gene and function information, and integrating the information as a reference database; sequencing the metagenome of the to-be-tested microbial specifies, controlling the sequencing data quality, computing speciesabundance and gene abundance, analyzing the composition difference of microbes among different samples and gene level difference; annotating the gene function, clustering the genes with the same function to obtain a function module, performing adduction computation on related abundance of all non-redundancy genes in various function modules to obtain abundance values of all function modules, performing difference comparison analysis or overall evaluation on the functions of the to-be-tested sample microbes. Through the method provided by the invention, the step of respectively comparing the splicing data, the assembling data, the predicting data and the sequencing data with the single function database is saved, the time is saved, the utilization efficiency of the sequencing data is improved, and the method can be used for analyzing high-throughput microbe whole-genome sequencing data and screening the function microbe.

Description

technical field [0001] The present invention relates to the field of bioinformatics, in particular to a method for analyzing the functions of microbial populations using metagenomic data, which can save analysis steps and time and improve the utilization rate of sequencing data. Background technique [0002] With the continuous development of high-throughput sequencing technology, people have been able to explore complex biological functions from the genome level, which has given us a deeper understanding of the organism itself and disease-related research. More and more studies have found that there is a mutually beneficial balance between the microbial flora and the health of the host. Among them, the microbial flora can help the host ferment undigested food, participate in energy metabolism and nutrient absorption, provide the host with various trace elements, essential amino acids, and some antibacterial polypeptides, and decompose some toxins or harmful substances in th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/20G06F19/28G06F19/24C12Q1/689C12Q1/10C12Q1/06C12Q1/04
CPCC12Q1/689
Inventor 米双利邢志凯郭翀晔李蒙
Owner BEIJING INST OF GENOMICS CHINESE ACAD OF SCI CHINA NAT CENT FOR BIOINFORMATION
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products