Unlock instant, AI-driven research and patent intelligence for your innovation.

Toxin gene abundance detection method based on metagenomics and annotation database construction method

A technology of metagenomics and detection methods, applied in biochemical equipment and methods, combinatorial chemistry, chemical library, etc., can solve problems such as inability to measure strength, achieve convenient transportation, and improve the effect of treatment

Pending Publication Date: 2022-06-14
康美华大基因技术有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, method one can usually only identify and type a few common genes, while method two can theoretically annotate a large number of toxin genes, but it can only calculate the relative abundance and determine the existence of toxin factors. measure its strength

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Toxin gene abundance detection method based on metagenomics and annotation database construction method
  • Toxin gene abundance detection method based on metagenomics and annotation database construction method
  • Toxin gene abundance detection method based on metagenomics and annotation database construction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0038] see attached figure 1 , is a schematic flowchart of a metagenomics-based toxin gene abundance detection method provided in the embodiment of the present invention, the method may include the following steps:

[0039] S1. Sample collection and library building and sequencing to obtain metagenomic sequencing data of the sample to be tested.

[0040] In order to study the characteristics of intestinal endotoxin genes, 1005 volunteers were recruited, their stool samples were collected and quantitative metagenomic sequencing was performed. The amount of sequencing data for each sample was not lower than that of 5G sequencing data to obtain as many toxins as possible. genetic information.

[0041] In one implementation of the embodiment of the present invention, the fecal tissue of the user is collected and detected, the bacterial flora DNA in the fecal tissue is extracted and quality controlled, and a known amount of internal reference DNA is added to the bacterial flora DN...

Embodiment 2

[0060] Another aspect of the present invention provides a method for constructing and annotating a specific toxin factor annotation database,

[0061] Annotate the species origin and functional classification of each toxin gene sequence, integrate the annotation information into the toxin factor reference sequence database, and form the toxin factor annotation database, and then conduct KEGG annotation analysis on the toxin factor through the toxin factor annotation database, and the results are shown in the appendix figure 2 shown.

[0062] from figure 2 As can be seen in , genes for ubiquitous adhesion, antiphagocytic and secreted toxin factors were detected.

[0063] Based on the pre-established toxin factor annotation database, the detected toxin factors were further annotated, and the species associated with the toxin factors, the mechanism of action, etc. were explained. Some results of the final detection are shown in Table 1.

[0064] Table 1 Annotation results of ...

experiment example

[0067] data simulation

[0068] In order to study the performance of this method in detecting toxin genes, using the DNA sequences in the toxin factor reference sequence database, 10M simulated sequencing sequences were simulated using InSilicoSeq software, and the theoretical sequence number and relative abundance of each toxin gene were calculated.

[0069] Toxin gene abundance calculation

[0070] Using Bowtie2 to compare the simulated sequencing sequence to the reference sequence of the toxin gene, count the number of sequences on each toxin gene alignment, and use the RPKM method for further normalization to obtain the relative abundance of each toxin gene.

[0071] Algorithm performance analysis

[0072] In order to evaluate the accuracy and sensitivity of this calculation method, the theoretical relative abundance was compared with the real abundance, and the results are shown in the attached image 3 As shown, it can be seen from the figure that in the simulated data...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a toxin gene abundance detection method based on metagenomics and an annotation database construction method, and belongs to the technical field of biological information.The toxin gene abundance detection method comprises the following steps that S1, samples are collected, a database is built for sequencing, and metagenomic sequencing data of a to-be-detected sample are obtained; s2, screening the metagenome sequencing data to obtain a sequencing sequence; s3, comparing the sequencing sequence with a toxin gene reference database to obtain a comparison reads number of each gene; s4, standardizing the number of reads into relative abundance according to the gene length, and then calculating the absolute abundance of each toxin gene based on the copy number of the reference gene sequence; wherein the new toxin factor reference sequence database is formed by adding a known amount of reference gene sequences into the toxin factor reference sequence database. By adopting the detection method, the toxin factor gene related to the disease can be detected, and the real copy number of the gene in the sample and the real difference between sample groups can be reflected more accurately.

Description

technical field [0001] The invention belongs to the technical field of biological information, in particular to a method for detecting the abundance of toxin genes based on metagenomics, and also to a method for constructing an annotation database of toxin genes. Background technique [0002] Virulence factors (VFs) refer to molecules with virulence properties such as invasiveness and toxins produced by bacteria, viruses and fungi. Microorganisms infect the host mainly because the related pathogenic bacteria carry genes encoding toxin factors that can cause host cell damage, which can inhibit or evade the host's immune response, and then can enter and leave the host cells, and further rob the host of nutrients to achieve self-proliferation. purpose of growth. [0003] In frontier scientific research, common toxin factor databases include TADB (https: / / bioinfo-mml.sjtu.edu.cn / TADB / ), Tox-Prot (http: / / www.expasy.org / sprot / tox-prot) ), T3DB (http: / / www.t3db.ca / ) and VFDB (htt...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): C12Q1/6806C12Q1/6869C40B50/06G16B30/00G16B50/00
CPCC12Q1/6806C12Q1/6869C40B50/06G16B30/00G16B50/00
Inventor 罗文韩丽娟
Owner 康美华大基因技术有限公司