A taxon component calculation method for sequencing data

A sequencing data and sequencing technology, applied in the field of bioinformatics analysis, can solve the problems of high false-positive species results, low specificity, and impact on the accuracy of pathogen results.

Active Publication Date: 2021-03-16
SIMCERE DIAGNOSTICS CO LTD +2
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0015] However, the conventional analysis method of the above-mentioned sequencing data has the defect of high false positive results (low specificity) for species, which has a great impact on the accuracy of pathogen results.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A taxon component calculation method for sequencing data
  • A taxon component calculation method for sequencing data
  • A taxon component calculation method for sequencing data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0136] Embodiment 1 invention design

[0137] The present invention does not consider a) contamination of samples introduced during sampling, library building, and sequencing, and b) contamination of samples introduced by barcode splitting errors. Because the former is used as the pollution introduced by the experimental operation, the pollution investigation can be carried out by establishing negative control and other experimental methods in the operation, which is not within the scope of the present invention; the latter can be solved by selecting a barcode system with better distinguishing effect on the one hand. (not in the scope of discussion of the present invention), on the other hand, through some quantitative positive control experiments not in the scope of discussion of the present invention, the empirical value of the wrong introduction ratio can be obtained, and used for abundance screening to solve this false positive.

[0138] 1. During sequence alignment, a seq...

Embodiment 2

[0177] Embodiment 2 clinical experiment verification

[0178] The present invention collects 114 urine samples of urinary infection patients, conducts microbial culture and PCR detection on each sample, and judges whether there is a certain taxonomic unit in the sample based on the comprehensive results of microbial culture and PCR detection. Wherein 36 samples are used to calculate the subtaxon frequency threshold; the remaining 78 samples are used to calculate the performance of the taxon results of the conventional bioinformatics analysis method and the new method of the present invention, to illustrate that the new method is compared with the original conventional method The results of the improvement effect are as follows:

[0179] 1. The present invention takes 36 samples of urinary infection patients as a training set, and obtains the taxon identification results of the samples through a culture method. Taxon composition results for samples were obtained using conventi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for calculating taxa unit components of sequencing data. The present invention is based on the "subtaxon frequency of sequencing read sequence" index and its calculation framework, which is used to measure the misalignment of taxa in sequence alignment results, and can effectively remove false positive results in the calculation of taxon components. Improve the specificity and accuracy of component calculations. At the same time, the present invention also realizes the regression of misalignment sequences to real component results through the strategy of re-statistics after removing abnormal taxa, and effectively corrects the quantitative results of taxonomic unit abundance.

Description

technical field [0001] The invention relates to the field of bioinformatics analysis, in particular to a taxon component calculation method of sequencing data. [0002] technical background [0003] Infectious diseases are a class of diseases caused by pathogenic microorganisms. There are many types of infection sources and many patients, which have a major impact on public health in countries around the world. According to the World Health Organization, in 2016 as an example, lower respiratory tract infections alone caused about 3 million deaths worldwide. At the same time, the abuse of antibiotics caused by the blind treatment of infectious diseases is also becoming more and more serious. Accurate detection of infectious pathogens is the most important part of solving the above problems. [0004] The traditional method for detecting pathogens of infectious diseases is microbial culture, but the culture has the disadvantages of long detection time and low sensitivity. The...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G16B30/10G16B30/20G16B40/00
CPCG16B30/10G16B30/20G16B40/00
Inventor 梁忱胡龙吴苏生杨帆肖念清任用
Owner SIMCERE DIAGNOSTICS CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products