Optimized overlapping hybrid sequencing method

A hybrid sequencing and sequencing technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as inability to determine sequencing costs

Inactive Publication Date: 2014-12-17
SOUTHEAST UNIV
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the existing overlapping sequencing methods still have the following problems: it is impossible to determine what kind of sequencing depth is needed to ensure accurate determination of positive samples and minimize the sequencing cost? How many mixing pools are needed? How are individual samples assigned to individual pools overlappingly? How to choose the best sequencing solution?

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Optimized overlapping hybrid sequencing method
  • Optimized overlapping hybrid sequencing method
  • Optimized overlapping hybrid sequencing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment example

[0090] Two mutation carriers were identified in 200 diploid samples, and the sequencing region was set to be 30Mb on the genome (Mb=Megabase, consistent with the total length of the human exon region). First set the sequencing error rate (p error ) is 0.01, and the mixed pool judgment error rate (α) allowed by the overlapping mixed sequencing design is 0.01, we calculate the optimal depth of mixed sequencing according to the sequencing depth model, and calculate the positive mixed pool threshold, when the sequencing fragments containing mutations When the number exceeds the threshold, the mixed pool is considered to be a positive mixed pool (Table 1). The number of diploid samples in the mixed pool is 1, which represents the individual sequencing depth required by the non-mixed sequencing strategy.

[0091] Table 1 Optimal sequencing depth and positive pool threshold for mixed sequencing

[0092]

[0093]

[0094]Then, the separation matrix is ​​used to construct the m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an optimized overlapping hybrid sequencing method, which includes the following steps: on the basis of the general law that sequencing depth follows negative binomial distribution and sequencing errors follow binomial distribution in the process of sequencing, a depth model of hybrid sequencing is put forward, moreover, the optimal depth of hybrid sequencing is calculated and designed on the basis of the model, and sequencing cost is effectively reduced by reducing redundant sequencing depth; a grouped overlapping hybrid sequencing method based on rare mutation distribution probabilities is put forward, and compared with direct sequencing, the grouping strategy can greatly reduce the demand of sequencing on data volume and increase the efficiency of hybrid sequencing; a sequencing cost model is established, and on the basis of the model, an optimal overlapping hybrid sequencing scheme is chosen to screen rare mutation carriers. The optimized overlapping hybrid sequencing method reduces the sequencing cost of screening rare mutation carriers to the max.

Description

technical field [0001] The invention belongs to the field of gene sequencing, in particular to an optimized overlapping hybrid sequencing method. Background technique [0002] Using high-throughput DNA sequencing technology to analyze the relationship between genetic mutations and human diseases is an important method in biomedical research, and screening and detection of rare DNA mutations is the focus of current research. In order to discover rare mutations in the human genome and explore the relationship between rare mutations and diseases, a large number of individual DNA samples need to be sequenced and analyzed. In order to improve sequencing efficiency and make full use of the sequencing capabilities of existing sequencing instruments, it is necessary to mix multiple samples together for simultaneous sequencing, that is, hybrid sequencing. [0003] The key to mixed sequencing is how to separate DNA sequencing fragments from different samples from the sequencing resul...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/22
Inventor 孙啸曹唱唱李成
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products