Single-gene or polygene copy number detection system and method based on next generation sequencing technology

A second-generation sequencing technology and copy number technology, applied in sequence analysis, biostatistics, instruments, etc., can solve the problems of strong batch effect and low detection accuracy, and achieve the effect of less computing resources and time consumption

Active Publication Date: 2019-06-14
上海序祯达生物科技有限公司
View PDF7 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The technical problem to be solved by the present invention: the present invention discloses a single-gene or multi-gene copy number detection system and method based on next-generation sequencing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Single-gene or polygene copy number detection system and method based on next generation sequencing technology
  • Single-gene or polygene copy number detection system and method based on next generation sequencing technology
  • Single-gene or polygene copy number detection system and method based on next generation sequencing technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0075] Example 1 Using the method of the present invention to detect single gene copy number variation

[0076] The LASSO linear regression model was trained using the data action training sets of 500 normal males and 500 normal females respectively. The positive samples to be tested include 6 samples with partial deletion or increase of DMD gene, 2 samples with partial deletion of SLC26A4 gene, and 1 sample with partial deletion of PTRRA gene. The detailed information of the sample copy number variation status is shown in Table 1, and the consistency of the 9 test samples reached 100%.

[0077] Table 1

[0078]

[0079]

[0080] Test results such as figure 2 as shown, figure 2 The abscissa in the middle indicates the division of the divided capture interval, the ordinate indicates log2Ratio, each dot indicates the log2Ratio value of an interval, the broken line on the dot indicates the log2Ratio value after correction, and the upper and lower dotted lines indicate ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a single-gene or polygene copy number detection system and method based on a next generation sequencing technology. The single-gene or polygene copy number detection system uses a machine learning algorithm based on a regularized linear regression model (LASSO) and shifting level models to infer copy number variation of single-gene or polygene exons, and includes successively connected sequence alignment module, repeat sequence removal module, calculation coverage depth module, standardized coverage depth module, regularized linear regression training module (LASSO linear regression training module), coverage depth prediction module, breakpoint detection and log2Ratio value correction module, and copy number state inference module. The single-gene or polygene copy number detection system and method based on a next generation sequencing technology utilize the machine learning method to train large-scale next-generation targeted capture sequencing data, and combine the shifting level models to reduce technical and biological errors caused by batch effects, thereby achieving better copy number detection accuracy and precision.

Description

technical field [0001] The present invention belongs to the field of biomedicine, in particular to a data analysis method related to copy number variation detection in the field of next-generation sequencing, and in particular to a single-gene or multi-gene copy number detection system based on next-generation sequencing technology; in addition, the present invention also It relates to a method for detecting the copy number of a single gene or multiple genes based on a next-generation sequencing technology. Background technique [0002] Copy number variation is responsible for the development of many genetic and rare diseases. For example, α-thalassemia, spinal muscular atrophy, Duchenne muscular dystrophy, etc., are all caused by abnormal copy number of exons on one or more genes. Therefore, the copy number detection of these genes plays a crucial role in the control of genetic diseases and the prevention of birth defects. Traditional detection methods such as qPCR (real-...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B30/10G16B40/20
Inventor 蔡秋娴旷婷孟鑫
Owner 上海序祯达生物科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products