Copy number variation detection method and system, storage medium and computer equipment

A copy number variation detection method, together with an associated system, storage medium, and computer equipment. The technology addresses low sensitivity in copy number variation detection and inaccurate detection results, particularly on low-coverage data. It aims to improve detection accuracy and sensitivity, and the overall detection metrics, with a method that is simple and easy to implement.

Pending Publication Date: 2020-10-30
LIAOCHENG UNIV

AI Technical Summary

Problems solved by technology

When the proportion of normal cells in the sample is too high, existing methods give inaccurate detection results because the tumor-cell signal is weak and noise interferes.
[0006] Although existing techniques perform well in copy number variation detection, they generally have the following shortcomings: they depend too heavily on the raw data, which makes results inaccurate for low-coverage data; their thresholds are set manually, which lowers the sensitivity of the results; and they involve iterative or recursive processes, which leads to high computational complexity.
[0007] In summary, the problems and defects of the existing technology are low sensitivity to copy number variation, inaccurate copy number variation detection, and high computational complexity.

Examples

Detailed Description of the Embodiments

[0065] To make the object, technical solution, and advantages of the present invention clearer, the present invention is described in further detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit it.

[0066] In view of the problems existing in the prior art, the present invention provides a copy number variation detection method, system, storage medium, and computer equipment. The present invention will be described in detail below with reference to the accompanying drawings.

[0067] As shown in Figure 1, the copy number variation detection method provided by the present invention comprises the following steps:

[0068] S101: Extract the read depth value of each window from the BAM file, and perform GC correction on the data;

[0069] S102: Perform segmentation on the processed data;

[0070] S103: Using ...
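
The GC correction in S101 can be illustrated with a minimal sketch. The source does not give the correction formula, so the median-ratio scheme below is an assumption, as are the function name `gc_correct`, the parameter `bin_width`, and the premise that per-window read depths and GC fractions have already been extracted from the BAM file and the reference sequence.

```python
from collections import defaultdict
from statistics import median


def gc_correct(read_depths, gc_fractions, bin_width=0.01):
    """Scale each window's read depth by (global median / median of its GC bin).

    Assumed scheme, not taken from the patent: windows with similar GC
    content are grouped into bins of width `bin_width`.
    """
    if len(read_depths) != len(gc_fractions):
        raise ValueError("read_depths and gc_fractions must have equal length")

    # Group read depths by GC bin index.
    bins = defaultdict(list)
    for rd, gc in zip(read_depths, gc_fractions):
        bins[round(gc / bin_width)].append(rd)

    global_median = median(read_depths)
    bin_median = {b: median(values) for b, values in bins.items()}

    corrected = []
    for rd, gc in zip(read_depths, gc_fractions):
        m = bin_median[round(gc / bin_width)]
        # Leave the value unchanged if the bin median is zero.
        corrected.append(rd * global_median / m if m > 0 else rd)
    return corrected
```

With this scheme, two windows that sit at the median of their respective GC bins end up with the same corrected depth, which removes the GC-dependent offset before segmentation.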

Abstract

The invention belongs to the technical field of gene sequencing, and discloses a copy number variation detection method and system, a storage medium, and computer equipment. The method comprises the steps of: extracting the read depth value of each window from a BAM file and performing GC correction on the data; segmenting the processed data; processing the segments with a density peak detection algorithm; obtaining standardized two-dimensional data of density and distance, and screening out variation regions using these two characteristics of each segment; and, for each variation region obtained, determining whether the variation is a loss or a gain. The method is simple, easy to implement, and low in resource overhead. The final result can be obtained from a given BAM file and a reference sequence file: the data are processed rapidly by the density peak algorithm, Gaussian distribution statistics are then computed on the data, and variation regions are identified according to their probability density.
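
The final screening step, judging variation regions by probability density under a Gaussian model and labelling them as loss or gain, might be sketched as follows. This is an illustration only: the abstract does not specify the statistic or the cutoff, so the use of per-segment mean read depth, the function name `call_segments`, and the default `pdf_threshold` are all assumptions.

```python
import math
from statistics import mean, pstdev


def standard_normal_pdf(z):
    """Density of the standard normal distribution at z."""
    return math.exp(-0.5 * z * z) / math.sqrt(2 * math.pi)


def call_segments(segment_means, pdf_threshold=0.1):
    """Label each segment 'normal', 'gain', or 'loss'.

    Assumed rule, not taken from the patent: a segment whose standardized
    value has a standard-normal density below `pdf_threshold` is treated
    as a variation region; the sign of the deviation decides gain vs loss.
    """
    mu = mean(segment_means)
    sd = pstdev(segment_means)
    if sd == 0:
        return ["normal"] * len(segment_means)

    calls = []
    for x in segment_means:
        z = (x - mu) / sd
        if standard_normal_pdf(z) >= pdf_threshold:
            calls.append("normal")
        elif x > mu:
            calls.append("gain")
        else:
            calls.append("loss")
    return calls
```

In this sketch the mean and standard deviation are estimated from all segments, including the variant ones, so large events inflate the spread; a robust estimate could be substituted without changing the structure.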

Description

Technical field

[0001] The invention belongs to the technical field of gene sequencing, and in particular relates to a copy number variation detection method, system, storage medium and computer equipment.

Background technique

[0002] The density peak algorithm is a density-based algorithm that can quickly find global outliers. The algorithm rests on the following three concepts:

  • d_ij: the Euclidean distance between the i-th point and the j-th point.
  • ρ_i: the number of points whose distance to the i-th point is less than dc, where dc is a threshold.
  • σ_i: the minimum Euclidean distance from the i-th point to any point in the set of points with higher density than the i-th point; if the density of the i-th point is the largest, σ_i equals the largest Euclidean distance to the i-th point.

The main idea of the density peak algorithm: first calculate the density of each data point and the Euclidean distance between them, and fi...
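
The three quantities defined above (d_ij, ρ_i, σ_i) can be computed directly from their definitions. The following is a minimal sketch; the function name `density_peak_features` and the representation of points as tuples of floats are assumptions made for illustration, and the quadratic all-pairs distance matrix is chosen for clarity rather than speed.

```python
import math


def density_peak_features(points, dc):
    """Return (rho, sigma) for each point, following the definitions above.

    rho[i]   : number of other points closer than dc to point i.
    sigma[i] : minimum distance to any point with higher rho, or the
               largest distance to point i if no point has higher rho.
    """
    n = len(points)
    # d_ij: Euclidean distance between every pair of points.
    dist = [[math.dist(points[i], points[j]) for j in range(n)] for i in range(n)]

    rho = [sum(1 for j in range(n) if j != i and dist[i][j] < dc) for i in range(n)]

    sigma = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        sigma.append(min(higher) if higher else max(dist[i]))
    return rho, sigma
```

For the points `[(0.0,), (0.1,), (0.2,), (5.0,)]` with `dc = 0.15`, the isolated point at 5.0 gets a density of 0 and a large distance value, which is the low-density, high-distance signature of a global outlier.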

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G16B20/10, G16B20/30
CPC: G16B20/10, G16B20/30
Inventors: 赵海勇, 田野, 袁细国
Owner: LIAOCHENG UNIV