Analysis system for gene copy number variation

A gene copy number and variation analysis technology, applied in the field of genetic information data processing, can solve problems such as low accuracy, high false positives of copy number variation, and no exclusion, so as to improve accuracy, reduce false positives, improve accuracy and The effect of sensitivity

Inactive Publication Date: 2016-10-26
WANKANGYUAN TIANJIN GENE TECH CO LTD
View PDF3 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of the current copy number variation analysis software does not exclude the highly complex region near the centromere of the genome and the gap region of the reference genome, resulting in high false positives and low accuracy of detected copy number variation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Analysis system for gene copy number variation
  • Analysis system for gene copy number variation
  • Analysis system for gene copy number variation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] It should be noted that, in the case of no conflict, the embodiments of the present invention and the features in the embodiments can be combined with each other.

[0025] The present invention will be described in detail below with reference to the accompanying drawings and examples.

[0026] Such as figure 1 Shown is the operation flow of the software system of the present invention.

[0027] The system framework of the present invention is:

[0028] Operating platform: Windows, Linux

[0029] Programming language: Perl

[0030] Software dependencies: BWA, Samtools, SVG drawing package.

[0031] 1. The analysis module is equipped with the main program CNVAnalyzer.pl, which reads in the data index file and the reference genome (two necessary parameters), calls each auxiliary program, and completes the analysis of each part in turn.

[0032] 2. The segmentation module is equipped with cut_sam_bychr.pl program, which divides the sam file of the comparison result of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an analysis system for gene copy number variation. The system comprises an analysis module, a division module, a statistics module, a window computing module and a picture module, wherein the analysis module is used to read in an index document and a reference genome of data, and make comparison; the division module is used to divide a sam document of a comparison result of the whole genome according to chromosomes; the statistics module is used for statistics of a comparison result of comparison sequencing data; the window computing module is used to compute an average covering depth of each window on the genome with 1KB as the window, and results are given in the form of a list; the picture module is used to draw a chromosome covering depth picture according to computing results; and the analysis module is a major module which successively calls other modules to complete each part of analysis work. The system provided by the invention has the advantages that the copy number variation on a human genome level can be accurately analyzed by high-throughput sequencing data, and high-resolution pictures can be displayed; and the statistics can be carried out to data comparison information, so that data assessment becomes convenient.

Description

technical field [0001] The invention belongs to the field of gene information data processing, in particular to a gene copy number variation analysis system. Background technique [0002] Human gene copy number variation is one of the most important causes of various diseases, especially the occurrence and development of tumors. In recent years, with the rapid development of high-throughput sequencing technology, the cost of sequencing has been greatly reduced, and the analysis of gene copy number variation based on whole genome sequencing has increasingly become a routine method for clinical testing and scientific research. Accurate detection and analysis of gene copy number variation has also become one of the common problems faced by bioinformatics workers. [0003] High-throughput sequencing technology amplifies and sequences a random region with a length of 200-500bp on the genome, and uses the read length (reads) obtained by sequencing to characterize the copy number ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/22
CPCG16B30/00
Inventor 薛成海马飞张广发
Owner WANKANGYUAN TIANJIN GENE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products