A statistical test method for multiple-sample copy number consistent variation regions

A technology of copy number variation and statistical testing, which is applied in the field of statistical testing of multi-sample copy number consistent variation regions, can solve the problems of not being too large, difficult to obtain statistically significant detection results, and high computational complexity, to achieve The effect of accurate positioning

Active Publication Date: 2019-03-26
XIDIAN UNIV
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a statistical test method for the copy number consistent variation region of multiple samples, aiming at solving the problem that the statistical test method of the existing sample copy number consistent variation region is too dependent on the change in the number of sequencing reads, and it is difficult to obtain The detection effect of statistical significance; the sample size should not be too large, and the calculation complexity is high, which is not conducive to the detection of copy number consistent variation regions in multiple samples

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A statistical test method for multiple-sample copy number consistent variation regions
  • A statistical test method for multiple-sample copy number consistent variation regions
  • A statistical test method for multiple-sample copy number consistent variation regions

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0027] The application principle of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0028] Such as figure 1 As shown, the statistical testing method of the multi-sample copy number consistent variation region provided by the embodiment of the present invention includes the following steps:

[0029] S101: Calculate the number of reads at each site on the basis of the comparison of the sequencing data files (ie Fastq files); normalize the number of reads according to the average value of the number of reads in the samples, so as to obtain a signal of the number of read...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a statistical testing method of copy number consistency variation region in multiple samples. The method comprises the steps that a quasi synthetic curve is constructed based on the relation coefficients of the copy digital points, the derivative value of each digital point is calculated, significant derivative values are tested through the hypothesis test method, and a variable candidate region for the copy numbers is built from the determination of the breakpoint of the copy numbers; a hypothesized null distribution is constructed through the method of randomly replacing the CNVs on the two directions of the whole genome group and the samples, and the copy number consistency variable region in multiple samples is tested. According to the method, directly using the read number of sequence is avoided, certain sequence mistakes and noises can be accommodated, and a boundary of the variable region for the copy numbers can be relatively accurately located;based on the random replacements of the CNVs on the two directions of the whole genome group and the samples, compared to the replacements on a single direction, more real head distribution of the hypothesis test can be obtained by the method; meanwhile the test of the diversified consistency variable CNVs, namely, the variable region of the copy number consistency existed in multiple sample subclasses, is facilitated.

Description

technical field [0001] The invention belongs to the technical field of copy number variation, and in particular relates to a statistical testing method for the consistent variation region of copy number of multiple samples. Background technique [0002] Next-generation sequencing technology provides more comprehensive and richer genomic variation data, providing an important platform for in-depth understanding of life mechanisms and cancer cell development mechanisms. Copy number variation (CopyNumber Variation, CNV) is an important variation phenomenon in the genome, which is closely related to the occurrence and development of cancer. Therefore, the systematic analysis of CNV data on the next-generation sequencing platform provides an important way to discover cancer genes and study the molecular mechanism of cancer cells. Diverse CNV patterns were detected. Existing technical solutions: At present, experts at home and abroad have proposed different copy number variation...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G16B40/00G16B20/20G16B20/10
CPCG16B20/00G16B30/00G16B40/00
Inventor 袁细国李杰张军英杨利英高美虹
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products