Unlock instant, AI-driven research and patent intelligence for your innovation.

Detection of significance of somatic copy number variation based on two-dimensional statistical model

A copy number variation and statistical model technology, applied in the field of significance detection of somatic copy number variation, can solve the problem of small number of samples

Active Publication Date: 2016-10-05
XIDIAN UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First, the difficulty of the problem itself: a) the number of sites is as high as 1.8 million and the number of samples is often small, forming a data pattern of high latitude and small samples; b) there is a strong correlation between SCNA sites, It is not independent, so that there is an interaction between the detection factors; c) the copy number amplification or deletion status includes two characteristics, namely the variation frequency and the variation magnitude, which requires a reasonable mechanism to balance these two characteristics; d) SCNA Structural patterns vary in length, which requires consideration of the different background distributions of SCNAs of different lengths
Second, the theories and methods for solving the problem are challenging: a) The scale of data is large, and it is a challenge to effectively control the complexity of computing time and space; b) How to fully consider the correlation between SCNA sites and reduce the significance of SCNA The conservatism of the estimation of the significance level is a difficult problem; c) How to establish a null hypothesis distribution that is consistent with the statistics and enhance the statistical significance of the estimation of the significance level is a key issue that has not yet been broken through

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Detection of significance of somatic copy number variation based on two-dimensional statistical model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] Please refer to figure 1 , a method for detecting the significance of somatic cell copy number variation based on a two-dimensional statistical model, characterized in that: it includes,

[0026] S1 collects SCNA data and preprocesses the SCNA data;

[0027] S2 calculates the relationship coefficient between SCNA adjacent sites, and divides the chromosome into multiple relatively independent SCNA structural units;

[0028] S3 calculates the statistics of each SCNA structural unit, and implements two-dimensional random permutation on the whole genome;

[0029] S4 Aiming at different lengths L of SCNA structural units, by calculating the statistics of SCNA patterns of arbitrary length L in the permutation sample, construct a zero distribution D based on L in two-dimensional space L ; Compare the statistics of the corresponding SCNA with D L For comparison, the statistics of the SCNA and the D L Recorded as p-value; if the p-value is less than the set threshold, the co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a somatic copy number alteration obviousness detection method based on a two-dimension statistic model. The somatic copy number alteration obviousness detection method based on the two-dimension statistic model comprises the steps that S1, SCNA data are acquired and pre-processed; S2, the coefficient of relationship between every two adjacent SCNA loca is worked out, and a chromosome is divided into a plurality of SCNA structural units which are independent relatively; S3, the statistic of each SCNA structural unit is worked out, and two-dimension random permutation is conducted on a whole genome; S4, according to different lengths L of the SCNA structural units, null distribution D[L] based on the L is established in a two-dimension space through calculation of the statistic of any SCNA structural unit with the length L in a permutation sample; the statistic of the corresponding SCNA is compared with the D[L], and the statistic of the corresponding SCNA and the D[L] are marked as a value p; if the value p is smaller than a set threshold value, the corresponding SCNA is obvious, and potential cancer exists.

Description

technical field [0001] The invention discloses a method for detecting the significance of somatic cell copy number variation based on a two-dimensional statistical model. Background technique [0002] Somatic copy number alteration (SCNA) is an important phenomenon in cancer genomes. It is mainly manifested in two states of copy number amplification and deletion, and is closely related to the occurrence and development of cancer cells. Therefore, the systematic analysis of SCNA provides an important way to study the pathogenic mechanism of cancer at the molecular level. The bottom and core problem is how to distinguish the SCNA pattern with cancer function from the randomly occurring SCNA. [0003] Numerous studies have shown that SCNA functional patterns are often hidden in the consistent variation regions of cancer genome samples, then a calculation method based on statistical theory is established to detect the recurrent (Recurrent) significance level of SCNA in multiple...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F19/18
Inventor 袁细国张军英杨利英张胜利
Owner XIDIAN UNIV