Nucleotide unit point variation detecting method based on neural network

A mutation detection and neural network technology, applied in the field of neural networks, can solve the problems of inaccurate detection position, low tumor purity, and reduced depth of mutation sites, and achieve the effects of low time complexity, accurate test results, and simple operation.

Inactive Publication Date: 2019-09-06
XIDIAN UNIV
View PDF2 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] 3. The method of detecting SNP, the idea of ​​detecting SNP method: judge whether the site is a SNP by comparing the genotype of a sample with the reference base, and then combine with public databases (such as dbSNPC) to screen out some public mutation sites; The disadvantage of this method is that the undisclosed database SNP will be mistaken for SNV, that is, we can only judge the existing results, but it is not helpful to detect new SNVs, and has little value for cancer research
[0009] (1) Existing methods for detecting SNV mutations on the genome are not accurate enough
[0010] (2) Existing methods for detecting SNV mutations on the genome have low accuracy under low tumor purity
[0011] (3) Existing methods for detecting SNV mutations on the genome rely only on one read on the comparison, which may easily cause the omission of mutation detection
[0013] (1) When the sequencing depth is insufficient, some positive mutations with mutation significance are likely to be filtered or covered
[0014] (2) Tumor cells may also have some gene mutations such as copy number variation and indel
[0015] (3) Due to the presence of non-tumor cells in the tumor body, the depth of the mutation site will be reduced

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Nucleotide unit point variation detecting method based on neural network
  • Nucleotide unit point variation detecting method based on neural network
  • Nucleotide unit point variation detecting method based on neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0035] For the existing methods for detecting SNV mutations on the genome, the detection position is not accurate enough; the accuracy rate is low under low tumor purity; relying on only one read for comparison is likely to cause the omission of mutation detection. The present invention uses the eigenvalues ​​set by genetic information; trains these eigenvalues ​​through a TensorFlow framework to obtain a data model, and detects and screens samples according to the trained model to obtain SNV.

[0036] The application principle of the present invention will be described in detail below in conjunction with the accompanying d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of a neural network, and discloses a nucleotide unit point variation detecting method based on a neural network. The method comprises the steps of performing comparison on original fastq data by means of bwa comparing software, and generating a sam file; converting the sam file to a binary-system-form bam file through samtools, performing sequencing onthe bam file and converting to a pileup-format file for finishing preprocessing of the original data; extracting 38 characteristic values which comprise sequencing depth, number of converted bases, base conversion frequency and base; for aiming at the data of the characteristic value, performing training and then storing a training model; detecting the same through the frame of the training modelfor obtaining the SNV. The method can settle a problem of low accuracy in detecting the SNV variation position by means of Fasd-somatic technology, a problem of leakage in detecting the SNV variationby means of somatic snipper method, and a problem of low detecting accuracy on the condition of low tumor purity in prior art.

Description

technical field [0001] The invention belongs to the technical field of neural networks, and in particular relates to a method for detecting single-site variation of nucleotides based on neural networks. Background technique [0002] Currently, the closest available technique: combined sample analysis to detect SNVs. This method is mostly based on the Bayesian model, considering the relationship between normal samples and their paired tumor samples and the purity of the tumor samples, and then setting the threshold to detect SNV, SomaticSniper and Fasd_somatic are both Bayesian models. change. However, the existing schemes also have certain defects. For example, more and more evidences show that the heterogeneity of tumors exists not only within the tumor, but also among tumors; when the sequencing depth is insufficient, some positive mutations with mutational significance are likely to be filtered out. Or it is covered; tumor cells may also have some gene mutations such as...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B20/20G16B30/00G16B50/30
CPCG16B20/20G16B30/00G16B50/30
Inventor 袁细国马超杨利英习佳宁张军英
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products