Method for identifying SNP in individual in Sanger sequencing oriented to PCR products of diploid

A recognition method and diploid technology, applied in character and pattern recognition, biological neural network models, instruments, etc., can solve problems such as inability to analyze sequencing files and inappropriate sequencing.

Inactive Publication Date: 2014-02-19
SOUTH CHINA AGRI UNIV +1
View PDF2 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, none of these software can analyze a single sequencing file. For example, novoSNP and Mutation Surveyor require a reference sequence, which is not feasible when the reference gene sequence is sequenced and the measured sequence has introns (not in the gene sequence); PolyPhred5. 0 The comparison of more than 8 sequencing files is required to accurately interpret the SNP, which is not suitable for sequencing of a single or a small number of samples

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for identifying SNP in individual in Sanger sequencing oriented to PCR products of diploid
  • Method for identifying SNP in individual in Sanger sequencing oriented to PCR products of diploid
  • Method for identifying SNP in individual in Sanger sequencing oriented to PCR products of diploid

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] The present invention will be further described below in conjunction with specific examples.

[0069] The method for identifying SNPs in individuals in the Sanger sequencing of diploid PCR products described in this embodiment is as follows:

[0070] 1) Separate the fluorescence data of the four bases of adenine A, guanine G, cytosine C and thymine T from the chromatograms of Sanger sequencing of diploid PCR products; the original data are sequencers from Applied Biosystems The generated sequencing chromatogram file with the extension .ab1 complies with the ABIF file format. Refer to the "Applied Biosystems Genetic Analysis Data File Format" released by the company in September 2009 to obtain the directory for storing file information. The directory contains The name of the file, the data type of the element, the number of elements and other related attributes can be used to separate the fluorescence data of A, G, T and C separately through information such as the eleme...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for identifying SNP in an individual in Sanger sequencing oriented to PCR products of diploid. According to the method, firstly, fluorescent data of four bases of adenine A, guanine G, cytosine C and thymine T contained in a chromatogram map are independently separated, filtering and noise reduction processing is carried out on the separated fluorescent data by adopting a small wave multiscale analysis method; the waveform characteristics of the fluorescent data of the four bases are further analyzed, a first peak and a second peak of the waveform are detected, and the peak distance, the height specific value and the fluctuation degree specific value of the waveform characteristics are selected as the elements for judging SNP loca, a BP nerve net with the structure of 3-10-1 is selected as a classifier for the detection of the SNP loca, and training is carried out on the BP nerve net by adopting a Levenberg Marquardt algorithm; output is mapped as SNP evaluation scores from 0 to 100 by adopting piecewise linear transformation, the classification of the SNP loca is defined from a 1 level to a 5 level according to the evaluation scores, and the SNP confidence coefficient of the loca is judged according to the classification. The method for identifying the SNP in the individual in the Sanger sequencing oriented to the PCR products of the diploid can effectively detect the SNP loca in the individuals in sequencing files.

Description

technical field [0001] The invention belongs to the field of computer automatic recognition, relates to bioinformatics, pattern recognition, statistics, signal processing and computer software technology, and in particular relates to a method for targeting diploid polymerase chains when there is no reference sequence and only a few samples. A method for identifying single nucleotide polymorphisms (Single nucleotide polymorphisms, SNPs) within individuals in Sanger sequencing of Polymerase chain reaction (PCR) products. Background technique [0002] SNP refers to the variation (or polymorphism) caused by the substitution of a single nucleotide at the level of genetic material DNA. SNP has the characteristics of universality, representativeness, heredity, stability, etc., and reflects rich genetic information. The most common heritable variation, SNPs have also become widely used genetic markers. SNPs may lead to differences in individual phenotypes. For example, SNPs may be ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/46G06N3/02
Inventor 邓继忠甘四明黄华盛李梅于晓丽袁之报金济
Owner SOUTH CHINA AGRI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products