Analysis method and device for detecting point mutations based on third-generation sequencing data

A technology for sequencing data and analysis methods, applied in sequence analysis, genomics, instruments, etc., can solve problems such as long running time, limited sequencing quality data distribution, and inability to meet point mutation detection well

Active Publication Date: 2022-03-22
QITAN TECH LTD CHENGDU
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The PCR method has the characteristics of high sensitivity, and the technology is mature, but each pair of primers can only detect one mutation, cannot detect too many samples and sites at the same time, and the throughput is low
The cost of Sanger sequencing is low, but the amount of sample required is large, and the detection sensitivity for low-frequency mutations is low
Next-generation sequencing has the characteristics of high throughput, and the cost of sequencing is also decreasing year by year. However, the current methods and tools commonly used to detect point mutations are not high in specificity (such as Varscan) and low in sensitivity to low-frequency detection (such as Mutect). Or the use of partial assembly steps leads to too long running time (such as Mutect2), which cannot well meet the needs of point mutation detection
Although there are some methods for detecting point mutations developed based on three-generation sequencing data at this stage (as mentioned above), their respective shortcomings are also very obvious. The most important ones are limited by the quality of sequencing and the comparison algorithm or deep learning training set they rely on. The data distribution, etc., and the applicable scenarios are not wide enough, and the robustness (robust) is not enough

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Analysis method and device for detecting point mutations based on third-generation sequencing data
  • Analysis method and device for detecting point mutations based on third-generation sequencing data
  • Analysis method and device for detecting point mutations based on third-generation sequencing data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0092] Example 1 Analysis of data using the method of the present invention

[0093] 1. will contain BRAF-V600E, EGFR-L858R, EGFR-T790M, KRAS-G13D as well as AKT1-E17K The standard sample and the standard sample of the negative control sample NA12878 were prepared by the experimental library and repeated three times. The nanopore sequencer of QNome-9604 was used for sequencing, and 6 original long-read sequencing data were obtained, including HUM964, HUM965 and HUM966 For positive control data, HUM967, HUM968 and HUM969 for negative control data.

[0094] 2. Extract 9 short sequences with a fixed length of 101 bp on the genome for the 5 target sites to be detected in step 1 according to their positions, wherein the positions of the target sites on the extracted short sequences are respectively fixed on the first 11 bases, 21 bases, 31 bases, 41 bases, 51 bases, 61 bases, 71 bases, 81 bases and 91 bases base (ie D=10bp), to obtain the final set of 9 short-sequence fragme...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an analysis method and device for detecting point mutations based on three-generation sequencing data. The analysis method of the present invention includes: 1) extracting the first sequence subset containing the point mutation to be detected; 2) extracting the seed sequence from the first sequence subset to obtain the second sequence subset; 3) obtaining the original sequence subset with expected quality Data set; 4) Use the seed sequence pairs of the second sequence subset to obtain N data sets containing the target sequence; 5) Perform point mutation detection and analysis on the N data sets containing the target sequence; 6) Analyze the N detection results The result of each point mutation in is assigned a weight W; 7) Calculate the point mutation result and its frequency according to the formula. The invention also provides a device for detecting point mutations based on three-generation sequencing data. Using the method of the present invention not only effectively avoids the problem of false negatives caused by low alignment rates caused by random indels or high sequencing errors from the perspective of data characteristics, but also can more effectively control the results of false positives.

Description

technical field [0001] The invention belongs to the field of sequencing technology and biological information technology analysis of sequencing data, in particular to a method for detecting point mutation based on third-generation sequencing data, and also relates to a device and system for detecting point mutation based on third-generation sequencing data. Background technique [0002] A point mutation is when only one base pair is changed. A general point mutation can be a base substitution, a single base insertion or a base deletion; a narrow point mutation is also called a single base substitution. Base substitutions are divided into two categories: transitions and transversions. At present, common methods for detecting gene point mutations include PCR, Sanger sequencing (first-generation sequencing) and second-generation sequencing. PCR method has the characteristics of high sensitivity, and the technology is mature, but each pair of primers can only detect one mutati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G16B30/10G16B20/50
CPCG16B30/10G16B20/50
Inventor 郎继东孙继国
Owner QITAN TECH LTD CHENGDU
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products